Maneesh Gupta
f0618df8f4
Fix signature for unsigned long long variant of atomicExch
...
This fixes #888
Change-Id: I2b2dc5ed1ff2a26083ad9d102eccac90f67801ae
2019-02-13 09:45:07 +05:30
Maneesh Gupta
ca7a53b13d
Merge pull request #913 from aaronenyeshi/fix-texture-constant-addrspace
...
Fix texture using constant addrspace 4
2019-02-13 08:59:01 +05:30
Maneesh Gupta
f9daeb1c51
Merge pull request #912 from ROCm-Developer-Tools/remove-cpp-function-templates-for-malloc
...
Remove C++ function templates for hipMalloc and hipHostMalloc
2019-02-13 08:58:26 +05:30
Aaron Enye Shi
63b400beea
Fix texture using constant addrspace 4
2019-02-12 16:56:22 +00:00
Wen-Heng (Jack) Chung
eba3bd3082
Introduce __HIP_DISABLE_CPP_FUNCTIONS__ macro to disable C++ wrappers
...
hipMalloc and hipHostMalloc are C++ function templates by default. In some
applications they need to discover the decltype of underlying C functions. In
this commit, a new macro __HIP_DISABLE_CPP_FUNCTIONS__ is introduced to
suppress the C++ function templates.
Existing HIP applications don't need to be changed. hipMalloc and hipHostMalloc
will behave just the same.
For new applications which depends on identifying decltypes of underlying C
functions, __HIP_DISABLE_CPP_FUNCTIONS__ shall be defined prior to including
HIP headers.
2019-02-11 09:33:15 -06:00
Maneesh Gupta
87673d0750
Merge pull request #907 from yxsamliu/nan
...
Fix nan() for windows
2019-02-11 11:24:12 +05:30
Yaxun Sam Liu
ee4ce7ae49
Fix nan() for windows
2019-02-07 17:29:36 -05:00
Yaxun Sam Liu
a474dc7579
Fix hipcc and hip_vector_types.h for windows
...
Disable linux specific compilation and linking options for windows.
In hip_vector_types.h, hip-clang needs HIP's own implementation for vector types even
on windows. MSVC specific implementation is for building runtime itself, which is compiled
by MSVC.
2019-02-07 11:07:50 -05:00
Yaxun Sam Liu
43c144fe68
Fix build failure on windows due to vector type
2019-02-05 12:22:32 -05:00
Evgeny
cc5ff70b90
return false if PROF_API is disabled
2019-01-29 23:14:57 -06:00
Maneesh Gupta
50a12f1ce1
Merge pull request #863 from aaronenyeshi/add-dot-instructions
...
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Aaron Enye Shi
ede494bd5f
Fix mixed dot for Jenkins rocm_2_0
2019-01-24 22:48:35 +00:00
Aaron Enye Shi
712a4cbdc2
Use correct OCKL native vector types
2019-01-22 22:31:19 +00:00
Aaron Enye Shi
8bf588bdb1
Allow hip-clang to use amd_mixed_dot as well
2019-01-22 16:21:16 +00:00
Evgeny
0164464bcc
fixing HSA_INIT_API cid args
2019-01-16 23:45:44 -06:00
Maneesh Gupta
dd9bea7d25
Map more instrincis to ocml native
...
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Aaron Enye Shi
22f7570774
Add backwards compatibility to dot funcs
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
f0b114ee1c
Move fp16 dot functions to hip_fp16
2019-01-14 21:50:32 +00:00
Maneesh Gupta
2b9d52b210
Switch to faster ocml variants for trig instrinsics
...
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi
aae1f4f96e
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti
c9f552968d
Fix compilation issues with MSVC
2019-01-09 09:32:50 -08:00
Alex Voicu
ec14daa7ce
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
2019-01-01 11:01:16 +05:30
Alex Voicu
25c7e5d609
Start re-working 731 for 2.0.
2019-01-01 11:00:57 +05:30
Maneesh Gupta
4d0266a495
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
548f4dd4e5
Use __dparm for default parameter
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
ec03a8b352
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
58dfeff27a
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan
1fbf639962
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit fe1e963299 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
aeca2c8cdc
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit c0bd1a5af8 .
2018-11-29 11:38:34 -05:00
Maneesh Gupta
160c509e23
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Michael Kuron
999b62b711
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
2018-11-15 09:48:00 +01:00
Maneesh Gupta
0770c2b8da
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root
091124a766
rebase to master, tracer related changes
2018-11-13 15:50:34 +00:00
Evgeny
32c0008ef6
specialized spawner object
2018-11-13 15:33:26 +00:00
Evgeny
e5ba097afd
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
47f1d059d1
hip_prof_(api/str).h moving under hcc_detail dir
2018-11-13 15:33:26 +00:00
Evgeny
5d6a6a5c6c
compilation fix - struct name fix
2018-11-13 15:33:26 +00:00
Evgeny
ec989ffd96
fix
2018-11-13 15:33:26 +00:00
Evgeny
b8b1637ef7
adding activity prof layer
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
17ac81b69e
Define __hip_device_heap in header for hip-clang only
2018-11-12 16:39:41 -05:00
Alex Voicu
c0bd1a5af8
Handle (odd) corner case of argumentless __global__ function.
2018-11-12 00:32:35 +00:00
Alex Voicu
aa6815cdc8
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
0cfaa52d15
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
e59c33250a
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
31acf1c268
Introduce ihipModuleGetGlobal
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
2b108a2cfd
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron
4390c82121
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
2018-11-04 11:47:17 +01:00
Michael Kuron
73616582d6
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-04 10:39:34 +01:00
Alex Voicu
f28cde3f98
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-01 23:17:27 +00:00