Maneesh Gupta
fdbf84558c
Fix signature for unsigned long long variant of atomicExch
...
This fixes #888
Change-Id: I2b2dc5ed1ff2a26083ad9d102eccac90f67801ae
2019-02-13 09:45:07 +05:30
Maneesh Gupta
f1f9cc0d55
Merge pull request #913 from aaronenyeshi/fix-texture-constant-addrspace
...
Fix texture using constant addrspace 4
2019-02-13 08:59:01 +05:30
Maneesh Gupta
61d9553103
Merge pull request #912 from ROCm-Developer-Tools/remove-cpp-function-templates-for-malloc
...
Remove C++ function templates for hipMalloc and hipHostMalloc
2019-02-13 08:58:26 +05:30
Aaron Enye Shi
eec46be762
Fix texture using constant addrspace 4
2019-02-12 16:56:22 +00:00
Wen-Heng (Jack) Chung
f4e1f74e9e
Introduce __HIP_DISABLE_CPP_FUNCTIONS__ macro to disable C++ wrappers
...
hipMalloc and hipHostMalloc are C++ function templates by default. In some
applications they need to discover the decltype of underlying C functions. In
this commit, a new macro __HIP_DISABLE_CPP_FUNCTIONS__ is introduced to
suppress the C++ function templates.
Existing HIP applications don't need to be changed. hipMalloc and hipHostMalloc
will behave just the same.
For new applications which depends on identifying decltypes of underlying C
functions, __HIP_DISABLE_CPP_FUNCTIONS__ shall be defined prior to including
HIP headers.
2019-02-11 09:33:15 -06:00
Maneesh Gupta
3112c52742
Merge pull request #907 from yxsamliu/nan
...
Fix nan() for windows
2019-02-11 11:24:12 +05:30
Yaxun Sam Liu
fe5861770e
Fix nan() for windows
2019-02-07 17:29:36 -05:00
Yaxun Sam Liu
d0fc2d0f99
Fix hipcc and hip_vector_types.h for windows
...
Disable linux specific compilation and linking options for windows.
In hip_vector_types.h, hip-clang needs HIP's own implementation for vector types even
on windows. MSVC specific implementation is for building runtime itself, which is compiled
by MSVC.
2019-02-07 11:07:50 -05:00
Yaxun Sam Liu
6d5165ef62
Fix build failure on windows due to vector type
2019-02-05 12:22:32 -05:00
Evgeny
a32c3c7384
return false if PROF_API is disabled
2019-01-29 23:14:57 -06:00
Maneesh Gupta
95fe4b770f
Merge pull request #863 from aaronenyeshi/add-dot-instructions
...
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Aaron Enye Shi
2ab6fd374c
Fix mixed dot for Jenkins rocm_2_0
2019-01-24 22:48:35 +00:00
Aaron Enye Shi
b63597bb95
Use correct OCKL native vector types
2019-01-22 22:31:19 +00:00
Aaron Enye Shi
b0e0ec3c67
Allow hip-clang to use amd_mixed_dot as well
2019-01-22 16:21:16 +00:00
Evgeny
47625cb8fd
fixing HSA_INIT_API cid args
2019-01-16 23:45:44 -06:00
Maneesh Gupta
f74816efe6
Map more instrincis to ocml native
...
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Aaron Enye Shi
bdb84f3d9c
Add backwards compatibility to dot funcs
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
7ba7d9a0f6
Move fp16 dot functions to hip_fp16
2019-01-14 21:50:32 +00:00
Maneesh Gupta
54e4463d16
Switch to faster ocml variants for trig instrinsics
...
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi
84a028b9e1
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti
167bbffc4f
Fix compilation issues with MSVC
2019-01-09 09:32:50 -08:00
Alex Voicu
3470692998
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
2019-01-01 11:01:16 +05:30
Alex Voicu
3e8f198b58
Start re-working 731 for 2.0.
2019-01-01 11:00:57 +05:30
Maneesh Gupta
aa932d542a
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
57d1f6e6cb
Use __dparm for default parameter
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
02deeadaa4
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
2d50392c17
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan
4251a70bc0
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit 299b9d1382 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
30a9be5643
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit 927b06f8d4 .
2018-11-29 11:38:34 -05:00
Maneesh Gupta
99bb89b756
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron
e9b88711e2
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Michael Kuron
f71b8cbc7d
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
2018-11-15 09:48:00 +01:00
Maneesh Gupta
0c72d32b44
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root
0774b275c8
rebase to master, tracer related changes
2018-11-13 15:50:34 +00:00
Evgeny
1e49cff87a
specialized spawner object
2018-11-13 15:33:26 +00:00
Evgeny
e362688adf
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
8c88932f30
hip_prof_(api/str).h moving under hcc_detail dir
2018-11-13 15:33:26 +00:00
Evgeny
d35567f40a
compilation fix - struct name fix
2018-11-13 15:33:26 +00:00
Evgeny
2f2c955ae0
fix
2018-11-13 15:33:26 +00:00
Evgeny
084a68be63
adding activity prof layer
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
c44f2795d5
Define __hip_device_heap in header for hip-clang only
2018-11-12 16:39:41 -05:00
Alex Voicu
927b06f8d4
Handle (odd) corner case of argumentless __global__ function.
2018-11-12 00:32:35 +00:00
Alex Voicu
392e2ba1ba
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
7b3bbc85c5
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
9aa92238ab
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
4da2d92281
Introduce ihipModuleGetGlobal
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
cef6e8ef1f
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron
8049cdafab
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
2018-11-04 11:47:17 +01:00
Michael Kuron
0b6f5791f8
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-04 10:39:34 +01:00
Alex Voicu
1c525d7222
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-01 23:17:27 +00:00