Alex Voicu
ea0fcf3e61
dlopen() fixes ( #929 )
...
* Initial attempt to switch over to internally linked state.
* Add missing CMake update.
* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.
* Ensure global retrieval uses internally linked state.
* Hide HC in the implementation. Minimise ADL woes.
* Strange software exists, and must be catered to.
* Use a less spammy mechanism for ensuring internal linkage / non-export.
* Remove leftover internal detail.
2019-03-06 17:31:44 +05:30
Maneesh Gupta
8db717c769
Update hip_runtime_api.h
...
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung
5cbd28f29b
Address code review comments to use hipDeviceptr_t
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung
0337b686ef
Add implementation for NVCC path
2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung
7ebbbd3525
Add hipMemsetD32 and hipMemsetD32Async
...
Add 2 extra memset functions which fill memory with integer-typed data.
Also change the parameters of ihipMemset to better explain the semantics.
2019-03-04 17:00:33 +00:00
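A minimal host-only sketch of what "filling memory with integer-typed data" means, as opposed to a byte-wise memset. The name `demoMemsetD32` is hypothetical and is not the HIP function; the assumption that `count` is a number of 32-bit elements rather than bytes is modeled on the CUDA driver API's `cuMemsetD32` and is not stated in the commit.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical host-side stand-in: writes `count` 32-bit copies of `value`
// starting at `dst`. Assumption: `count` counts elements, not bytes.
inline void demoMemsetD32(void* dst, std::int32_t value, std::size_t count) {
    assert(dst != nullptr || count == 0);
    unsigned char* out = static_cast<unsigned char*>(dst);
    for (std::size_t i = 0; i < count; ++i) {
        // memcpy avoids alignment and aliasing assumptions about `dst`.
        std::memcpy(out + i * sizeof(value), &value, sizeof(value));
    }
}

// Returns true if the 32-bit fill produced the full pattern in every word,
// and a byte-wise memset produced only a repeated single byte.
inline bool demoFillCheck() {
    std::vector<std::int32_t> words(4, 0);
    demoMemsetD32(words.data(), 0x01020304, words.size());
    for (std::int32_t w : words) {
        if (w != 0x01020304) return false;
    }

    std::vector<std::int32_t> bytes(4, 0);
    std::memset(bytes.data(), 0x04, bytes.size() * sizeof(std::int32_t));
    return bytes[0] == 0x04040404;
}
```

The byte-wise variant can only repeat one byte value, which is why a separate 32-bit entry point is useful for patterns such as `0x01020304`.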
Maneesh Gupta
07b0e9574c
Merge pull request #937 from yxsamliu/nan2
...
Fix nan for windows
2019-02-28 07:14:27 +05:30
Yaxun Sam Liu
278007218a
Fix nan for windows
2019-02-27 12:33:26 -05:00
Rahul Garg
55145feae6
Fix forceinline for non HCC compilation
2019-02-26 07:50:09 +05:30
Maneesh Gupta
f0618df8f4
Fix signature for unsigned long long variant of atomicExch
...
This fixes #888
Change-Id: I2b2dc5ed1ff2a26083ad9d102eccac90f67801ae
2019-02-13 09:45:07 +05:30
Maneesh Gupta
ca7a53b13d
Merge pull request #913 from aaronenyeshi/fix-texture-constant-addrspace
...
Fix texture using constant addrspace 4
2019-02-13 08:59:01 +05:30
Maneesh Gupta
f9daeb1c51
Merge pull request #912 from ROCm-Developer-Tools/remove-cpp-function-templates-for-malloc
...
Remove C++ function templates for hipMalloc and hipHostMalloc
2019-02-13 08:58:26 +05:30
Aaron Enye Shi
63b400beea
Fix texture using constant addrspace 4
2019-02-12 16:56:22 +00:00
Wen-Heng (Jack) Chung
eba3bd3082
Introduce __HIP_DISABLE_CPP_FUNCTIONS__ macro to disable C++ wrappers
...
hipMalloc and hipHostMalloc are C++ function templates by default. Some
applications need to discover the decltype of the underlying C functions. In
this commit, a new macro __HIP_DISABLE_CPP_FUNCTIONS__ is introduced to
suppress the C++ function templates.
Existing HIP applications don't need to be changed. hipMalloc and hipHostMalloc
will behave just the same.
New applications which depend on identifying the decltypes of the underlying C
functions shall define __HIP_DISABLE_CPP_FUNCTIONS__ prior to including
HIP headers.
2019-02-11 09:33:15 -06:00
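A minimal sketch of why a suppressing macro helps, using hypothetical stand-ins (`demoMalloc`, `DEMO_DISABLE_CPP_FUNCTIONS`) rather than the real HIP headers. When a plain function and a function template share a name, `decltype(&name)` is ambiguous; compiling the template out leaves a single function whose type can be named.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <type_traits>

// Hypothetical analogue of defining __HIP_DISABLE_CPP_FUNCTIONS__ before
// including the headers. Remove this line to re-enable the template below.
#define DEMO_DISABLE_CPP_FUNCTIONS 1

// Stand-in for the underlying C-style allocation function (not a HIP API).
inline int demoMalloc(void** ptr, std::size_t size) {
    *ptr = std::malloc(size);
    return *ptr != nullptr ? 0 : 1;
}

#if !defined(DEMO_DISABLE_CPP_FUNCTIONS)
// Typed convenience overload. With this visible, `&demoMalloc` names an
// overload set and decltype(&demoMalloc) no longer compiles.
template <class T>
inline int demoMalloc(T** ptr, std::size_t size) {
    return demoMalloc(reinterpret_cast<void**>(ptr), size);
}
#endif

// With the template suppressed, the function type is discoverable.
using DemoMallocFn = decltype(&demoMalloc);
static_assert(
    std::is_same<DemoMallocFn, int (*)(void**, std::size_t)>::value,
    "expected the plain function signature");

// Calls the function through the discovered pointer type.
inline bool demoRoundTrip() {
    void* p = nullptr;
    DemoMallocFn fn = &demoMalloc;
    const int status = fn(&p, 16);
    const bool ok = (status == 0) && (p != nullptr);
    std::free(p);
    return ok;
}
```

Code that wraps or intercepts the allocation functions by pointer type is the kind of caller that would define the macro; ordinary callers can leave the template enabled.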
Maneesh Gupta
87673d0750
Merge pull request #907 from yxsamliu/nan
...
Fix nan() for windows
2019-02-11 11:24:12 +05:30
Yaxun Sam Liu
ee4ce7ae49
Fix nan() for windows
2019-02-07 17:29:36 -05:00
Yaxun Sam Liu
a474dc7579
Fix hipcc and hip_vector_types.h for windows
...
Disable Linux-specific compilation and linking options for Windows.
In hip_vector_types.h, hip-clang needs HIP's own implementation of vector types even
on Windows. The MSVC-specific implementation is for building the runtime itself, which is compiled
by MSVC.
2019-02-07 11:07:50 -05:00
Yaxun Sam Liu
43c144fe68
Fix build failure on windows due to vector type
2019-02-05 12:22:32 -05:00
Evgeny
cc5ff70b90
return false if PROF_API is disabled
2019-01-29 23:14:57 -06:00
Maneesh Gupta
50a12f1ce1
Merge pull request #863 from aaronenyeshi/add-dot-instructions
...
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Aaron Enye Shi
ede494bd5f
Fix mixed dot for Jenkins rocm_2_0
2019-01-24 22:48:35 +00:00
Aaron Enye Shi
712a4cbdc2
Use correct OCKL native vector types
2019-01-22 22:31:19 +00:00
Aaron Enye Shi
8bf588bdb1
Allow hip-clang to use amd_mixed_dot as well
2019-01-22 16:21:16 +00:00
Evgeny
0164464bcc
fixing HSA_INIT_API cid args
2019-01-16 23:45:44 -06:00
Maneesh Gupta
dd9bea7d25
Map more intrinsics to ocml native
...
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Aaron Enye Shi
22f7570774
Add backwards compatibility to dot funcs
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
f0b114ee1c
Move fp16 dot functions to hip_fp16
2019-01-14 21:50:32 +00:00
Maneesh Gupta
2b9d52b210
Switch to faster ocml variants for trig intrinsics
...
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi
aae1f4f96e
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti
c9f552968d
Fix compilation issues with MSVC
2019-01-09 09:32:50 -08:00
Alex Voicu
ec14daa7ce
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
2019-01-01 11:01:16 +05:30
Alex Voicu
25c7e5d609
Start re-working 731 for 2.0.
2019-01-01 11:00:57 +05:30
Maneesh Gupta
4d0266a495
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
548f4dd4e5
Use __dparm for default parameter
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
ec03a8b352
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
58dfeff27a
Add *_rn functions back into HIP intrinsics
...
Add the round-to-nearest-even intrinsics back to the HIP math
intrinsics, as their removal caused a regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan
1fbf639962
Revert "Rely on code object metadata for kernarg arguments alignof and sizeof."
...
This reverts commit fe1e963299 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
aeca2c8cdc
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit c0bd1a5af8 .
2018-11-29 11:38:34 -05:00
Maneesh Gupta
160c509e23
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Michael Kuron
999b62b711
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
2018-11-15 09:48:00 +01:00
Maneesh Gupta
0770c2b8da
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root
091124a766
rebase to master, tracer related changes
2018-11-13 15:50:34 +00:00
Evgeny
32c0008ef6
specialized spawner object
2018-11-13 15:33:26 +00:00
Evgeny
e5ba097afd
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
47f1d059d1
hip_prof_(api/str).h moving under hcc_detail dir
2018-11-13 15:33:26 +00:00
Evgeny
5d6a6a5c6c
compilation fix - struct name fix
2018-11-13 15:33:26 +00:00
Evgeny
ec989ffd96
fix
2018-11-13 15:33:26 +00:00
Evgeny
b8b1637ef7
adding activity prof layer
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
17ac81b69e
Define __hip_device_heap in header for hip-clang only
2018-11-12 16:39:41 -05:00
Alex Voicu
c0bd1a5af8
Handle (odd) corner case of argumentless __global__ function.
2018-11-12 00:32:35 +00:00