Grafik Komit

3360 Melakukan

Penulis SHA1 Pesan Tanggal
Alex Voicu ea0fcf3e61 dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.
2019-03-06 17:31:44 +05:30
Maneesh Gupta 2d67bc5e9c Merge pull request #936 from mangupta/swdev-174923
[hipconfig] Update HIP_PLATFORM detection logic
2019-03-06 06:08:11 +05:30
Evgeny Mankov 23de66bc7e Merge pull request #948 from emankov/master
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
2019-03-05 18:18:39 +03:00
Evgeny Mankov a87d798d90 [HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))

based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933
2019-03-05 18:13:18 +03:00
Maneesh Gupta 2ed31e9f6c Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
Add HIP memset APIs to cope with non-zero initial values of integer types
2019-03-05 14:31:38 +05:30
Maneesh Gupta 51f29b9cee Update hipMemset.cpp
Address build issues on nvcc path.
2019-03-05 12:11:11 +05:30
Maneesh Gupta 8db717c769 Update hip_runtime_api.h
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung 5cbd28f29b Address code review comments to use hipDeviceptr_t 2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung 0337b686ef Add implementation for NVCC path 2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung bfde8a7fab Add direct test for hipMemsetD32 and hipMemsetD32Async 2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung 7ebbbd3525 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fills memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantic
2019-03-04 17:00:33 +00:00
Maneesh Gupta de4a9b8446 Merge pull request #939 from gargrahul/update_hipmemset_test
[dtest] Update hipMemset test
2019-03-03 20:29:55 +05:30
Rahul Garg 131c0ea326 Fix review comments 2019-03-02 23:38:37 +05:30
Maneesh Gupta d28c13ed69 Merge pull request #945 from wkwchau/hipMemset3D_fix
Fix hipMemset3D test
2019-03-01 21:18:12 +05:30
Maneesh Gupta b051d44d24 Merge pull request #942 from yxsamliu/v3
revert hipcc changes about code object v3
2019-03-01 21:17:10 +05:30
Wilkin Chau 8d92d1ebd7 Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.
2019-02-28 22:42:46 +00:00
Rahul Garg 77c6d5451f Fix hipMemset test for HIP/NVCC 2019-03-01 03:46:57 +05:30
Yaxun Sam Liu e8556388f9 Revert "hipcc should consume -mcode-object-v3 flag"
This reverts commit eb33507774.
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu 1bd8d24278 Revert "Change code-object flag to only HIP-Clang"
This reverts commit 1c6365fbe7.
2019-02-28 11:20:04 -05:00
Yaxun Sam Liu 737972d31c Revert "Consume the code obj args to prevent duplicates"
This reverts commit 2cd34f3c01.
2019-02-28 11:19:35 -05:00
Maneesh Gupta b93666bdde Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
Fix hipBusBW sample for P2P bidirectional test
2019-02-28 07:14:38 +05:30
Maneesh Gupta 07b0e9574c Merge pull request #937 from yxsamliu/nan2
Fix nan for windows
2019-02-28 07:14:27 +05:30
Maneesh Gupta 2da14e97f2 Merge pull request #935 from gargrahul/fix_hipbusbw_beatsoverflow
Fix hipBusBW overflow with setting beats/iterations
2019-02-28 07:14:16 +05:30
Maneesh Gupta 2619b11efc Merge pull request #934 from gargrahul/fix_forceinline_non_hcc
Fix forceinline for non HCC compilation
2019-02-28 07:14:05 +05:30
Rahul Garg 026dc9457e Update hipMemset test 2019-02-28 06:54:49 +05:30
Rahul Garg 70278f1777 Fix hipBusBW sample for P2P bidirectional test 2019-02-28 00:56:07 +05:30
Yaxun Sam Liu 278007218a Fix nan for windows 2019-02-27 12:33:26 -05:00
Maneesh Gupta 8b25f7b4cf [hipcofig] Update HIP_PLATFORM detection logic
HIP_PLATFORM detection logic relied on finding a working KFD. If it was
found, the platform was set as hcc else as nvcc.
However this logic is flawed since it is possible for the development
system to only have the user mode bits to build HIP application code.
Hence the better logic is to rely on finding a suitable compiler.
The new logic is as follows:
- look for a working HCC. If found, platform is set as hcc.
- else look for a working NVCC. If found, platform is set as nvcc.
- else the platform defaults to hcc for now.

Change-Id: Ifcc42c29a19f722153d5c23c55f1a8765dceaf6b
2019-02-27 14:10:21 +05:30
Rahul Garg 673ecd02fe Fix hipBusBW overflow with setting beats/iterations 2019-02-27 00:18:52 +05:30
Rahul Garg 55145feae6 Fix forceinline for non HCC compilation 2019-02-26 07:50:09 +05:30
Evgeny Mankov e8ca9cbf95 Merge pull request #930 from emankov/master
[HIPIFY][doc] Update README.md
2019-02-25 18:29:26 +03:00
Evgeny Mankov 391aa7221b [HIPIFY][doc] Update README.md
+ Populate Dependencies section with upcoming LLVM versions
+ Add clang bugs for not working configs LLVM+CUDA
+ Update Testing section
2019-02-25 18:26:25 +03:00
Evgeny Mankov 70dcb13852 Merge pull request #928 from emankov/master
[HIPIFY][tests] caffe2 test fix
2019-02-25 17:39:06 +03:00
Evgeny Mankov 3a8ef9c8a2 [HIPIFY][tests] caffe2 test fix 2019-02-25 17:12:32 +03:00
Evgeny Mankov 284bf085ac Merge pull request #927 from emankov/master
[HIPIFY][Caffe2] Initial Caffe2 support
2019-02-25 16:41:17 +03:00
Evgeny Mankov 2619f22e5c [HIPIFY][Caffe2] Initial Caffe2 support 2019-02-23 20:46:22 +03:00
Maneesh Gupta 31c4cc78b1 Merge pull request #925 from yxsamliu/h2f
Add __gnu_h2f_ieee and __gnu_f2h_ieee
2019-02-22 13:38:15 +05:30
Maneesh Gupta 53f98edc3d Merge pull request #923 from aaronenyeshi/fix-co-v3-arg
Consume the code obj args to prevent duplicates
2019-02-22 13:38:08 +05:30
Yaxun Sam Liu c5e813f64c Add __gnu_h2f_ieee and __gnu_f2h_ieee
The implementation is copied from HCC runtime.

For hcc it has no effect since apps can find them in either hcc runtime or HIP
runtime.

hip-clang needs it in HIP/HCC runtime so that HIP/HCC and HIP/VDI runtime are
swappable.
2019-02-21 12:48:28 -05:00
Aaron Enye Shi 2cd34f3c01 Consume the code obj args to prevent duplicates 2019-02-19 17:02:27 +00:00
Evgeny Mankov 6eb8256883 Merge pull request #919 from emankov/master
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
2019-02-15 15:06:28 +03:00
Maneesh Gupta 438cf792b7 Merge pull request #916 from mangupta/swdev-179677
Fix signature for unsigned long long variant of atomicExch
2019-02-14 12:46:32 +05:30
Evgeny Mankov d26dc3cea7 [HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
+ CUDNN_API_supported_by_HIP.md update
+ CUDA_Runtime_API_functions_supported_by_HIP.md update
+ libraryPropertyType_t is moved to Runtime
+ test update
2019-02-13 19:40:59 +03:00
Maneesh Gupta 22b55a9b83 Merge pull request #918 from ROCm-Developer-Tools/ci_updates
[ci] Switch tests in CI infrastructure to cuda-10.x
2019-02-13 14:05:23 +05:30
Maneesh Gupta 95ba10fcb9 Merge pull request #917 from mangupta/fix_hip_lib_path
Initialize HIP_LIB_PATH on hcc path
2019-02-13 14:05:13 +05:30
Maneesh Gupta 336a781a2d [ci] Switch CI tests to cuda-10.x
Change-Id: I99e99bcf528d0c38924bd3a86a3067628f2bb3ea
2019-02-13 13:22:08 +05:30
Maneesh Gupta c8579ead67 Initialize HIP_LIB_PATH on hcc path
Change-Id: I8f392fffcd6a3714c20cb17e876dbaad999c33a0
2019-02-13 09:57:21 +05:30
Maneesh Gupta f0618df8f4 Fix signature for unsigned long long variant of atomicExch
This fixes #888

Change-Id: I2b2dc5ed1ff2a26083ad9d102eccac90f67801ae
2019-02-13 09:45:07 +05:30
Maneesh Gupta ca7a53b13d Merge pull request #913 from aaronenyeshi/fix-texture-constant-addrspace
Fix texture using constant addrspace 4
2019-02-13 08:59:01 +05:30
Maneesh Gupta f9daeb1c51 Merge pull request #912 from ROCm-Developer-Tools/remove-cpp-function-templates-for-malloc
Remove C++ function templates for hipMalloc and hipHostMalloc
2019-02-13 08:58:26 +05:30