Maneesh Gupta
3f5e937afc
Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
...
Add extension for kernel concurrency on same stream
[ROCm/hip commit: 352b17346c ]
2019-03-06 17:34:54 +05:30
Maneesh Gupta
df7d8c46ed
Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
...
Add one test case for complex dynamic loading behavior
[ROCm/hip commit: 0c4a40efcc ]
2019-03-06 17:32:23 +05:30
Alex Voicu
0c16497abd
dlopen() fixes ( #929 )
...
* Initial attempt to switch over to internally linked state.
* Add missing CMake update.
* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.
* Ensure global retrieval uses internally linked state.
* Hide HC in the implementation. Minimise ADL woes.
* Strange software exists, and must be catered to.
* Use a less spammy mechanism for ensuring internal linkage / non-export.
* Remove leftover internal detail.
[ROCm/hip commit: ea0fcf3e61 ]
2019-03-06 17:31:44 +05:30
Rahul Garg
5d6fd17fbe
Add extension for kernel concurrency on same stream
...
[ROCm/hip commit: 59081c69fc ]
2019-03-06 12:55:39 +05:30
Maneesh Gupta
2264e69da3
Merge pull request #936 from mangupta/swdev-174923
...
[hipconfig] Update HIP_PLATFORM detection logic
[ROCm/hip commit: 2d67bc5e9c ]
2019-03-06 06:08:11 +05:30
Evgeny Mankov
537e13c2ca
Merge pull request #948 from emankov/master
...
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
[ROCm/hip commit: 23de66bc7e ]
2019-03-05 18:18:39 +03:00
Evgeny Mankov
effdffc006
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
...
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))
based on:
[#933 ] https://github.com/ROCm-Developer-Tools/HIP/pull/933
[ROCm/hip commit: a87d798d90 ]
2019-03-05 18:13:18 +03:00
Maneesh Gupta
31457951ba
Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
...
Add HIP memset APIs to cope with non-zero initial values of integer types
[ROCm/hip commit: 2ed31e9f6c ]
2019-03-05 14:31:38 +05:30
Maneesh Gupta
54c72c1079
Update hipMemset.cpp
...
Address build issues on nvcc path.
[ROCm/hip commit: 51f29b9cee ]
2019-03-05 12:11:11 +05:30
Maneesh Gupta
19b6b2b570
Update hip_runtime_api.h
...
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.
[ROCm/hip commit: 8db717c769 ]
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung
da589e38ed
Address code review comments to use hipDeviceptr_t
...
[ROCm/hip commit: 5cbd28f29b ]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung
ea689f63ca
Add implementation for NVCC path
...
[ROCm/hip commit: 0337b686ef ]
2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung
085c6c15b2
Add direct test for hipMemsetD32 and hipMemsetD32Async
...
[ROCm/hip commit: bfde8a7fab ]
2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung
0b7f38d100
Add hipMemsetD32 and hipMemsetD32Async
...
Add 2 extra memset functions which fills memory with integer-typed data
Also change the parameters of ihipMemset to better explain the semantic
[ROCm/hip commit: 7ebbbd3525 ]
2019-03-04 17:00:33 +00:00
Maneesh Gupta
a60c0222f0
Merge pull request #939 from gargrahul/update_hipmemset_test
...
[dtest] Update hipMemset test
[ROCm/hip commit: de4a9b8446 ]
2019-03-03 20:29:55 +05:30
Rahul Garg
7a9ebee08a
Fix review comments
...
[ROCm/hip commit: 131c0ea326 ]
2019-03-02 23:38:37 +05:30
Maneesh Gupta
91f6e6b1e8
Merge pull request #945 from wkwchau/hipMemset3D_fix
...
Fix hipMemset3D test
[ROCm/hip commit: d28c13ed69 ]
2019-03-01 21:18:12 +05:30
Maneesh Gupta
87b3859ec2
Merge pull request #942 from yxsamliu/v3
...
revert hipcc changes about code object v3
[ROCm/hip commit: b051d44d24 ]
2019-03-01 21:17:10 +05:30
Wilkin Chau
4a0d68ba3f
Fix hipMemset3D test
...
Calculate the allocated size based on the width, height and depth.
[ROCm/hip commit: 8d92d1ebd7 ]
2019-02-28 22:42:46 +00:00
Rahul Garg
4093c03dd7
Fix hipMemset test for HIP/NVCC
...
[ROCm/hip commit: 77c6d5451f ]
2019-03-01 03:46:57 +05:30
Yaxun Sam Liu
d9f59b18f8
Revert "hipcc should consume -mcode-object-v3 flag"
...
This reverts commit 1a8b50e63c .
[ROCm/hip commit: e8556388f9 ]
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu
e748fd2ae0
Revert "Change code-object flag to only HIP-Clang"
...
This reverts commit 17a853c277 .
[ROCm/hip commit: 1bd8d24278 ]
2019-02-28 11:20:04 -05:00
Yaxun Sam Liu
f46d2a3010
Revert "Consume the code obj args to prevent duplicates"
...
This reverts commit b19e4b5b63 .
[ROCm/hip commit: 737972d31c ]
2019-02-28 11:19:35 -05:00
Maneesh Gupta
f3def1c107
Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
...
Fix hipBusBW sample for P2P bidirectional test
[ROCm/hip commit: b93666bdde ]
2019-02-28 07:14:38 +05:30
Maneesh Gupta
f0bb5690f1
Merge pull request #937 from yxsamliu/nan2
...
Fix nan for windows
[ROCm/hip commit: 07b0e9574c ]
2019-02-28 07:14:27 +05:30
Maneesh Gupta
2e700104de
Merge pull request #935 from gargrahul/fix_hipbusbw_beatsoverflow
...
Fix hipBusBW overflow with setting beats/iterations
[ROCm/hip commit: 2da14e97f2 ]
2019-02-28 07:14:16 +05:30
Maneesh Gupta
c3b91c0bcd
Merge pull request #934 from gargrahul/fix_forceinline_non_hcc
...
Fix forceinline for non HCC compilation
[ROCm/hip commit: 2619b11efc ]
2019-02-28 07:14:05 +05:30
Rahul Garg
fa0c9d8a73
Update hipMemset test
...
[ROCm/hip commit: 026dc9457e ]
2019-02-28 06:54:49 +05:30
Rahul Garg
c7620fc4e9
Fix hipBusBW sample for P2P bidirectional test
...
[ROCm/hip commit: 70278f1777 ]
2019-02-28 00:56:07 +05:30
Yaxun Sam Liu
48ab0aa588
Fix nan for windows
...
[ROCm/hip commit: 278007218a ]
2019-02-27 12:33:26 -05:00
Maneesh Gupta
55da014b79
[hipcofig] Update HIP_PLATFORM detection logic
...
HIP_PLATFORM detection logic relied on finding a working KFD. If it was
found, the platform was set as hcc else as nvcc.
However this logic is flawed since it is possible for the development
system to only have the user mode bits to build HIP application code.
Hence the better logic is to rely on finding a suitable compiler.
The new logic is as follows:
- look for a working HCC. If found, platform is set as hcc.
- else look for a working NVCC. If found, platform is set as nvcc.
- else the platform defaults to hcc for now.
Change-Id: Ifcc42c29a19f722153d5c23c55f1a8765dceaf6b
[ROCm/hip commit: 8b25f7b4cf ]
2019-02-27 14:10:21 +05:30
Rahul Garg
6391066444
Fix hipBusBW overflow with setting beats/iterations
...
[ROCm/hip commit: 673ecd02fe ]
2019-02-27 00:18:52 +05:30
Rahul Garg
67723bd3e7
Fix forceinline for non HCC compilation
...
[ROCm/hip commit: 55145feae6 ]
2019-02-26 07:50:09 +05:30
Wen-Heng (Jack) Chung
c79a5a798e
Add one test case for complex dynamic loading behavior
...
Existing HIT syntax doesn't seem to support the expected build and run steps
for this test.
[ROCm/hip commit: 8e5bb3c57a ]
2019-02-25 17:03:31 +00:00
Evgeny Mankov
2606b86405
Merge pull request #930 from emankov/master
...
[HIPIFY][doc] Update README.md
[ROCm/hip commit: e8ca9cbf95 ]
2019-02-25 18:29:26 +03:00
Evgeny Mankov
55d9cd6a65
[HIPIFY][doc] Update README.md
...
+ Populate Dependencies section with upcoming LLVM versions
+ Add clang bugs for not working configs LLVM+CUDA
+ Update Testing section
[ROCm/hip commit: 391aa7221b ]
2019-02-25 18:26:25 +03:00
Evgeny Mankov
d5c833ea8a
Merge pull request #928 from emankov/master
...
[HIPIFY][tests] caffe2 test fix
[ROCm/hip commit: 70dcb13852 ]
2019-02-25 17:39:06 +03:00
Evgeny Mankov
5f4b9a8f8f
[HIPIFY][tests] caffe2 test fix
...
[ROCm/hip commit: 3a8ef9c8a2 ]
2019-02-25 17:12:32 +03:00
Evgeny Mankov
1e879f1c85
Merge pull request #927 from emankov/master
...
[HIPIFY][Caffe2] Initial Caffe2 support
[ROCm/hip commit: 284bf085ac ]
2019-02-25 16:41:17 +03:00
Evgeny Mankov
3a57fc0a4b
[HIPIFY][Caffe2] Initial Caffe2 support
...
[ROCm/hip commit: 2619f22e5c ]
2019-02-23 20:46:22 +03:00
Maneesh Gupta
992f431c99
Merge pull request #925 from yxsamliu/h2f
...
Add __gnu_h2f_ieee and __gnu_f2h_ieee
[ROCm/hip commit: 31c4cc78b1 ]
2019-02-22 13:38:15 +05:30
Maneesh Gupta
2e52e50010
Merge pull request #923 from aaronenyeshi/fix-co-v3-arg
...
Consume the code obj args to prevent duplicates
[ROCm/hip commit: 53f98edc3d ]
2019-02-22 13:38:08 +05:30
Yaxun Sam Liu
7ef8b1138f
Add __gnu_h2f_ieee and __gnu_f2h_ieee
...
The implementation is copied from HCC runtime.
For hcc it has no effect since apps can find them in either hcc runtime or HIP
runtime.
hip-clang needs it in HIP/HCC runtime so that HIP/HCC and HIP/VDI runtime are
swappable.
[ROCm/hip commit: c5e813f64c ]
2019-02-21 12:48:28 -05:00
Aaron Enye Shi
b19e4b5b63
Consume the code obj args to prevent duplicates
...
[ROCm/hip commit: 2cd34f3c01 ]
2019-02-19 17:02:27 +00:00
Evgeny Mankov
0d99882b85
Merge pull request #919 from emankov/master
...
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
[ROCm/hip commit: 6eb8256883 ]
2019-02-15 15:06:28 +03:00
Maneesh Gupta
b46dd95907
Merge pull request #916 from mangupta/swdev-179677
...
Fix signature for unsigned long long variant of atomicExch
[ROCm/hip commit: 438cf792b7 ]
2019-02-14 12:46:32 +05:30
Evgeny Mankov
d91590a98b
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
...
+ CUDNN_API_supported_by_HIP.md update
+ CUDA_Runtime_API_functions_supported_by_HIP.md update
+ libraryPropertyType_t is moved to Runtime
+ test update
[ROCm/hip commit: d26dc3cea7 ]
2019-02-13 19:40:59 +03:00
Maneesh Gupta
3052464cb8
Merge pull request #918 from ROCm-Developer-Tools/ci_updates
...
[ci] Switch tests in CI infrastructure to cuda-10.x
[ROCm/hip commit: 22b55a9b83 ]
2019-02-13 14:05:23 +05:30
Maneesh Gupta
6ea56b5f95
Merge pull request #917 from mangupta/fix_hip_lib_path
...
Initialize HIP_LIB_PATH on hcc path
[ROCm/hip commit: 95ba10fcb9 ]
2019-02-13 14:05:13 +05:30
Maneesh Gupta
dd980e7e72
[ci] Switch CI tests to cuda-10.x
...
Change-Id: I99e99bcf528d0c38924bd3a86a3067628f2bb3ea
[ROCm/hip commit: 336a781a2d ]
2019-02-13 13:22:08 +05:30