Commit Graph

3364 Commits

Author SHA1 Message Date
Maneesh Gupta 3f5e937afc Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
Add extension for kernel concurrency on same stream

[ROCm/hip commit: 352b17346c]
2019-03-06 17:34:54 +05:30
Maneesh Gupta df7d8c46ed Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
Add one test case for complex dynamic loading behavior

[ROCm/hip commit: 0c4a40efcc]
2019-03-06 17:32:23 +05:30
Alex Voicu 0c16497abd dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.


[ROCm/hip commit: ea0fcf3e61]
2019-03-06 17:31:44 +05:30
Rahul Garg 5d6fd17fbe Add extension for kernel concurrency on same stream
[ROCm/hip commit: 59081c69fc]
2019-03-06 12:55:39 +05:30
Maneesh Gupta 2264e69da3 Merge pull request #936 from mangupta/swdev-174923
[hipconfig] Update HIP_PLATFORM detection logic

[ROCm/hip commit: 2d67bc5e9c]
2019-03-06 06:08:11 +05:30
Evgeny Mankov 537e13c2ca Merge pull request #948 from emankov/master
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping

[ROCm/hip commit: 23de66bc7e]
2019-03-05 18:18:39 +03:00
Evgeny Mankov effdffc006 [HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))

based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933


[ROCm/hip commit: a87d798d90]
2019-03-05 18:13:18 +03:00
Maneesh Gupta 31457951ba Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
Add HIP memset APIs to cope with non-zero initial values of integer types

[ROCm/hip commit: 2ed31e9f6c]
2019-03-05 14:31:38 +05:30
Maneesh Gupta 54c72c1079 Update hipMemset.cpp
Address build issues on nvcc path.

[ROCm/hip commit: 51f29b9cee]
2019-03-05 12:11:11 +05:30
Maneesh Gupta 19b6b2b570 Update hip_runtime_api.h
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.

[ROCm/hip commit: 8db717c769]
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung da589e38ed Address code review comments to use hipDeviceptr_t
[ROCm/hip commit: 5cbd28f29b]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung ea689f63ca Add implementation for NVCC path
[ROCm/hip commit: 0337b686ef]
2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung 085c6c15b2 Add direct test for hipMemsetD32 and hipMemsetD32Async
[ROCm/hip commit: bfde8a7fab]
2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung 0b7f38d100 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fill memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantics


[ROCm/hip commit: 7ebbbd3525]
2019-03-04 17:00:33 +00:00
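The motivation for hipMemsetD32 in the commit above (0b7f38d100) can be illustrated on the host. This is a minimal sketch in plain C++, not HIP code: it assumes a byte-wise fill behaves like std::memset and that a 32-bit fill writes the whole value into each element. The helper names fill_bytewise and fill_wordwise are hypothetical and do not appear in the repository.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Byte-wise fill (memset semantics): every byte of the buffer receives
// the low 8 bits of `value`, so a 32-bit element ends up as that byte
// repeated four times.
std::vector<std::uint32_t> fill_bytewise(std::size_t count, int value) {
    std::vector<std::uint32_t> buf(count);
    if (count != 0) {
        std::memset(buf.data(), value, count * sizeof(std::uint32_t));
    }
    return buf;
}

// Element-wise fill: every 32-bit element receives `value` unchanged.
// This is the behaviour an integer-typed memset is meant to provide.
std::vector<std::uint32_t> fill_wordwise(std::size_t count, std::uint32_t value) {
    std::vector<std::uint32_t> buf(count);
    std::fill_n(buf.begin(), count, value);
    return buf;
}
```

With a fill value of 1, the byte-wise variant produces elements equal to 0x01010101, while the element-wise variant produces elements equal to 1. The two only agree for values whose four bytes are identical, such as 0.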
Maneesh Gupta a60c0222f0 Merge pull request #939 from gargrahul/update_hipmemset_test
[dtest] Update hipMemset test

[ROCm/hip commit: de4a9b8446]
2019-03-03 20:29:55 +05:30
Rahul Garg 7a9ebee08a Fix review comments
[ROCm/hip commit: 131c0ea326]
2019-03-02 23:38:37 +05:30
Maneesh Gupta 91f6e6b1e8 Merge pull request #945 from wkwchau/hipMemset3D_fix
Fix hipMemset3D test

[ROCm/hip commit: d28c13ed69]
2019-03-01 21:18:12 +05:30
Maneesh Gupta 87b3859ec2 Merge pull request #942 from yxsamliu/v3
revert hipcc changes about code object v3

[ROCm/hip commit: b051d44d24]
2019-03-01 21:17:10 +05:30
Wilkin Chau 4a0d68ba3f Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.


[ROCm/hip commit: 8d92d1ebd7]
2019-02-28 22:42:46 +00:00
Rahul Garg 4093c03dd7 Fix hipMemset test for HIP/NVCC
[ROCm/hip commit: 77c6d5451f]
2019-03-01 03:46:57 +05:30
Yaxun Sam Liu d9f59b18f8 Revert "hipcc should consume -mcode-object-v3 flag"
This reverts commit 1a8b50e63c.


[ROCm/hip commit: e8556388f9]
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu e748fd2ae0 Revert "Change code-object flag to only HIP-Clang"
This reverts commit 17a853c277.


[ROCm/hip commit: 1bd8d24278]
2019-02-28 11:20:04 -05:00
Yaxun Sam Liu f46d2a3010 Revert "Consume the code obj args to prevent duplicates"
This reverts commit b19e4b5b63.


[ROCm/hip commit: 737972d31c]
2019-02-28 11:19:35 -05:00
Maneesh Gupta f3def1c107 Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
Fix hipBusBW sample for P2P bidirectional test

[ROCm/hip commit: b93666bdde]
2019-02-28 07:14:38 +05:30
Maneesh Gupta f0bb5690f1 Merge pull request #937 from yxsamliu/nan2
Fix nan for windows

[ROCm/hip commit: 07b0e9574c]
2019-02-28 07:14:27 +05:30
Maneesh Gupta 2e700104de Merge pull request #935 from gargrahul/fix_hipbusbw_beatsoverflow
Fix hipBusBW overflow with setting beats/iterations

[ROCm/hip commit: 2da14e97f2]
2019-02-28 07:14:16 +05:30
Maneesh Gupta c3b91c0bcd Merge pull request #934 from gargrahul/fix_forceinline_non_hcc
Fix forceinline for non HCC compilation

[ROCm/hip commit: 2619b11efc]
2019-02-28 07:14:05 +05:30
Rahul Garg fa0c9d8a73 Update hipMemset test
[ROCm/hip commit: 026dc9457e]
2019-02-28 06:54:49 +05:30
Rahul Garg c7620fc4e9 Fix hipBusBW sample for P2P bidirectional test
[ROCm/hip commit: 70278f1777]
2019-02-28 00:56:07 +05:30
Yaxun Sam Liu 48ab0aa588 Fix nan for windows
[ROCm/hip commit: 278007218a]
2019-02-27 12:33:26 -05:00
Maneesh Gupta 55da014b79 [hipconfig] Update HIP_PLATFORM detection logic
HIP_PLATFORM detection logic relied on finding a working KFD. If one was
found, the platform was set to hcc; otherwise it was set to nvcc.
However, this logic is flawed, since a development system may have only
the user-mode bits needed to build HIP application code.
Hence the better logic is to rely on finding a suitable compiler.
The new logic is as follows:
- look for a working HCC. If found, platform is set as hcc.
- else look for a working NVCC. If found, platform is set as nvcc.
- else the platform defaults to hcc for now.

Change-Id: Ifcc42c29a19f722153d5c23c55f1a8765dceaf6b


[ROCm/hip commit: 8b25f7b4cf]
2019-02-27 14:10:21 +05:30
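The detection order described in the hipconfig commit above (55da014b79) can be sketched as a small decision function. This is an illustration only, written in C++ rather than in the tool's own language. The function name detect_hip_platform and its two boolean inputs are hypothetical, and the commit message does not say how a "working" compiler is probed, so that step is left to the caller.

```cpp
#include <string>

// Hypothetical helper mirroring the three-step order in the commit message.
// `hcc_works` and `nvcc_works` stand in for whatever probe decides that a
// working HCC or NVCC compiler is present; that probe is not shown here.
std::string detect_hip_platform(bool hcc_works, bool nvcc_works) {
    if (hcc_works) {
        return "hcc";   // step 1: a working HCC wins
    }
    if (nvcc_works) {
        return "nvcc";  // step 2: otherwise fall back to a working NVCC
    }
    return "hcc";       // step 3: neither found, default to hcc "for now"
}
```

Under this ordering, a machine with both compilers installed resolves to hcc, and a machine with no compiler at all also resolves to hcc. Only a machine with a working NVCC and no working HCC resolves to nvcc.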
Rahul Garg 6391066444 Fix hipBusBW overflow with setting beats/iterations
[ROCm/hip commit: 673ecd02fe]
2019-02-27 00:18:52 +05:30
Rahul Garg 67723bd3e7 Fix forceinline for non HCC compilation
[ROCm/hip commit: 55145feae6]
2019-02-26 07:50:09 +05:30
Wen-Heng (Jack) Chung c79a5a798e Add one test case for complex dynamic loading behavior
Existing HIT syntax doesn't seem to support the expected build and run steps
for this test.


[ROCm/hip commit: 8e5bb3c57a]
2019-02-25 17:03:31 +00:00
Evgeny Mankov 2606b86405 Merge pull request #930 from emankov/master
[HIPIFY][doc] Update README.md

[ROCm/hip commit: e8ca9cbf95]
2019-02-25 18:29:26 +03:00
Evgeny Mankov 55d9cd6a65 [HIPIFY][doc] Update README.md
+ Populate Dependencies section with upcoming LLVM versions
+ Add clang bugs for non-working LLVM+CUDA configs
+ Update Testing section


[ROCm/hip commit: 391aa7221b]
2019-02-25 18:26:25 +03:00
Evgeny Mankov d5c833ea8a Merge pull request #928 from emankov/master
[HIPIFY][tests] caffe2 test fix

[ROCm/hip commit: 70dcb13852]
2019-02-25 17:39:06 +03:00
Evgeny Mankov 5f4b9a8f8f [HIPIFY][tests] caffe2 test fix
[ROCm/hip commit: 3a8ef9c8a2]
2019-02-25 17:12:32 +03:00
Evgeny Mankov 1e879f1c85 Merge pull request #927 from emankov/master
[HIPIFY][Caffe2] Initial Caffe2 support

[ROCm/hip commit: 284bf085ac]
2019-02-25 16:41:17 +03:00
Evgeny Mankov 3a57fc0a4b [HIPIFY][Caffe2] Initial Caffe2 support
[ROCm/hip commit: 2619f22e5c]
2019-02-23 20:46:22 +03:00
Maneesh Gupta 992f431c99 Merge pull request #925 from yxsamliu/h2f
Add __gnu_h2f_ieee and __gnu_f2h_ieee

[ROCm/hip commit: 31c4cc78b1]
2019-02-22 13:38:15 +05:30
Maneesh Gupta 2e52e50010 Merge pull request #923 from aaronenyeshi/fix-co-v3-arg
Consume the code obj args to prevent duplicates

[ROCm/hip commit: 53f98edc3d]
2019-02-22 13:38:08 +05:30
Yaxun Sam Liu 7ef8b1138f Add __gnu_h2f_ieee and __gnu_f2h_ieee
The implementation is copied from HCC runtime.

For hcc it has no effect since apps can find them in either hcc runtime or HIP
runtime.

hip-clang needs it in HIP/HCC runtime so that HIP/HCC and HIP/VDI runtime are
swappable.


[ROCm/hip commit: c5e813f64c]
2019-02-21 12:48:28 -05:00
Aaron Enye Shi b19e4b5b63 Consume the code obj args to prevent duplicates
[ROCm/hip commit: 2cd34f3c01]
2019-02-19 17:02:27 +00:00
Evgeny Mankov 0d99882b85 Merge pull request #919 from emankov/master
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN

[ROCm/hip commit: 6eb8256883]
2019-02-15 15:06:28 +03:00
Maneesh Gupta b46dd95907 Merge pull request #916 from mangupta/swdev-179677
Fix signature for unsigned long long variant of atomicExch

[ROCm/hip commit: 438cf792b7]
2019-02-14 12:46:32 +05:30
Evgeny Mankov d91590a98b [HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
+ CUDNN_API_supported_by_HIP.md update
+ CUDA_Runtime_API_functions_supported_by_HIP.md update
+ libraryPropertyType_t is moved to Runtime
+ test update


[ROCm/hip commit: d26dc3cea7]
2019-02-13 19:40:59 +03:00
Maneesh Gupta 3052464cb8 Merge pull request #918 from ROCm-Developer-Tools/ci_updates
[ci] Switch tests in CI infrastructure to cuda-10.x

[ROCm/hip commit: 22b55a9b83]
2019-02-13 14:05:23 +05:30
Maneesh Gupta 6ea56b5f95 Merge pull request #917 from mangupta/fix_hip_lib_path
Initialize HIP_LIB_PATH on hcc path

[ROCm/hip commit: 95ba10fcb9]
2019-02-13 14:05:13 +05:30
Maneesh Gupta dd980e7e72 [ci] Switch CI tests to cuda-10.x
Change-Id: I99e99bcf528d0c38924bd3a86a3067628f2bb3ea


[ROCm/hip commit: 336a781a2d]
2019-02-13 13:22:08 +05:30