Commit Graph

3071 Commits

Author SHA1 Message Date
Maneesh Gupta 3f5e937afc Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
Add extension for kernel concurrency on same stream

[ROCm/hip commit: 352b17346c]
2019-03-06 17:34:54 +05:30
Maneesh Gupta df7d8c46ed Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
Add one test case for complex dynamic loading behavior

[ROCm/hip commit: 0c4a40efcc]
2019-03-06 17:32:23 +05:30
Alex Voicu 0c16497abd dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.


[ROCm/hip commit: ea0fcf3e61]
2019-03-06 17:31:44 +05:30
Rahul Garg 5d6fd17fbe Add extension for kernel concurrency on same stream
[ROCm/hip commit: 59081c69fc]
2019-03-06 12:55:39 +05:30
Maneesh Gupta 2264e69da3 Merge pull request #936 from mangupta/swdev-174923
[hipconfig] Update HIP_PLATFORM detection logic

[ROCm/hip commit: 2d67bc5e9c]
2019-03-06 06:08:11 +05:30
Evgeny Mankov effdffc006 [HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))

based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933


[ROCm/hip commit: a87d798d90]
2019-03-05 18:13:18 +03:00
Maneesh Gupta 54c72c1079 Update hipMemset.cpp
Address build issues on nvcc path.

[ROCm/hip commit: 51f29b9cee]
2019-03-05 12:11:11 +05:30
Maneesh Gupta 19b6b2b570 Update hip_runtime_api.h
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.

[ROCm/hip commit: 8db717c769]
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung da589e38ed Address code review comments to use hipDeviceptr_t
[ROCm/hip commit: 5cbd28f29b]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung ea689f63ca Add implementation for NVCC path
[ROCm/hip commit: 0337b686ef]
2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung 085c6c15b2 Add direct test for hipMemsetD32 and hipMemsetD32Async
[ROCm/hip commit: bfde8a7fab]
2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung 0b7f38d100 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fill memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantics


[ROCm/hip commit: 7ebbbd3525]
2019-03-04 17:00:33 +00:00
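
The difference between a byte-wise memset and the integer-typed fill described in the commit above can be sketched on the host as follows. This is only an illustration under stated assumptions: `fillBytes` and `fillD32` are hypothetical stand-ins that operate on ordinary host memory, not the HIP functions themselves, and they say nothing about how the runtime implements the device-side fill.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical host-side stand-in for a byte-wise memset:
// every byte of the buffer receives the low 8 bits of `value`.
inline void fillBytes(void* dst, int value, std::size_t sizeBytes) {
    std::memset(dst, value, sizeBytes);
}

// Hypothetical host-side stand-in for a 32-bit fill:
// every 32-bit element receives the full `value`.
// `count` is a number of elements, not a number of bytes.
inline void fillD32(std::uint32_t* dst, std::uint32_t value, std::size_t count) {
    std::fill_n(dst, count, value);
}
```

With a byte-wise fill, a pattern such as `0x12345678` cannot be written in one call, because only one byte value is replicated; the 32-bit variant writes the whole word into each element. The same distinction is why mapping a 32-bit memset onto a byte-wise one changes behavior.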
Maneesh Gupta a60c0222f0 Merge pull request #939 from gargrahul/update_hipmemset_test
[dtest] Update hipMemset test

[ROCm/hip commit: de4a9b8446]
2019-03-03 20:29:55 +05:30
Rahul Garg 7a9ebee08a Fix review comments
[ROCm/hip commit: 131c0ea326]
2019-03-02 23:38:37 +05:30
Maneesh Gupta 91f6e6b1e8 Merge pull request #945 from wkwchau/hipMemset3D_fix
Fix hipMemset3D test

[ROCm/hip commit: d28c13ed69]
2019-03-01 21:18:12 +05:30
Wilkin Chau 4a0d68ba3f Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.


[ROCm/hip commit: 8d92d1ebd7]
2019-02-28 22:42:46 +00:00
Rahul Garg 4093c03dd7 Fix hipMemset test for HIP/NVCC
[ROCm/hip commit: 77c6d5451f]
2019-03-01 03:46:57 +05:30
Yaxun Sam Liu d9f59b18f8 Revert "hipcc should consume -mcode-object-v3 flag"
This reverts commit 1a8b50e63c.


[ROCm/hip commit: e8556388f9]
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu e748fd2ae0 Revert "Change code-object flag to only HIP-Clang"
This reverts commit 17a853c277.


[ROCm/hip commit: 1bd8d24278]
2019-02-28 11:20:04 -05:00
Yaxun Sam Liu f46d2a3010 Revert "Consume the code obj args to prevent duplicates"
This reverts commit b19e4b5b63.


[ROCm/hip commit: 737972d31c]
2019-02-28 11:19:35 -05:00
Maneesh Gupta f3def1c107 Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
Fix hipBusBW sample for P2P bidirectional test

[ROCm/hip commit: b93666bdde]
2019-02-28 07:14:38 +05:30
Maneesh Gupta f0bb5690f1 Merge pull request #937 from yxsamliu/nan2
Fix nan for windows

[ROCm/hip commit: 07b0e9574c]
2019-02-28 07:14:27 +05:30
Maneesh Gupta 2e700104de Merge pull request #935 from gargrahul/fix_hipbusbw_beatsoverflow
Fix hipBusBW overflow with setting beats/iterations

[ROCm/hip commit: 2da14e97f2]
2019-02-28 07:14:16 +05:30
Rahul Garg fa0c9d8a73 Update hipMemset test
[ROCm/hip commit: 026dc9457e]
2019-02-28 06:54:49 +05:30
Rahul Garg c7620fc4e9 Fix hipBusBW sample for P2P bidirectional test
[ROCm/hip commit: 70278f1777]
2019-02-28 00:56:07 +05:30
Yaxun Sam Liu 48ab0aa588 Fix nan for windows
[ROCm/hip commit: 278007218a]
2019-02-27 12:33:26 -05:00
Maneesh Gupta 55da014b79 [hipconfig] Update HIP_PLATFORM detection logic
HIP_PLATFORM detection logic relied on finding a working KFD. If it was
found, the platform was set as hcc else as nvcc.
However this logic is flawed since it is possible for the development
system to only have the user mode bits to build HIP application code.
Hence the better logic is to rely on finding a suitable compiler.
The new logic is as follows:
- look for a working HCC. If found, platform is set as hcc.
- else look for a working NVCC. If found, platform is set as nvcc.
- else the platform defaults to hcc for now.

Change-Id: Ifcc42c29a19f722153d5c23c55f1a8765dceaf6b


[ROCm/hip commit: 8b25f7b4cf]
2019-02-27 14:10:21 +05:30
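
The detection order listed in the commit message above can be sketched as a small function. This is a minimal illustration, not the code in hipconfig: `detectHipPlatform` and its two probe parameters are hypothetical names, and how a "working" compiler is actually checked is left to the caller.

```cpp
#include <functional>
#include <string>

// Hypothetical sketch of the detection order from the commit message.
// Each probe returns true if a working compiler of that kind was found.
inline std::string detectHipPlatform(const std::function<bool()>& hccWorks,
                                     const std::function<bool()>& nvccWorks) {
    if (hccWorks()) {
        return "hcc";   // a working HCC takes priority
    }
    if (nvccWorks()) {
        return "nvcc";  // otherwise fall back to a working NVCC
    }
    return "hcc";       // neither found: default to hcc for now
}
```

Passing the probes as callables means the NVCC check runs only when the HCC check fails, which mirrors the "else" ordering in the message. Nothing here depends on a kernel driver being present, which is the point of the change.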
Rahul Garg 6391066444 Fix hipBusBW overflow with setting beats/iterations
[ROCm/hip commit: 673ecd02fe]
2019-02-27 00:18:52 +05:30
Rahul Garg 67723bd3e7 Fix forceinline for non HCC compilation
[ROCm/hip commit: 55145feae6]
2019-02-26 07:50:09 +05:30
Wen-Heng (Jack) Chung c79a5a798e Add one test case for complex dynamic loading behavior
Existing HIT syntax doesn't seem to support the expected build and run steps
for this test.


[ROCm/hip commit: 8e5bb3c57a]
2019-02-25 17:03:31 +00:00
Evgeny Mankov 55d9cd6a65 [HIPIFY][doc] Update README.md
+ Populate Dependencies section with upcoming LLVM versions
+ Add clang bugs for not working configs LLVM+CUDA
+ Update Testing section


[ROCm/hip commit: 391aa7221b]
2019-02-25 18:26:25 +03:00
Evgeny Mankov 5f4b9a8f8f [HIPIFY][tests] caffe2 test fix
[ROCm/hip commit: 3a8ef9c8a2]
2019-02-25 17:12:32 +03:00
Evgeny Mankov 3a57fc0a4b [HIPIFY][Caffe2] Initial Caffe2 support
[ROCm/hip commit: 2619f22e5c]
2019-02-23 20:46:22 +03:00
Maneesh Gupta 992f431c99 Merge pull request #925 from yxsamliu/h2f
Add __gnu_h2f_ieee and __gnu_f2h_ieee

[ROCm/hip commit: 31c4cc78b1]
2019-02-22 13:38:15 +05:30
Yaxun Sam Liu 7ef8b1138f Add __gnu_h2f_ieee and __gnu_f2h_ieee
The implementation is copied from HCC runtime.

For hcc it has no effect since apps can find them in either hcc runtime or HIP
runtime.

hip-clang needs it in HIP/HCC runtime so that HIP/HCC and HIP/VDI runtime are
swappable.


[ROCm/hip commit: c5e813f64c]
2019-02-21 12:48:28 -05:00
Aaron Enye Shi b19e4b5b63 Consume the code obj args to prevent duplicates
[ROCm/hip commit: 2cd34f3c01]
2019-02-19 17:02:27 +00:00
Evgeny Mankov 0d99882b85 Merge pull request #919 from emankov/master
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN

[ROCm/hip commit: 6eb8256883]
2019-02-15 15:06:28 +03:00
Maneesh Gupta b46dd95907 Merge pull request #916 from mangupta/swdev-179677
Fix signature for unsigned long long variant of atomicExch

[ROCm/hip commit: 438cf792b7]
2019-02-14 12:46:32 +05:30
Evgeny Mankov d91590a98b [HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
+ CUDNN_API_supported_by_HIP.md update
+ CUDA_Runtime_API_functions_supported_by_HIP.md update
+ libraryPropertyType_t is moved to Runtime
+ test update


[ROCm/hip commit: d26dc3cea7]
2019-02-13 19:40:59 +03:00
Maneesh Gupta 3052464cb8 Merge pull request #918 from ROCm-Developer-Tools/ci_updates
[ci] Switch tests in CI infrastructure to cuda-10.x

[ROCm/hip commit: 22b55a9b83]
2019-02-13 14:05:23 +05:30
Maneesh Gupta dd980e7e72 [ci] Switch CI tests to cuda-10.x
Change-Id: I99e99bcf528d0c38924bd3a86a3067628f2bb3ea


[ROCm/hip commit: 336a781a2d]
2019-02-13 13:22:08 +05:30
Maneesh Gupta 7094c2d28d Initialize HIP_LIB_PATH on hcc path
Change-Id: I8f392fffcd6a3714c20cb17e876dbaad999c33a0


[ROCm/hip commit: c8579ead67]
2019-02-13 09:57:21 +05:30
Maneesh Gupta 8f1b3d34f7 Fix signature for unsigned long long variant of atomicExch
This fixes #888

Change-Id: I2b2dc5ed1ff2a26083ad9d102eccac90f67801ae


[ROCm/hip commit: f0618df8f4]
2019-02-13 09:45:07 +05:30
Maneesh Gupta f18ddd57da Merge pull request #913 from aaronenyeshi/fix-texture-constant-addrspace
Fix texture using constant addrspace 4

[ROCm/hip commit: ca7a53b13d]
2019-02-13 08:59:01 +05:30
Maneesh Gupta 6d16a4b679 Merge pull request #912 from ROCm-Developer-Tools/remove-cpp-function-templates-for-malloc
Remove C++ function templates for hipMalloc and hipHostMalloc

[ROCm/hip commit: f9daeb1c51]
2019-02-13 08:58:26 +05:30
Maneesh Gupta 4441aff847 Merge pull request #910 from aaronenyeshi/hip-code-obj-v3-passing
hipcc should consume -mcode-object-v3 flag

[ROCm/hip commit: d91bbf881c]
2019-02-13 08:58:19 +05:30
Aaron Enye Shi 1dad8c4329 Fix texture using constant addrspace 4
[ROCm/hip commit: 63b400beea]
2019-02-12 16:56:22 +00:00
Aaron Enye Shi 17a853c277 Change code-object flag to only HIP-Clang
[ROCm/hip commit: 1c6365fbe7]
2019-02-11 17:18:12 +00:00
Wen-Heng (Jack) Chung abd8e75c47 Introduce __HIP_DISABLE_CPP_FUNCTIONS__ macro to disable C++ wrappers
hipMalloc and hipHostMalloc are C++ function templates by default. In some
applications they need to discover the decltype of underlying C functions. In
this commit, a new macro __HIP_DISABLE_CPP_FUNCTIONS__ is introduced to
suppress the C++ function templates.

Existing HIP applications don't need to be changed. hipMalloc and hipHostMalloc
will behave just the same.

For new applications which depend on identifying decltypes of underlying C
functions, __HIP_DISABLE_CPP_FUNCTIONS__ shall be defined prior to including
HIP headers.


[ROCm/hip commit: eba3bd3082]
2019-02-11 09:33:15 -06:00
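
Why a C++ function template gets in the way of `decltype`, and how a guard macro helps, can be shown with a self-contained mock. Everything below is an assumption made for illustration: `mockMalloc` and `MOCK_DISABLE_CPP_FUNCTIONS` are hypothetical stand-ins for the pattern the commit describes, and no real HIP header is included.

```cpp
#include <cstddef>
#include <cstdlib>
#include <type_traits>

// The guard macro must be defined before the header section below,
// just as the commit message asks for the real macro to be defined
// prior to including HIP headers.
#define MOCK_DISABLE_CPP_FUNCTIONS

// ---- stand-in for a header that offers a C function plus a C++ wrapper ----
inline int mockMalloc(void** ptr, std::size_t size) {
    *ptr = std::malloc(size);
    return *ptr != nullptr ? 0 : 1;  // 0 means success in this mock
}

#ifndef MOCK_DISABLE_CPP_FUNCTIONS
// Convenience template: lets callers pass T** without a cast.
// When it is visible, `mockMalloc` names an overload set.
template <typename T>
inline int mockMalloc(T** ptr, std::size_t size) {
    return mockMalloc(reinterpret_cast<void**>(ptr), size);
}
#endif
// ---- end of header stand-in ----

// With the template suppressed, `mockMalloc` names exactly one function,
// so its type can be recovered with decltype.
using MallocFn = decltype(&mockMalloc);
static_assert(std::is_same<MallocFn, int (*)(void**, std::size_t)>::value,
              "mockMalloc should have the plain C-style signature");
```

If the `#define` is removed, the template overload becomes visible and `decltype(&mockMalloc)` no longer compiles, because the address of an overload set has no single type. Code that does not take the function's type is unaffected either way, which matches the commit's note that existing applications need no changes.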
Maneesh Gupta 9ab6e7b7f6 Merge pull request #911 from lmoriche/hip_clang
Allow HIP_LIB_PATH to be overridden at the command line. Use HIP_LIB_P…

[ROCm/hip commit: 31c4a0dff1]
2019-02-11 11:26:39 +05:30