Граф коммитов

3773 Коммитов

Автор SHA1 Сообщение Дата
Sarbojit2019 55fa6a83c6 Enabled gcc for hip host code (#1214)
* Enabled gcc for hip host code

* Adding tests for hip code + (gcc & g++), without kernels

* Excluding nvcc platforms for gcc and g++ tests + Addressing review comments

* minor code clean-up

* Add rocm include path

* Added relative path for library

* Hiding non supported functions for gcc

* Incorporating review comments


[ROCm/clr commit: f23c1a1499]
2019-08-05 09:51:36 +00:00
Jeff Daily 68f674205e consolidate thread local storage (#915)
* all thread local access now through single struct

* clean up old commented-out code, more use of GET_TLS()

* fewer calls to GET_TLS by passing tls as a funtion argument

* revert unnecessary change to printf

* fix failing tests due to TLS change

* fix merge conflicts in ihipOccupancyMaxActiveBlocksPerMultiprocessor


[ROCm/clr commit: f337ae1edb]
2019-08-05 09:51:02 +00:00
Evgeny Mankov 12804fe223 Merge pull request #1282 from emankov/hipify-clang
[HIPIFY][fix][#211] Taking into account include guard controlling macro

[ROCm/clr commit: bb7cfaf91a]
2019-08-04 16:31:53 +03:00
Evgeny Mankov e278fbf2f9 [HIPIFY][fix][#211] Taking into account include guard controlling macro
...while including HIP main header file, which is inserted now after #indef controlling macro, or after #pragma once, if it's occurred earlier.

+ Add a couple of unit tests.
ToDo: Check backward compatibility on older clang versions.


[ROCm/clr commit: fedef02c37]
2019-08-02 16:46:45 +03:00
Maneesh Gupta f36dced5d7 Merge pull request #1278 from gargrahul/fix_hipfuncGetAttribute_logstatus
Fix missing logstatus in hipFuncGetAttributes

[ROCm/clr commit: a489877153]
2019-08-02 10:00:38 +00:00
wkwchau 7b0f478767 Added CooperativeLaunch and CooperativeMultiDeviceLaunch flag and property for hipDeviceGetAttribute() and hipGetDeviceProperties() (#1247)
[ROCm/clr commit: ed04e96e2d]
2019-08-02 10:00:25 +00:00
Rahul Garg b064d7cab2 Fix missing logstatus in hipFuncGetAttributes
[ROCm/clr commit: 20e9aba94e]
2019-08-02 11:51:34 +05:30
wkwchau 75f1bb21b3 Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttribu… (#1238)
* Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttributeHdpRegFlushCntl

* Added NVCC blocker for the hip*FlushCntl test cases


[ROCm/clr commit: abe6776677]
2019-08-01 16:03:35 +00:00
Maneesh Gupta 98669d1867 Merge pull request #1277 from mangupta/nvcc_devprop
[nvcc] Populate missing fields in hipGetDeviceProperties

[ROCm/clr commit: d5a3202a47]
2019-08-01 08:59:58 +00:00
Maneesh Gupta 969eddc258 Merge pull request #1276 from vsytch/SWDEV-197675
[hip][tests] Don't use a hardcoded warp size, since it can be dynamically changed.…

[ROCm/clr commit: 6ee5fcc07c]
2019-08-01 08:59:43 +00:00
Maneesh Gupta b458d72079 Merge pull request #1275 from yxsamliu/fix-std
Fix -std=c++14 for windows

[ROCm/clr commit: d87e243c51]
2019-08-01 08:59:27 +00:00
Maneesh Gupta 60210d987c Merge pull request #1243 from jeffdaily/master-stream-lock-fix
remove stream locks where it is safe to do so

[ROCm/clr commit: 3de3f57468]
2019-08-01 08:59:13 +00:00
wkwchau a19b4fbd8b Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOc… (#1240)
* Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags APIs

* Taking into account of SGPR usage to determine the max active blocks in hipOccupancyMaxActiveBlocksPerMultiprocessor()


[ROCm/clr commit: 7b9801fe9a]
2019-08-01 08:58:48 +00:00
Maneesh Gupta d5f73d22aa [nvcc] Populate missing fields in hipGetDeviceProperties
Change-Id: Ie90e02674d503e385f144f1ead3d53ff7b49cecc


[ROCm/clr commit: b24a4000f8]
2019-08-01 13:16:39 +05:30
Vladislav Sytchenko 5b201fe854 Don't use a hardcoded warp size, since it can be dynamically changed. Query it from the runtime instead.
[ROCm/clr commit: 9a1835ddc3]
2019-07-31 17:04:31 -04:00
Yaxun (Sam) Liu 05b71135e9 Fix -std=c++14 for windows
[ROCm/clr commit: c1dc675e3d]
2019-07-31 16:36:47 -04:00
Evgeny Mankov f99c7e2bb6 Merge pull request #1274 from emankov/cuDNN
[HIP][doc] Populate CUDA Runtime API doc with CUDA version field

[ROCm/clr commit: a6dcaf4bcd]
2019-07-31 23:01:32 +03:00
Evgeny Mankov d261b0593a [HIP][doc] Populate CUDA Runtime API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos, add missing references.


[ROCm/clr commit: b149219167]
2019-07-31 22:59:05 +03:00
Maneesh Gupta b297398add Merge pull request #1269 from gargrahul/fix_ptr_attr_unkonwn_to_invalid
hipPointerGetAttributes- Change hipErrorUnknown to hipErrorInvalidValue

[ROCm/clr commit: e0397d3d1f]
2019-07-31 15:43:06 +00:00
Maneesh Gupta f7e8d957bd Merge pull request #1265 from gargrahul/fix_hip_porting_guide_texture_ref_use
[docs]Fix texture reference APIs usage part

[ROCm/clr commit: 56d41344c6]
2019-07-31 15:42:54 +00:00
Rahul Garg f9eaac9561 Add hip init in hipExtLaunchMultiKernelMultiDevice (#1263)
* Add hip init in hipExtLaunchMultiKernelMultiDevice

* Add more logstatus for multiple return paths

* Fix missing i in function name


[ROCm/clr commit: 8df47255c5]
2019-07-31 15:42:29 +00:00
Rahul Garg 3b5dac1d9d Add HIP init in hipFuncGetAttributes (#1262)
* Add HIP init in hipFuncGetAttributes

* [dtest]Remove explicit hip init call in hipFuncGetAttributes dtest


[ROCm/clr commit: c610159b85]
2019-07-31 15:42:08 +00:00
Maneesh Gupta 0e56fee8e8 Merge pull request #1270 from mangupta/ci_stablity
[ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now

[ROCm/clr commit: 25a90ee0d6]
2019-07-31 05:03:25 +00:00
ansurya 733178f492 Testcase to validate signed/unsigned char,short as normalized float (#1267)
* Testcase to validate signed/unsigned char,short as normalized float

* corrected test_common.cpp file path


[ROCm/clr commit: 440c5f1677]
2019-07-31 05:02:35 +00:00
ansurya d46e575ec1 Add HSA_PATH to hip_Includes in cmake and hipconfig (#1260)
* Add HSA_PATH to hip_Includes in cmake and hipconfig

* HSA_PATH to CACHE path,checks for HSA include path

* Removed new lines at EOF


[ROCm/clr commit: 53b5c917cc]
2019-07-31 05:02:20 +00:00
Maneesh Gupta f32d31b79e [ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now
Change-Id: Ib90dd390ed71d0b3867e5dc36a41988cc4d42a99


[ROCm/clr commit: 08062ca607]
2019-07-31 09:35:43 +05:30
Rahul Garg 7a21d085ad Change hipErrorUnknown to hipErrorInvalidValue
[ROCm/clr commit: 1c49943ac3]
2019-07-31 00:28:30 +05:30
Evgeny Mankov 189e6b5d44 Merge pull request #1268 from emankov/cuDNN
[HIPIFY][DNN][doc] Populate cuDNN API doc with CUDA version field

[ROCm/clr commit: 41b4f5295e]
2019-07-30 20:55:08 +03:00
Evgeny Mankov ae01ed798f [HIPIFY][DNN][doc] Populate cuDNN API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos.


[ROCm/clr commit: 98ce4725fd]
2019-07-30 20:53:57 +03:00
Rahul Garg 8245cb797e [docs]Fix texture reference APIs usage part
[ROCm/clr commit: ebdc3a9cb3]
2019-07-30 02:56:47 +05:30
Evgeny Mankov 88b31c4478 Merge pull request #1261 from emankov/cuDNN
[HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 2 of 2)

[ROCm/clr commit: 4d78435488]
2019-07-29 21:14:17 +03:00
Evgeny Mankov c649564657 [HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 2 of 2)
+ Add undocumented but presented in cusparse.h functions since CUDA 10.1 Update 1


[ROCm/clr commit: ec755e0005]
2019-07-29 21:12:35 +03:00
Evgeny Mankov f93f935edd Merge pull request #1259 from emankov/cuDNN
[HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 1 of 2)

[ROCm/clr commit: ee0a95b416]
2019-07-26 21:35:52 +03:00
Evgeny Mankov 8c8c8ca153 [HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 1 of 2)
[ROCm/clr commit: ea02797cc7]
2019-07-26 21:34:36 +03:00
Evgeny Mankov 81c695e640 Merge pull request #1258 from emankov/cuDNN
[HIPIFY][SPARSE][doc] Populate cuSPARSE API doc with CUDA version field

[ROCm/clr commit: 324849dd2c]
2019-07-26 19:08:11 +03:00
Evgeny Mankov ab1de116f2 [HIPIFY][SPARSE][doc] Populate cuSPARSE API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos


[ROCm/clr commit: e145850f26]
2019-07-26 19:05:42 +03:00
Evgeny Mankov 55982bbd88 Merge pull request #1256 from emankov/cuDNN
[HIPIFY][FFT][doc] Populate cuFFT API doc with CUDA version field

[ROCm/clr commit: 575bc39c0d]
2019-07-25 19:34:50 +03:00
Evgeny Mankov 31a851036d [HIPIFY][FFT][doc] Populate cuFFT API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: 9547bd5ddb]
2019-07-25 19:32:50 +03:00
Evgeny Mankov 33ef3b1266 Merge pull request #1255 from emankov/cuDNN
[HIPIFY][BLAS][doc] Populate cuBlas API doc with CUDA version field

[ROCm/clr commit: 3f41ab38d2]
2019-07-25 18:50:25 +03:00
Evgeny Mankov 4b9727f8c5 [HIPIFY][BLAS][doc] Populate cuBlas API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: e61a9d60f0]
2019-07-25 18:49:23 +03:00
Evgeny Mankov 767651da1f Merge pull request #1253 from emankov/cuDNN
[HIPIFY][doc] Fix typo

[ROCm/clr commit: d4bf0fb309]
2019-07-24 21:11:02 +03:00
Evgeny Mankov a5f8444a26 [HIPIFY][doc] Fix typo
[ROCm/clr commit: fa0ef27994]
2019-07-24 21:10:14 +03:00
Evgeny Mankov 77a3544c2d Merge pull request #1252 from emankov/cuDNN
[HIPIFY][doc] Fix typos

[ROCm/clr commit: d458ee20ad]
2019-07-24 21:05:59 +03:00
Evgeny Mankov dba1863c97 [HIPIFY][doc] Fix typos
[ROCm/clr commit: 325ddef6b6]
2019-07-24 21:04:41 +03:00
Evgeny Mankov f2555a7bf3 Merge pull request #1251 from emankov/cuDNN
[HIPIFY][doc] Populate Driver API doc with CUDA version field

[ROCm/clr commit: 60b2c701d2]
2019-07-24 20:53:57 +03:00
Evgeny Mankov ff629321fb [HIPIFY][doc] Populate Driver API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: d1a0ac6990]
2019-07-24 20:52:42 +03:00
Aryan Salmanpour 8b0c37a9ce [hip][tests] add a unit test for using hipExtLaunchMultiKernelMultiDevice API (#1250)
[ROCm/clr commit: 571af6d85b]
2019-07-24 07:57:39 +00:00
Aaron Enye Shi 73af1ac4bd Add GFX908 specific changes to HIP (#1229)
* Add GFX908 specific for HIP

* Fix missing __halfTest in hipTestNativeHalf


[ROCm/clr commit: 8c82f9db77]
2019-07-24 07:51:17 +00:00
Maneesh Gupta a1e4200ee8 [dtests] Fix complex_loading_behavior.cpp build issues on nvcc path (#1242)
[ROCm/clr commit: 7feda764b6]
2019-07-24 07:49:39 +00:00
Aaron Enye Shi 99ca4c2483 Fix hipMemcpy-size test running out of Host Mem (#1224)
* Fix hipMemcpy-size test running out of Host Mem

The hipMemcpy-size uses a maxElem calculated from the total GPU mem /8. Then it will allocate 4 times that amount of host memory. This tests begins failing when there is not enough host memory, such as on systems with 32GB GPU mem, and 16GB RAM. This fixes the test if not enough host memory is available on the system.

* Add windows support to hipMemcpy-size fix

* avoid linking extra libs for windows

* HIPMemcpy-size Remove freeCPU including swap


[ROCm/clr commit: c56876cc19]
2019-07-24 07:49:20 +00:00