Commit graph

3409 commits

Author SHA1 Message Date
Maneesh Gupta f36dced5d7 Merge pull request #1278 from gargrahul/fix_hipfuncGetAttribute_logstatus
Fix missing logstatus in hipFuncGetAttributes

[ROCm/clr commit: a489877153]
2019-08-02 10:00:38 +00:00
wkwchau 7b0f478767 Added CooperativeLaunch and CooperativeMultiDeviceLaunch flag and property for hipDeviceGetAttribute() and hipGetDeviceProperties() (#1247)
[ROCm/clr commit: ed04e96e2d]
2019-08-02 10:00:25 +00:00
Rahul Garg b064d7cab2 Fix missing logstatus in hipFuncGetAttributes
[ROCm/clr commit: 20e9aba94e]
2019-08-02 11:51:34 +05:30
wkwchau 75f1bb21b3 Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttribu… (#1238)
* Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttributeHdpRegFlushCntl

* Added NVCC blocker for the hip*FlushCntl test cases


[ROCm/clr commit: abe6776677]
2019-08-01 16:03:35 +00:00
Maneesh Gupta 98669d1867 Merge pull request #1277 from mangupta/nvcc_devprop
[nvcc] Populate missing fields in hipGetDeviceProperties

[ROCm/clr commit: d5a3202a47]
2019-08-01 08:59:58 +00:00
Maneesh Gupta 969eddc258 Merge pull request #1276 from vsytch/SWDEV-197675
[hip][tests] Don't use a hardcoded warp size, since it can be dynamically changed.…

[ROCm/clr commit: 6ee5fcc07c]
2019-08-01 08:59:43 +00:00
Maneesh Gupta b458d72079 Merge pull request #1275 from yxsamliu/fix-std
Fix -std=c++14 for windows

[ROCm/clr commit: d87e243c51]
2019-08-01 08:59:27 +00:00
Maneesh Gupta 60210d987c Merge pull request #1243 from jeffdaily/master-stream-lock-fix
remove stream locks where it is safe to do so

[ROCm/clr commit: 3de3f57468]
2019-08-01 08:59:13 +00:00
wkwchau a19b4fbd8b Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOc… (#1240)
* Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags APIs

* Take SGPR usage into account when determining the max active blocks in hipOccupancyMaxActiveBlocksPerMultiprocessor()


[ROCm/clr commit: 7b9801fe9a]
2019-08-01 08:58:48 +00:00
Maneesh Gupta d5f73d22aa [nvcc] Populate missing fields in hipGetDeviceProperties
Change-Id: Ie90e02674d503e385f144f1ead3d53ff7b49cecc


[ROCm/clr commit: b24a4000f8]
2019-08-01 13:16:39 +05:30
Vladislav Sytchenko 5b201fe854 Don't use a hardcoded warp size, since it can be dynamically changed. Query it from the runtime instead.
[ROCm/clr commit: 9a1835ddc3]
2019-07-31 17:04:31 -04:00
Yaxun (Sam) Liu 05b71135e9 Fix -std=c++14 for windows
[ROCm/clr commit: c1dc675e3d]
2019-07-31 16:36:47 -04:00
Evgeny Mankov f99c7e2bb6 Merge pull request #1274 from emankov/cuDNN
[HIP][doc] Populate CUDA Runtime API doc with CUDA version field

[ROCm/clr commit: a6dcaf4bcd]
2019-07-31 23:01:32 +03:00
Evgeny Mankov d261b0593a [HIP][doc] Populate CUDA Runtime API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos, add missing references.


[ROCm/clr commit: b149219167]
2019-07-31 22:59:05 +03:00
Maneesh Gupta b297398add Merge pull request #1269 from gargrahul/fix_ptr_attr_unkonwn_to_invalid
hipPointerGetAttributes- Change hipErrorUnknown to hipErrorInvalidValue

[ROCm/clr commit: e0397d3d1f]
2019-07-31 15:43:06 +00:00
Maneesh Gupta f7e8d957bd Merge pull request #1265 from gargrahul/fix_hip_porting_guide_texture_ref_use
[docs]Fix texture reference APIs usage part

[ROCm/clr commit: 56d41344c6]
2019-07-31 15:42:54 +00:00
Rahul Garg f9eaac9561 Add hip init in hipExtLaunchMultiKernelMultiDevice (#1263)
* Add hip init in hipExtLaunchMultiKernelMultiDevice

* Add more logstatus for multiple return paths

* Fix missing i in function name


[ROCm/clr commit: 8df47255c5]
2019-07-31 15:42:29 +00:00
Rahul Garg 3b5dac1d9d Add HIP init in hipFuncGetAttributes (#1262)
* Add HIP init in hipFuncGetAttributes

* [dtest]Remove explicit hip init call in hipFuncGetAttributes dtest


[ROCm/clr commit: c610159b85]
2019-07-31 15:42:08 +00:00
Maneesh Gupta 0e56fee8e8 Merge pull request #1270 from mangupta/ci_stablity
[ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now

[ROCm/clr commit: 25a90ee0d6]
2019-07-31 05:03:25 +00:00
ansurya 733178f492 Testcase to validate signed/unsigned char,short as normalized float (#1267)
* Testcase to validate signed/unsigned char,short as normalized float

* corrected test_common.cpp file path


[ROCm/clr commit: 440c5f1677]
2019-07-31 05:02:35 +00:00
ansurya d46e575ec1 Add HSA_PATH to hip_Includes in cmake and hipconfig (#1260)
* Add HSA_PATH to hip_Includes in cmake and hipconfig

* HSA_PATH to CACHE path,checks for HSA include path

* Removed new lines at EOF


[ROCm/clr commit: 53b5c917cc]
2019-07-31 05:02:20 +00:00
Maneesh Gupta f32d31b79e [ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now
Change-Id: Ib90dd390ed71d0b3867e5dc36a41988cc4d42a99


[ROCm/clr commit: 08062ca607]
2019-07-31 09:35:43 +05:30
Rahul Garg 7a21d085ad Change hipErrorUnknown to hipErrorInvalidValue
[ROCm/clr commit: 1c49943ac3]
2019-07-31 00:28:30 +05:30
Evgeny Mankov ae01ed798f [HIPIFY][DNN][doc] Populate cuDNN API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos.


[ROCm/clr commit: 98ce4725fd]
2019-07-30 20:53:57 +03:00
Rahul Garg 8245cb797e [docs]Fix texture reference APIs usage part
[ROCm/clr commit: ebdc3a9cb3]
2019-07-30 02:56:47 +05:30
Evgeny Mankov c649564657 [HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 2 of 2)
+ Add undocumented but presented in cusparse.h functions since CUDA 10.1 Update 1


[ROCm/clr commit: ec755e0005]
2019-07-29 21:12:35 +03:00
Evgeny Mankov 8c8c8ca153 [HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 1 of 2)
[ROCm/clr commit: ea02797cc7]
2019-07-26 21:34:36 +03:00
Evgeny Mankov ab1de116f2 [HIPIFY][SPARSE][doc] Populate cuSPARSE API doc with CUDA version field
+ CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.
+ Fix typos


[ROCm/clr commit: e145850f26]
2019-07-26 19:05:42 +03:00
Evgeny Mankov 31a851036d [HIPIFY][FFT][doc] Populate cuFFT API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: 9547bd5ddb]
2019-07-25 19:32:50 +03:00
Evgeny Mankov 4b9727f8c5 [HIPIFY][BLAS][doc] Populate cuBlas API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: e61a9d60f0]
2019-07-25 18:49:23 +03:00
Evgeny Mankov a5f8444a26 [HIPIFY][doc] Fix typo
[ROCm/clr commit: fa0ef27994]
2019-07-24 21:10:14 +03:00
Evgeny Mankov dba1863c97 [HIPIFY][doc] Fix typos
[ROCm/clr commit: 325ddef6b6]
2019-07-24 21:04:41 +03:00
Evgeny Mankov ff629321fb [HIPIFY][doc] Populate Driver API doc with CUDA version field
CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5.


[ROCm/clr commit: d1a0ac6990]
2019-07-24 20:52:42 +03:00
Aryan Salmanpour 8b0c37a9ce [hip][tests] add a unit test for using hipExtLaunchMultiKernelMultiDevice API (#1250)
[ROCm/clr commit: 571af6d85b]
2019-07-24 07:57:39 +00:00
Aaron Enye Shi 73af1ac4bd Add GFX908 specific changes to HIP (#1229)
* Add GFX908 specific for HIP

* Fix missing __halfTest in hipTestNativeHalf


[ROCm/clr commit: 8c82f9db77]
2019-07-24 07:51:17 +00:00
Maneesh Gupta a1e4200ee8 [dtests] Fix complex_loading_behavior.cpp build issues on nvcc path (#1242)
[ROCm/clr commit: 7feda764b6]
2019-07-24 07:49:39 +00:00
Aaron Enye Shi 99ca4c2483 Fix hipMemcpy-size test running out of Host Mem (#1224)
* Fix hipMemcpy-size test running out of Host Mem

The hipMemcpy-size test uses a maxElem calculated from the total GPU memory / 8, and then allocates 4 times that amount of host memory. The test begins failing when there is not enough host memory, such as on systems with 32GB of GPU memory and 16GB of RAM. This change fixes the test for the case where not enough host memory is available on the system.

* Add windows support to hipMemcpy-size fix

* avoid linking extra libs for windows

* HIPMemcpy-size Remove freeCPU including swap


[ROCm/clr commit: c56876cc19]
2019-07-24 07:49:20 +00:00
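
The sizing arithmetic described in the hipMemcpy-size commit message above can be illustrated with a short sketch. This is not the code from the commit: the function name `clamp_test_bytes` and the choice to clamp the size (rather than, say, skip the test) are assumptions made for illustration, and sizes are treated as bytes. Only the divisor 8 and the 4x host multiplier come from the commit message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (illustrative only, not taken from the HIP test).
// gpu_bytes:       total device memory in bytes
// host_free_bytes: host memory currently available in bytes
// Returns a per-buffer size such that 4x that size still fits in host memory.
std::uint64_t clamp_test_bytes(std::uint64_t gpu_bytes,
                               std::uint64_t host_free_bytes) {
    const std::uint64_t kGpuDivisor = 8;      // commit message: GPU mem / 8
    const std::uint64_t kHostMultiplier = 4;  // commit message: host uses 4x

    const std::uint64_t wanted = gpu_bytes / kGpuDivisor;
    const std::uint64_t host_limit = host_free_bytes / kHostMultiplier;
    const std::uint64_t result = wanted < host_limit ? wanted : host_limit;

    // Postcondition: the host-side allocations fit in the available memory.
    assert(result * kHostMultiplier <= host_free_bytes);
    return result;
}
```

With the figures from the commit message, a 32 GiB GPU gives a 4 GiB size, which would need 16 GiB on the host, that is, all the RAM of a 16 GiB machine. Under the clamping assumption, a host with 12 GiB available would instead yield a 3 GiB size.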
Evgeny Mankov 267c313862 [HIPIFY][doc] Update Readme.md
+ 8.0.1 - is the latest supported clang version on Linux
+ Testing update


[ROCm/clr commit: 682f1301bd]
2019-07-23 19:30:57 +03:00
Evgeny Mankov 493afd57d7 [HIPIFY] Clang 8.0.1 support
+ Tested on Windows and Linux;
+ Provide patch for clang's bug 38811;
+ Update Readme.md accordingly.

P.S.
With the next 9.0.0 release, patches for Windows won't be needed, because all fixes will be there.


[ROCm/clr commit: deb4325372]
2019-07-23 19:18:17 +03:00
Evgeny Mankov 24277696ab [HIPIFY][#1239] Fix an assert
[ROCm/clr commit: 6d61e02acb]
2019-07-22 22:13:54 +03:00
Jeff Daily df457ce2bd remove stream locks where it is safe to do so
[ROCm/clr commit: 734c4b92c3]
2019-07-22 17:38:51 +00:00
Evgeny Mankov 1f4954ca79 [HIPIFY][#941][fix] Workaround for an assert on forbidden option "--"
+ Fixes the following assert in debug version:
Assertion failed: (S.empty() || S[0] != '-') && "Option can't start with '-", file C:\GIT\LLVM\trunk-for-submits\llvm-project\llvm\lib\Support\CommandLine.cpp, line 440
+ DashDash option left declared in order to be listed in help.


[ROCm/clr commit: 14aad50e07]
2019-07-19 20:23:17 +03:00
Yaxun (Sam) Liu 16632b627c Temporarily disable maxTexture1D in hipDeviceProp_t for VDI (#1237)
[ROCm/clr commit: 13301398d8]
2019-07-19 15:38:32 +00:00
gilbertlee-amd 71f87c218e Fix Typo HipHostMemory to HipHostMalloc (#1236)
[ROCm/clr commit: 54ad90f2fa]
2019-07-19 04:46:07 +00:00
Maneesh Gupta f0f831daa7 [hit] Workaround for %cc and %cxx mappings. (#1233)
* [hit] Workaround for %cc and %cxx mappings.

HIP CMakeLists.txt modifies CMAKE_C_COMPILER and CMAKE_CXX_COMPILER.
This messes up any dtests that want to test against cc/c++.

So hardcode %cc to /usr/bin/cc and %cxx to /usr/bin/c++ for now till
we come up with a better solution.

Change-Id: I7dce93ce8360191e612a94e3a735e5612ac27ab5

* [hit] Add auto-variable %hip-path to syntax for BUILD_CMD

Change-Id: Id097a183fbce2b2c9691d0180d3304dd17a4e016


[ROCm/clr commit: af9aae6b4e]
2019-07-19 04:45:56 +00:00
cdevadas 35f0fb2916 Increased the number of implicit-kernarg bytes to 56 (#1217)
[ROCm/clr commit: fc0aca2a7d]
2019-07-19 04:45:34 +00:00
ansurya af2df395a0 [HIP][Tests] Added new testcases for Module API (#1150)
* [HIP][tests] New testcases for module api

* [HIP][Tests]Support for CUDA devices

* Updated tests as per latest master & test GetGlobal to work on all platforms


[ROCm/clr commit: f566bec546]
2019-07-19 04:45:20 +00:00
wkwchau d20537e595 Fixed bug in determining max block size in hipOccupancyMaxPotentialBlockSize (#1235)
[ROCm/clr commit: 6ec476e50a]
2019-07-18 03:19:29 +00:00
ansurya 8b8946f78b Add Max Texture 1D,2D,3D device properties (#1226)
* Add Max Texture 1D,2D,3D device properties

* Corrected testcase to use enums defined in hipDeviceAttribute_t

* Added texture 1D,2D and 3D support for NVIDIA path


[ROCm/clr commit: 00aa42e05f]
2019-07-18 03:18:50 +00:00
ansurya 264df316ba UChar and UShort textures as Normalized Float (#1213)
* UChar and UShort textures as Normalized Float

* UChar and UShort textures as Normalized Float for all float variants

* Handled uninitialized texture format value


[ROCm/clr commit: 849b5ef6af]
2019-07-18 03:18:28 +00:00