rocm-systems

Autor(a)	SHA1	Mensagem	Data
Sarbojit2019	55fa6a83c6	Enabled gcc for hip host code (#1214 ) * Enabled gcc for hip host code * Adding tests for hip code + (gcc & g++), without kernels * Excluding nvcc platforms for gcc and g++ tests + Addressing review comments * minor code clean-up * Add rocm include path * Added relative path for library * Hiding non supported functions for gcc * Incorporating review comments [ROCm/clr commit: `f23c1a1499`]	2019-08-05 09:51:36 +00:00
Jeff Daily	68f674205e	consolidate thread local storage (#915 ) * all thread local access now through single struct * clean up old commented-out code, more use of GET_TLS() * fewer calls to GET_TLS by passing tls as a funtion argument * revert unnecessary change to printf * fix failing tests due to TLS change * fix merge conflicts in ihipOccupancyMaxActiveBlocksPerMultiprocessor [ROCm/clr commit: `f337ae1edb`]	2019-08-05 09:51:02 +00:00
Evgeny Mankov	e278fbf2f9	[HIPIFY][fix][#211 ] Taking into account include guard controlling macro ...while including HIP main header file, which is inserted now after #indef controlling macro, or after #pragma once, if it's occurred earlier. + Add a couple of unit tests. ToDo: Check backward compatibility on older clang versions. [ROCm/clr commit: `fedef02c37`]	2019-08-02 16:46:45 +03:00
Maneesh Gupta	f36dced5d7	Merge pull request #1278 from gargrahul/fix_hipfuncGetAttribute_logstatus Fix missing logstatus in hipFuncGetAttributes [ROCm/clr commit: `a489877153`]	2019-08-02 10:00:38 +00:00
wkwchau	7b0f478767	Added CooperativeLaunch and CooperativeMultiDeviceLaunch flag and property for hipDeviceGetAttribute() and hipGetDeviceProperties() (#1247 ) [ROCm/clr commit: `ed04e96e2d`]	2019-08-02 10:00:25 +00:00
Rahul Garg	b064d7cab2	Fix missing logstatus in hipFuncGetAttributes [ROCm/clr commit: `20e9aba94e`]	2019-08-02 11:51:34 +05:30
wkwchau	75f1bb21b3	Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttribu… (#1238 ) * Added query of hipDeviceAttributeHdpMemFlushCntl and hipDeviceAttributeHdpRegFlushCntl * Added NVCC blocker for the hip*FlushCntl test cases [ROCm/clr commit: `abe6776677`]	2019-08-01 16:03:35 +00:00
Maneesh Gupta	98669d1867	Merge pull request #1277 from mangupta/nvcc_devprop [nvcc] Populate missing fields in hipGetDeviceProperties [ROCm/clr commit: `d5a3202a47`]	2019-08-01 08:59:58 +00:00
Maneesh Gupta	969eddc258	Merge pull request #1276 from vsytch/SWDEV-197675 [hip][tests] Don't use a hardcoded warp size, since it can be dynamically changed.… [ROCm/clr commit: `6ee5fcc07c`]	2019-08-01 08:59:43 +00:00
Maneesh Gupta	b458d72079	Merge pull request #1275 from yxsamliu/fix-std Fix -std=c++14 for windows [ROCm/clr commit: `d87e243c51`]	2019-08-01 08:59:27 +00:00
Maneesh Gupta	60210d987c	Merge pull request #1243 from jeffdaily/master-stream-lock-fix remove stream locks where it is safe to do so [ROCm/clr commit: `3de3f57468`]	2019-08-01 08:59:13 +00:00
wkwchau	a19b4fbd8b	Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOc… (#1240 ) * Added support of hipOccupancyMaxActiveBlocksPerMultiprocessor & hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags APIs * Taking into account of SGPR usage to determine the max active blocks in hipOccupancyMaxActiveBlocksPerMultiprocessor() [ROCm/clr commit: `7b9801fe9a`]	2019-08-01 08:58:48 +00:00
Maneesh Gupta	d5f73d22aa	[nvcc] Populate missing fields in hipGetDeviceProperties Change-Id: Ie90e02674d503e385f144f1ead3d53ff7b49cecc [ROCm/clr commit: `b24a4000f8`]	2019-08-01 13:16:39 +05:30
Vladislav Sytchenko	5b201fe854	Don't use a hardcoded warp size, since it can be dynamically changed. Query it from the runtime instead. [ROCm/clr commit: `9a1835ddc3`]	2019-07-31 17:04:31 -04:00
Yaxun (Sam) Liu	05b71135e9	Fix -std=c++14 for windows [ROCm/clr commit: `c1dc675e3d`]	2019-07-31 16:36:47 -04:00
Evgeny Mankov	f99c7e2bb6	Merge pull request #1274 from emankov/cuDNN [HIP][doc] Populate CUDA Runtime API doc with CUDA version field [ROCm/clr commit: `a6dcaf4bcd`]	2019-07-31 23:01:32 +03:00
Evgeny Mankov	d261b0593a	[HIP][doc] Populate CUDA Runtime API doc with CUDA version field + CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. + Fix typos, add missing references. [ROCm/clr commit: `b149219167`]	2019-07-31 22:59:05 +03:00
Maneesh Gupta	b297398add	Merge pull request #1269 from gargrahul/fix_ptr_attr_unkonwn_to_invalid hipPointerGetAttributes- Change hipErrorUnknown to hipErrorInvalidValue [ROCm/clr commit: `e0397d3d1f`]	2019-07-31 15:43:06 +00:00
Maneesh Gupta	f7e8d957bd	Merge pull request #1265 from gargrahul/fix_hip_porting_guide_texture_ref_use [docs]Fix texture reference APIs usage part [ROCm/clr commit: `56d41344c6`]	2019-07-31 15:42:54 +00:00
Rahul Garg	f9eaac9561	Add hip init in hipExtLaunchMultiKernelMultiDevice (#1263 ) * Add hip init in hipExtLaunchMultiKernelMultiDevice * Add more logstatus for multiple return paths * Fix missing i in function name [ROCm/clr commit: `8df47255c5`]	2019-07-31 15:42:29 +00:00
Rahul Garg	3b5dac1d9d	Add HIP init in hipFuncGetAttributes (#1262 ) * Add HIP init in hipFuncGetAttributes * [dtest]Remove explicit hip init call in hipFuncGetAttributes dtest [ROCm/clr commit: `c610159b85`]	2019-07-31 15:42:08 +00:00
Maneesh Gupta	0e56fee8e8	Merge pull request #1270 from mangupta/ci_stablity [ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now [ROCm/clr commit: `25a90ee0d6`]	2019-07-31 05:03:25 +00:00
ansurya	733178f492	Testcase to validate signed/unsigned char,short as normalized float (#1267 ) * Testcase to validate signed/unsigned char,short as normalized float * corrected test_common.cpp file path [ROCm/clr commit: `440c5f1677`]	2019-07-31 05:02:35 +00:00
ansurya	d46e575ec1	Add HSA_PATH to hip_Includes in cmake and hipconfig (#1260 ) * Add HSA_PATH to hip_Includes in cmake and hipconfig * HSA_PATH to CACHE path,checks for HSA include path * Removed new lines at EOF [ROCm/clr commit: `53b5c917cc`]	2019-07-31 05:02:20 +00:00
Maneesh Gupta	f32d31b79e	[ci] Disable flaky hipMemoryAllocateCoherentDriver on CI for now Change-Id: Ib90dd390ed71d0b3867e5dc36a41988cc4d42a99 [ROCm/clr commit: `08062ca607`]	2019-07-31 09:35:43 +05:30
Rahul Garg	7a21d085ad	Change hipErrorUnknown to hipErrorInvalidValue [ROCm/clr commit: `1c49943ac3`]	2019-07-31 00:28:30 +05:30
Evgeny Mankov	ae01ed798f	[HIPIFY][DNN][doc] Populate cuDNN API doc with CUDA version field + CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. + Fix typos. [ROCm/clr commit: `98ce4725fd`]	2019-07-30 20:53:57 +03:00
Rahul Garg	8245cb797e	[docs]Fix texture reference APIs usage part [ROCm/clr commit: `ebdc3a9cb3`]	2019-07-30 02:56:47 +05:30
Evgeny Mankov	c649564657	[HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 2 of 2) + Add undocumented but presented in cusparse.h functions since CUDA 10.1 Update 1 [ROCm/clr commit: `ec755e0005`]	2019-07-29 21:12:35 +03:00
Evgeny Mankov	8c8c8ca153	[HIPIFY][SPARSE] Sync cuSPARSE 10.1 - HIP - HIPIFY (Step 1 of 2) [ROCm/clr commit: `ea02797cc7`]	2019-07-26 21:34:36 +03:00
Evgeny Mankov	ab1de116f2	[HIPIFY][SPARSE][doc] Populate cuSPARSE API doc with CUDA version field + CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. + Fix typos [ROCm/clr commit: `e145850f26`]	2019-07-26 19:05:42 +03:00
Evgeny Mankov	31a851036d	[HIPIFY][FFT][doc] Populate cuFFT API doc with CUDA version field CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. [ROCm/clr commit: `9547bd5ddb`]	2019-07-25 19:32:50 +03:00
Evgeny Mankov	4b9727f8c5	[HIPIFY][BLAS][doc] Populate cuBlas API doc with CUDA version field CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. [ROCm/clr commit: `e61a9d60f0`]	2019-07-25 18:49:23 +03:00
Evgeny Mankov	a5f8444a26	[HIPIFY][doc] Fix typo [ROCm/clr commit: `fa0ef27994`]	2019-07-24 21:10:14 +03:00
Evgeny Mankov	dba1863c97	[HIPIFY][doc] Fix typos [ROCm/clr commit: `325ddef6b6`]	2019-07-24 21:04:41 +03:00
Evgeny Mankov	ff629321fb	[HIPIFY][doc] Populate Driver API doc with CUDA version field CUDA version - version in which API has appeared and (optional) last version before abandoning it; no value in case of earlier versions < 7.5. [ROCm/clr commit: `d1a0ac6990`]	2019-07-24 20:52:42 +03:00
Aryan Salmanpour	8b0c37a9ce	[hip][tests] add a unit test for using hipExtLaunchMultiKernelMultiDevice API (#1250 ) [ROCm/clr commit: `571af6d85b`]	2019-07-24 07:57:39 +00:00
Aaron Enye Shi	73af1ac4bd	Add GFX908 specific changes to HIP (#1229 ) * Add GFX908 specific for HIP * Fix missing __halfTest in hipTestNativeHalf [ROCm/clr commit: `8c82f9db77`]	2019-07-24 07:51:17 +00:00
Maneesh Gupta	a1e4200ee8	[dtests] Fix complex_loading_behavior.cpp build issues on nvcc path (#1242 ) [ROCm/clr commit: `7feda764b6`]	2019-07-24 07:49:39 +00:00
Aaron Enye Shi	99ca4c2483	Fix hipMemcpy-size test running out of Host Mem (#1224 ) * Fix hipMemcpy-size test running out of Host Mem The hipMemcpy-size uses a maxElem calculated from the total GPU mem /8. Then it will allocate 4 times that amount of host memory. This tests begins failing when there is not enough host memory, such as on systems with 32GB GPU mem, and 16GB RAM. This fixes the test if not enough host memory is available on the system. * Add windows support to hipMemcpy-size fix * avoid linking extra libs for windows * HIPMemcpy-size Remove freeCPU including swap [ROCm/clr commit: `c56876cc19`]	2019-07-24 07:49:20 +00:00
Evgeny Mankov	267c313862	[HIPIFY][doc] Update Readme.md + 8.0.1 - is the latest supported clang version on Linux + Testing update [ROCm/clr commit: `682f1301bd`]	2019-07-23 19:30:57 +03:00
Evgeny Mankov	493afd57d7	HIPIFY] Clang 8.0.1 support + Tested on Windows and Linux; + Provide patch for clang's bug 38811; + Update Readme.md accordingly. P.S. With the next 9.0.0 release patches for Windows won't be needed, cause all fixes will be there. [ROCm/clr commit: `deb4325372`]	2019-07-23 19:18:17 +03:00
Evgeny Mankov	24277696ab	[HIPIFY][#1239 ] Fix an assert [ROCm/clr commit: `6d61e02acb`]	2019-07-22 22:13:54 +03:00
Jeff Daily	df457ce2bd	remove stream locks where it is safe to do so [ROCm/clr commit: `734c4b92c3`]	2019-07-22 17:38:51 +00:00
Evgeny Mankov	1f4954ca79	[HIPIFY][#941 ][fix] Workaround for an assert on forbidden option "--" + Fixes the following assert in debug version: Assertion failed: (S.empty() \|\| S[0] != '-') && "Option can't start with '-", file C:\GIT\LLVM\trunk-for-submits\llvm-project\llvm\lib\Support\CommandLine.cpp, line 440 + DashDash option left declared in order to be listed in help. [ROCm/clr commit: `14aad50e07`]	2019-07-19 20:23:17 +03:00
Yaxun (Sam) Liu	16632b627c	Temporarily disable maxTexture1D in hipDeviceProp_t for VDI (#1237 ) [ROCm/clr commit: `13301398d8`]	2019-07-19 15:38:32 +00:00
gilbertlee-amd	71f87c218e	Fix Typo HipHostMemory to HipHostMalloc (#1236 ) [ROCm/clr commit: `54ad90f2fa`]	2019-07-19 04:46:07 +00:00
Maneesh Gupta	f0f831daa7	[hit] Workaround for %cc and %cxx mappings. (#1233 ) * [hit] Workaround for %cc and %cxx mappings. HIP CMakeLists.txt modifies CMAKE_C_COMPILER and CMAKE_CXX_COMPILER. This messes up any dtests that want to test against cc/c++. So hardcode %cc to /usr/bin/cc and %cxx to /usr/bin/c++ for now till we come up with a better solution. Change-Id: I7dce93ce8360191e612a94e3a735e5612ac27ab5 * [hit] Add auto-variable %hip-path to syntax for BUILD_CMD Change-Id: Id097a183fbce2b2c9691d0180d3304dd17a4e016 [ROCm/clr commit: `af9aae6b4e`]	2019-07-19 04:45:56 +00:00
cdevadas	35f0fb2916	Increased the number of implicit-kernarg bytes to 56 (#1217 ) [ROCm/clr commit: `fc0aca2a7d`]	2019-07-19 04:45:34 +00:00
ansurya	af2df395a0	[HIP][Tests] Added new testcases for Module API (#1150 ) * [HIP][tests] New testcases for module api * [HIP][Tests]Support for CUDA devices * Updated tests as per latest master & test GetGlobal to work on all platforms [ROCm/clr commit: `f566bec546`]	2019-07-19 04:45:20 +00:00

1 2 3 4 5 ...

3412 Cometimentos