rocm-systems

Author	SHA1	Message	Date
Aaron En Ye Shi	6c98a12a9b	Merge "find_package(hip) changes for clang" into amd-master-next	2020-05-04 10:19:51 -04:00
Alex Xie	9136212b51	SWDEV-221166 - Detect support for large bar access through HIP runtime API Change-Id: Iaa9756c1b5e40c1ab5afb38e44a6699fa5f6c13f	2020-05-01 20:39:52 -04:00
agodavar	14b060bfd3	find_package(hip) changes for clang Change-Id: I9dc1be1dd5f9424e26ebcd2961207e1c4467550f	2020-05-01 20:48:32 +00:00
root	3c8b7f1e38	Merge master into amd-master-next Change-Id: I3fc1dc0c860d627053537581e75561e8a7efe327	2020-04-26 22:19:37 +00:00
Michael LIAO	2ab06b307d	[vdi] Fix texture reference sample. - The driver code should not re-define `tex` again as it's already defined in the kernel code. Eventually, the driver code should be as regular C++ code instad of HIP code. Change-Id: I8c7cab204b98990619d6e7109b990d7089ea9261	2020-04-23 23:41:56 -04:00
Maneesh Gupta	e46a4e3e48	Merge in the rocclr based hip runtime (#2032 ) * Merge master-next changes in master (include vdi development in master branch)	2020-04-23 09:12:06 -07:00
Michael LIAO	8ec28891de	[vdi] Refactor texture/surface reference support. Change-Id: I8014d82aae7139ef5f95e4b50c4fc6da200dbc9d	2020-04-21 11:56:48 -04:00
Tao Sang	6c0a08ba4f	Solve issues with hip-vdi runtime static lib 1.Combine libamdhip64_static_base.a and libamdvdi_static.a into libamdhip64_static.a. 2.Let hipcc use -use-staticlib to link libamdhip64_static.a. 3.Add some samples for static lib. 4.Fix compiling failure of code object. Change-Id: Ic8c95228eb139058da8b5d66ba8439486154ca6f	2020-04-20 15:37:02 -04:00
Tao Sang	af5a4ca38b	Revert "Solve issues with hip-vdi runtime static lib" This reverts commit `ed3b0eb391`. Reason for revert: It is causing dkms-no-npi-hipclang broken. It is top priority to maintain dkms-no-npi-hipclang build, otherwise we lose track of regression analysis. So revert the change for now and recommit it after fixing it. Change-Id: Ia5136e888baecb6148c6c18eedbf37066fcb1eaa	2020-04-17 10:13:08 -05:00
Tao Sang	ed3b0eb391	Solve issues with hip-vdi runtime static lib 1.Combine libamdhip64_static_base.a and libamdvdi_static.a into libamdhip64_static.a. 2.Let hipcc use -use-staticlib to link libamdhip64_static.a. 3.Add some samples for static lib. 4.Fix compiling failure of code object. Change-Id: Ia2333622a8d05639b90974c4c5d3d85654ba0138	2020-04-15 13:46:36 -04:00
Maneesh Gupta	22e9bcf424	Merge branch 'amd-master' into amd-master-next Change-Id: I3094c15008093f2072bcd38aca4ea90aeae2d97b	2020-04-09 06:31:00 -04:00
Evgeny Mankov	a93ae7cbed	[HIP][doc] Update docs due to moving of HIPIFY to a separate repo (#2001 )	2020-04-07 14:03:19 +05:30
Rahul Garg	177457e54c	Rename hipDrvOccupancy to hipModuleOccupancy and match CUDA syntax (#1943 )	2020-04-07 14:02:52 +05:30
Sarbojit2019	5144ee8c04	Fix few memory leaks in HIP (#1969 )	2020-03-27 14:08:30 +05:30
Joseph Greathouse	449e2c931b	Fix maxSharedMemoryPerMultiProcessor attribute (#1927 ) The maxSharedMemoryPerMultiProcessor attribute is meant to describe the number of bytes of shared memory (LDS space in AMD terminology) in each SM (CU in AMD terminology). For instance, on AMD GPUs this is often 64KB per CU, and some Nvidia GPUs it's 96KB per SM. This shared memory is a different address space from the normal global memory. However, the current HIP-HCC properties fill this in with a size that matches the totalGlboalMem property. This gives a drastically too-high calculation for the amount of LDS space that each CU has -- tens of GBs vs. 10s of KBs. This patch fixes this by pulling the maxSharedMemoryPerMultiProcessor property from the HSA pool that describes how much workgroup-local space is available on each CU. The HSA runtime eventually pulls this from the topology information about LDSSizeInKB, defined as "Size of Local Data Store in Kilobytes per SIMD". Previously, this HSA query was used to fill in the value of the sharedMemPerBlock property. On today's AMD GPUs, we know that the amount of LDS avaialble to the workgroup is identical to the amount of LDS space in the CU. However, in the future this may differ. As such, this patch changes around the order and fills in the "PerMultiProcessor" property from the HSA query (since what's what the query is defined to return), and then separately fills in the "PerBlock" property as we know it.	2020-03-17 14:00:51 +05:30
Lad, Aditya	5ddccf0af7	Merge branch 'master' into amd-master-next Conflicts: CMakeLists.txt tests/src/texture/simpleTexture2DLayered.cpp tests/src/texture/simpleTexture3D.cpp Change-Id: I4aa4754d391b5f37ddf15fa0bcfc84d9da020119	2020-03-06 14:10:44 -05:00
Rahul Garg	409b21017c	Remove deprecated HIP markers (#1876 )	2020-02-28 16:47:15 +05:30
Tao Sang	6eabd2e7dd	Temporarily comment out Hcc-specific APIs for CLang compiler Temporarily comment out Hcc-specific template functions hipExtLaunchKernelGGL and hipOccupancyMaxPotentialBlockSize for CLang compiler so that all test cases under hip/samples can be built successfully for Clang + Hip/Hcc runtime. Change-Id: Iafc761257be4a7b34eafa6759a01f369570cd6ce	2020-02-16 22:26:47 -05:00
Rahul Garg	f9d99f3f8e	[sample] Add hipDispatchEnqueueRateMT (#1869 ) * [sample] Add hipDispatchEnqueueRateMT	2020-02-13 23:21:40 -08:00
Maneesh Gupta	3e0af4b4a9	Revert "Match Occupancy APIs syntax with CUDA (#1625 )" (#1857 ) Reverting this for now till we figure out how to avoid the build breakage. This reverts commit `acb2ea8618`.	2020-02-10 10:45:28 +05:30
Rahul Garg	9a88f7ae56	[docs] Fix links in cookbook samples (#1824 )	2020-02-04 08:56:31 +05:30
satyanveshd	acb2ea8618	Match Occupancy APIs syntax with CUDA (#1625 ) * Match Occupancy APIs syntax with CUDA and fix tests using these APIs	2020-01-29 13:05:53 -08:00
Rahul Garg	c7280bec4d	Simplify hipDispatchLatency sample (#1793 ) * Use hipExtLaunchKernelGGL in dispatchlatency sample * Let it run on NVCC path too * Refactoring * Add test_kernel source * Remove ResultDB * Remove error checks	2020-01-22 14:37:47 -08:00
Evgeny Mankov	6a881a647c	[HIP] Clean-up deprecated HIP error codes hipErrorMemoryAllocation -> hipErrorOutOfMemory hipErrorInitializationError -> hipErrorNotInitialized hipErrorMapBufferObjectFailed -> hipErrorMapFailed hipErrorInvalidResourceHandle -> hipErrorInvalidHandle	2019-12-23 17:01:35 +03:00
Rahul Garg	67e27b3603	Rename hip/hip_hcc.h to hip/hip_ext.h (#1341 ) * Rename hip/hip_hcc.h to hip/hip_ext.h * Deprecate hip_hcc.h	2019-11-07 13:17:10 +05:30
Rahul Garg	bdeed6c207	Merge pull request #1515 from ansurya/tex_unbind_issue_fix Fix undefined ref to hipUnbindTexture for texture types	2019-10-30 17:54:15 -07:00
Rahul Garg	1aa9eab331	Revert "Fix occupany APIs (#1560 )" This reverts commit `033960c86e`.	2019-10-29 11:41:08 -07:00
Anusha Godavarthy Surya	7cfd70847d	merge from master	2019-10-25 15:52:09 +05:30
Rahul Garg	bee4958708	Add HIP checks in texture driver sample (#1581 )	2019-10-24 17:45:51 +05:30
satyanveshd	033960c86e	Fix occupany APIs (#1560 ) Addresses SWDEV-205006	2019-10-24 17:44:47 +05:30
kjayapra-amd	26c3b4f985	Use the correct return type in runTest in 11_texture_driver sample. (#1546 ) Fixes SWDEV-203394. Currently in runTest() returns true, even if the texture reference copy does not happen. Using the existing testResult Flag to return from runTest().	2019-10-16 10:52:15 +05:30
Anusha Godavarthy Surya	fc776424a0	Added unbind texture to samples and tests	2019-10-15 11:34:15 +05:30
Yaxun (Sam) Liu	d9d6374a63	Fix sample 11_texture_driver for hip-clang (#1370 )	2019-08-29 01:03:20 +00:00
satyanveshd	39de5ed699	[sample] add new cookbook sample - occupancy (#1352 ) * occupancy.cpp with Makefile * occupancy sample changes according tothe comments * Changes according to the review comments * Occupancy Sample Changes * Changes according to review comments	2019-08-29 01:01:49 +00:00
Sarbojit2019	bd535ca14b	Added missing device prop fields into hipInfo sample (#1357 ) * Added prop.integrated into hipInfo sample * Added missing deviceProp fileds in hipInfo	2019-08-23 09:21:47 +00:00
chrispaquot	cb0a3ca5bc	Set device before processing each one (#1358 )	2019-08-23 09:19:50 +00:00
Rahul Garg	0f47934640	Remove -g from hipBusBandwidth makefile (#1351 )	2019-08-21 10:01:31 +00:00
Rahul Garg	4ea06f3a28	Add support for hipFuncGetAttribute (#1279 ) * Add support for hipFunGetAttribute * Support NVCC path * Test using sample module_api_global * Try fixing CI build failure due to hip_prof_gen scan * Fix for CI build issue * Resolve conflict * Rebase and resolve conflicts with master * Fix build error * Fix NVCC path build error	2019-08-08 08:27:41 +00:00
ansurya	53aeee42f9	HIPCommander code cleanup (#1207 ) * HIPCommander code cleanup * Removed non-used headerfiles * Removed empty ifdef	2019-07-18 03:18:15 +00:00
Evgeny Mankov	7a0e0cb50e	[HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR [Reason] To be compatible with CUDA [#1133] Update HIP code, hipify-clang, tests and docs [TODO] Add support of the corresponding functions on nvcc fallback path	2019-07-11 14:58:16 +03:00
Aryan Salmanpour	69b15b5580	[hip][samples] fix a typo in module_api/defaultDriver sample (#1210 )	2019-07-10 03:05:19 +00:00
Maneesh Gupta	5cd72efff0	Fix sample module_api_global for hip-clang (#1201 ) module_api_global relies on a HCC only feature which allows host code to write to device variables. This feature does not exist in CUDA or hip-clang, which causes the sample not working in CUDA or hip-clang. This patch fixes the sample by using standard features of CUDA and hip-clang. The fixed sample works in HCC, CUDA and hip-clang.	2019-07-03 08:52:19 +00:00
Jatin Chaudhary	3b0faf950b	Adding new unroll example (#1187 )	2019-07-03 08:51:19 +00:00
Yaxun Sam Liu	688ce62b49	Fix sample module_api_global for hip-clang module_api_global relies on a HCC only feature which allows host code to write to device variables. This feature does not exist in CUDA or hip-clang, which causes the sample not working in CUDA or hip-clang. This patch fixes the sample by using standard features of CUDA and hip-clang. The fixed sample works in HCC, CUDA and hip-clang.	2019-07-02 16:36:53 -04:00
Maneesh Gupta	0b7762df01	Merge pull request #1164 from gargrahul/cleanup_module_api_samples Add cleanup code in module api samples	2019-06-20 05:57:46 +05:30
Rahul Garg	2dfc912094	Add cleanup code in module api samples	2019-06-07 05:23:11 +05:30
Rahul Garg	2b56ec4344	Fix sample to use kernelargs for launch	2019-06-07 05:17:15 +05:30
Evgeny Mankov	66c182946e	[HIP][HIPIFY] Make hipMemcpyParam2D coherent with cuMemcpy2D + Makes hip_Memcpy2D struct compatible with CUDA_MEMCPY2D struct + Add hipMemcpyParam2D support in nvcc fallback path + Update hipify-clang, tests and docs accordingly	2019-05-22 18:31:39 +03:00
Evgeny Mankov	f5f240744a	[HIP][tests] Copyright notice update	2019-05-20 18:46:23 +03:00
Nick Curtis	7391b26fac	Markdown fixes & Whitespace cleanup for samples (#1096 ) * Fix multiline code blocks in README's * Whitespace cleanup	2019-05-12 19:27:44 +05:30

1 2 3 4 5

209 Commits