rocm-systems

Author	SHA1	Message	Date
Sarbojit2019	f1b028b93e	Fix few memory leaks in HIP (#1969 )	2020-03-27 14:08:30 +05:30
Joseph Greathouse	55e55e78bb	Fix maxSharedMemoryPerMultiProcessor attribute (#1927 ) The maxSharedMemoryPerMultiProcessor attribute is meant to describe the number of bytes of shared memory (LDS space in AMD terminology) in each SM (CU in AMD terminology). For instance, on AMD GPUs this is often 64KB per CU, and on some Nvidia GPUs it's 96KB per SM. This shared memory is a different address space from the normal global memory. However, the current HIP-HCC properties fill this in with a size that matches the totalGlobalMem property. This gives a drastically too-high calculation for the amount of LDS space that each CU has -- tens of GBs vs. tens of KBs. This patch fixes this by pulling the maxSharedMemoryPerMultiProcessor property from the HSA pool that describes how much workgroup-local space is available on each CU. The HSA runtime eventually pulls this from the topology information about LDSSizeInKB, defined as "Size of Local Data Store in Kilobytes per SIMD". Previously, this HSA query was used to fill in the value of the sharedMemPerBlock property. On today's AMD GPUs, we know that the amount of LDS available to the workgroup is identical to the amount of LDS space in the CU. However, in the future this may differ. As such, this patch changes around the order and fills in the "PerMultiProcessor" property from the HSA query (since that's what the query is defined to return), and then separately fills in the "PerBlock" property as we know it.	2020-03-17 14:00:51 +05:30
Rahul Garg	6c5fa32815	Remove deprecated HIP markers (#1876 )	2020-02-28 16:47:15 +05:30
Rahul Garg	9d97f91fbb	[sample] Add hipDispatchEnqueueRateMT (#1869 ) * [sample] Add hipDispatchEnqueueRateMT	2020-02-13 23:21:40 -08:00
Maneesh Gupta	f8e1c01900	Revert "Match Occupancy APIs syntax with CUDA (#1625 )" (#1857 ) Reverting this for now till we figure out how to avoid the build breakage. This reverts commit `fa98798b63`.	2020-02-10 10:45:28 +05:30
Rahul Garg	8324258e6e	[docs] Fix links in cookbook samples (#1824 )	2020-02-04 08:56:31 +05:30
satyanveshd	fa98798b63	Match Occupancy APIs syntax with CUDA (#1625 ) * Match Occupancy APIs syntax with CUDA and fix tests using these APIs	2020-01-29 13:05:53 -08:00
Rahul Garg	2eb7ae4f0c	Simplify hipDispatchLatency sample (#1793 ) * Use hipExtLaunchKernelGGL in dispatchlatency sample * Let it run on NVCC path too * Refactoring * Add test_kernel source * Remove ResultDB * Remove error checks	2020-01-22 14:37:47 -08:00
Evgeny Mankov	4921678b6c	[HIP] Clean-up deprecated HIP error codes hipErrorMemoryAllocation -> hipErrorOutOfMemory hipErrorInitializationError -> hipErrorNotInitialized hipErrorMapBufferObjectFailed -> hipErrorMapFailed hipErrorInvalidResourceHandle -> hipErrorInvalidHandle	2019-12-23 17:01:35 +03:00
Rahul Garg	579a4f36fa	Rename hip/hip_hcc.h to hip/hip_ext.h (#1341 ) * Rename hip/hip_hcc.h to hip/hip_ext.h * Deprecate hip_hcc.h	2019-11-07 13:17:10 +05:30
Rahul Garg	ba8105e0cd	Merge pull request #1515 from ansurya/tex_unbind_issue_fix Fix undefined ref to hipUnbindTexture for texture types	2019-10-30 17:54:15 -07:00
Rahul Garg	e4a1e44162	Revert "Fix occupany APIs (#1560 )" This reverts commit `af351d7e1b`.	2019-10-29 11:41:08 -07:00
Anusha Godavarthy Surya	5f47e99ffe	merge from master	2019-10-25 15:52:09 +05:30
Rahul Garg	04e10814d8	Add HIP checks in texture driver sample (#1581 )	2019-10-24 17:45:51 +05:30
satyanveshd	af351d7e1b	Fix occupany APIs (#1560 ) Addresses SWDEV-205006	2019-10-24 17:44:47 +05:30
kjayapra-amd	fbb98f4950	Use the correct return type in runTest in 11_texture_driver sample. (#1546 ) Fixes SWDEV-203394. Currently runTest() returns true even if the texture reference copy does not happen. Using the existing testResult flag to return from runTest().	2019-10-16 10:52:15 +05:30
Anusha Godavarthy Surya	d5b1e887e9	Added unbind texture to samples and tests	2019-10-15 11:34:15 +05:30
Yaxun (Sam) Liu	c404312ffe	Fix sample 11_texture_driver for hip-clang (#1370 )	2019-08-29 01:03:20 +00:00
satyanveshd	f807cc1a7b	[sample] add new cookbook sample - occupancy (#1352 ) * occupancy.cpp with Makefile * occupancy sample changes according to the comments * Changes according to the review comments * Occupancy Sample Changes * Changes according to review comments	2019-08-29 01:01:49 +00:00
Sarbojit2019	7257f23063	Added missing device prop fields into hipInfo sample (#1357 ) * Added prop.integrated into hipInfo sample * Added missing deviceProp fields in hipInfo	2019-08-23 09:21:47 +00:00
chrispaquot	344d150bf8	Set device before processing each one (#1358 )	2019-08-23 09:19:50 +00:00
Rahul Garg	b3a0d4266b	Remove -g from hipBusBandwidth makefile (#1351 )	2019-08-21 10:01:31 +00:00
Rahul Garg	6ce86f409d	Add support for hipFuncGetAttribute (#1279 ) * Add support for hipFuncGetAttribute * Support NVCC path * Test using sample module_api_global * Try fixing CI build failure due to hip_prof_gen scan * Fix for CI build issue * Resolve conflict * Rebase and resolve conflicts with master * Fix build error * Fix NVCC path build error	2019-08-08 08:27:41 +00:00
ansurya	84f8441146	HIPCommander code cleanup (#1207 ) * HIPCommander code cleanup * Removed non-used headerfiles * Removed empty ifdef	2019-07-18 03:18:15 +00:00
Evgeny Mankov	c7117df91b	[HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR [Reason] To be compatible with CUDA [#1133] Update HIP code, hipify-clang, tests and docs [TODO] Add support of the corresponding functions on nvcc fallback path	2019-07-11 14:58:16 +03:00
Aryan Salmanpour	f21601733e	[hip][samples] fix a typo in module_api/defaultDriver sample (#1210 )	2019-07-10 03:05:19 +00:00
Maneesh Gupta	502a734ebf	Fix sample module_api_global for hip-clang (#1201 ) module_api_global relies on an HCC-only feature which allows host code to write to device variables. This feature does not exist in CUDA or hip-clang, so the sample does not work in CUDA or hip-clang. This patch fixes the sample by using standard features of CUDA and hip-clang. The fixed sample works in HCC, CUDA and hip-clang.	2019-07-03 08:52:19 +00:00
Jatin Chaudhary	3152af0175	Adding new unroll example (#1187 )	2019-07-03 08:51:19 +00:00
Yaxun Sam Liu	60e1733afe	Fix sample module_api_global for hip-clang module_api_global relies on an HCC-only feature which allows host code to write to device variables. This feature does not exist in CUDA or hip-clang, so the sample does not work in CUDA or hip-clang. This patch fixes the sample by using standard features of CUDA and hip-clang. The fixed sample works in HCC, CUDA and hip-clang.	2019-07-02 16:36:53 -04:00
Maneesh Gupta	ba323cdef1	Merge pull request #1164 from gargrahul/cleanup_module_api_samples Add cleanup code in module api samples	2019-06-20 05:57:46 +05:30
Rahul Garg	f2b3526503	Add cleanup code in module api samples	2019-06-07 05:23:11 +05:30
Rahul Garg	7a3bda3c2f	Fix sample to use kernelargs for launch	2019-06-07 05:17:15 +05:30
Evgeny Mankov	8f059b0ee9	[HIP][HIPIFY] Make hipMemcpyParam2D coherent with cuMemcpy2D + Makes hip_Memcpy2D struct compatible with CUDA_MEMCPY2D struct + Add hipMemcpyParam2D support in nvcc fallback path + Update hipify-clang, tests and docs accordingly	2019-05-22 18:31:39 +03:00
Evgeny Mankov	c9af61e9d1	[HIP][tests] Copyright notice update	2019-05-20 18:46:23 +03:00
Nick Curtis	5257b54a39	Markdown fixes & Whitespace cleanup for samples (#1096 ) * Fix multiline code blocks in README's * Whitespace cleanup	2019-05-12 19:27:44 +05:30
Rahul Garg	34e675d0f6	Test hipModuleGetGlobal	2019-03-15 04:08:03 +05:30
Rahul Garg	59081c69fc	Add extension for kernel concurrency on same stream	2019-03-06 12:55:39 +05:30
Maneesh Gupta	b93666bdde	Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir Fix hipBusBW sample for P2P bidirectional test	2019-02-28 07:14:38 +05:30
Rahul Garg	70278f1777	Fix hipBusBW sample for P2P bidirectional test	2019-02-28 00:56:07 +05:30
Rahul Garg	673ecd02fe	Fix hipBusBW overflow with setting beats/iterations	2019-02-27 00:18:52 +05:30
Rahul Garg	e8cf9cba6b	Fix error codes in P2P sample test	2019-02-05 06:30:57 +05:30
Rahul Garg	1b8ddbbb85	Fixed HIP error check in P2P sample test	2019-01-23 01:10:07 +05:30
Yaxun Sam Liu	bc40ddabe0	Fix sample bit_extract for hip-clang	2018-11-12 16:50:47 -05:00
Alex Voicu	e66a75ea55	Guard new includes.	2018-10-18 12:29:27 +01:00
Alex Voicu	fe959f7bd7	Re-sync with upstream.	2018-10-18 12:27:03 +01:00
Maneesh Gupta	52e320f396	Replace hipLaunchKernel -> hipLaunchKernelGGL Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be	2018-10-17 14:32:25 +05:30
Alex Voicu	7839045d30	Update samples.	2018-10-13 23:28:22 +01:00
Rahul Garg	fb706902cc	Remove adipose extn from launchKernelHcc sample	2018-09-12 16:41:24 +05:30
Rahul Garg	8f0d684988	Clean up module api samples	2018-08-08 22:28:13 +05:30
Rahul Garg	dc4d305c25	Fix hipCommander Makefile	2018-05-16 15:01:32 +05:30

192 commits