نمودار کامیت

191 کامیت‌ها

مولف SHA1 پیام تاریخ
Joseph Greathouse 44b16df84f Fix maxSharedMemoryPerMultiProcessor attribute (#1927)
The maxSharedMemoryPerMultiProcessor attribute is meant to describe
the number of bytes of shared memory (LDS space in AMD terminology)
in each SM (CU in AMD terminology). For instance, on AMD GPUs this
is often 64KB per CU, and some Nvidia GPUs it's 96KB per SM.

This shared memory is a different address space from the normal
global memory. However, the current HIP-HCC properties fill this
in with a size that matches the totalGlboalMem property. This gives
a drastically too-high calculation for the amount of LDS space that
each CU has -- tens of GBs vs. 10s of KBs.

This patch fixes this by pulling the maxSharedMemoryPerMultiProcessor
property from the HSA pool that describes how much workgroup-local
space is available on each CU. The HSA runtime eventually pulls
this from the topology information about LDSSizeInKB, defined as
"Size of Local Data Store in Kilobytes per SIMD".

Previously, this HSA query was used to fill in the value of the
sharedMemPerBlock property. On today's AMD GPUs, we know that
the amount of LDS avaialble to the workgroup is identical to the
amount of LDS space in the CU. However, in the future this may
differ. As such, this patch changes around the order and fills
in the "PerMultiProcessor" property from the HSA query (since
what's what the query is defined to return), and then separately
fills in the "PerBlock" property as we know it.
2020-03-17 14:00:51 +05:30
Rahul Garg 6be7537cf9 Remove deprecated HIP markers (#1876) 2020-02-28 16:47:15 +05:30
Rahul Garg 893ccc4ddf [sample] Add hipDispatchEnqueueRateMT (#1869)
* [sample] Add hipDispatchEnqueueRateMT
2020-02-13 23:21:40 -08:00
Maneesh Gupta d032637934 Revert "Match Occupancy APIs syntax with CUDA (#1625)" (#1857)
Reverting this for now till we figure out how to avoid the build
breakage.

This reverts commit 262ad13dd5.
2020-02-10 10:45:28 +05:30
Rahul Garg 27fbfd9d66 [docs] Fix links in cookbook samples (#1824) 2020-02-04 08:56:31 +05:30
satyanveshd 262ad13dd5 Match Occupancy APIs syntax with CUDA (#1625)
* Match Occupancy APIs syntax with CUDA and fix tests using these APIs
2020-01-29 13:05:53 -08:00
Rahul Garg 8b8e4a66b4 Simplify hipDispatchLatency sample (#1793)
* Use hipExtLaunchKernelGGL in dispatchlatency sample

* Let it run on NVCC path too

* Refactoring

* Add test_kernel source

* Remove ResultDB

* Remove error checks
2020-01-22 14:37:47 -08:00
Evgeny Mankov 4aaa2336a8 [HIP] Clean-up deprecated HIP error codes
hipErrorMemoryAllocation -> hipErrorOutOfMemory
hipErrorInitializationError -> hipErrorNotInitialized
hipErrorMapBufferObjectFailed -> hipErrorMapFailed
hipErrorInvalidResourceHandle -> hipErrorInvalidHandle
2019-12-23 17:01:35 +03:00
Rahul Garg dfee3ae279 Rename hip/hip_hcc.h to hip/hip_ext.h (#1341)
* Rename hip/hip_hcc.h to hip/hip_ext.h

* Deprecate hip_hcc.h
2019-11-07 13:17:10 +05:30
Rahul Garg aeb7cebbad Merge pull request #1515 from ansurya/tex_unbind_issue_fix
Fix undefined ref to hipUnbindTexture for texture types
2019-10-30 17:54:15 -07:00
Rahul Garg 27221bc823 Revert "Fix occupany APIs (#1560)"
This reverts commit 6c5fbf9b4a.
2019-10-29 11:41:08 -07:00
Anusha Godavarthy Surya ae838f8cee merge from master 2019-10-25 15:52:09 +05:30
Rahul Garg 170c4f0270 Add HIP checks in texture driver sample (#1581) 2019-10-24 17:45:51 +05:30
satyanveshd 6c5fbf9b4a Fix occupany APIs (#1560)
Addresses SWDEV-205006
2019-10-24 17:44:47 +05:30
kjayapra-amd 9d571e3c9e Use the correct return type in runTest in 11_texture_driver sample. (#1546)
Fixes SWDEV-203394.
Currently in runTest() returns true, even if the texture reference copy does not happen. Using the existing testResult Flag to return from runTest().
2019-10-16 10:52:15 +05:30
Anusha Godavarthy Surya 20cfe17805 Added unbind texture to samples and tests 2019-10-15 11:34:15 +05:30
Yaxun (Sam) Liu 793b18c0b1 Fix sample 11_texture_driver for hip-clang (#1370) 2019-08-29 01:03:20 +00:00
satyanveshd d2df21e58c [sample] add new cookbook sample - occupancy (#1352)
* occupancy.cpp with Makefile

* occupancy sample changes according tothe comments

* Changes according to the review comments

* Occupancy Sample Changes

* Changes according to review comments
2019-08-29 01:01:49 +00:00
Sarbojit2019 919bb6c74d Added missing device prop fields into hipInfo sample (#1357)
* Added prop.integrated into hipInfo sample

* Added missing deviceProp fileds in hipInfo
2019-08-23 09:21:47 +00:00
chrispaquot d0b11deb7f Set device before processing each one (#1358) 2019-08-23 09:19:50 +00:00
Rahul Garg 900590967d Remove -g from hipBusBandwidth makefile (#1351) 2019-08-21 10:01:31 +00:00
Rahul Garg 8b6317d041 Add support for hipFuncGetAttribute (#1279)
* Add support for hipFunGetAttribute

* Support NVCC path

* Test using sample module_api_global

* Try fixing CI build failure due to hip_prof_gen scan

* Fix for CI build issue

* Resolve conflict

* Rebase and resolve conflicts with master

* Fix build error

* Fix NVCC path build error
2019-08-08 08:27:41 +00:00
ansurya 12bf6dbcda HIPCommander code cleanup (#1207)
* HIPCommander code cleanup

* Removed non-used headerfiles

* Removed empty ifdef
2019-07-18 03:18:15 +00:00
Evgeny Mankov f0832fd968 [HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR
[Reason] To be compatible with CUDA [#1133]

Update HIP code, hipify-clang, tests and docs

[TODO] Add support of the corresponding functions on nvcc fallback path
2019-07-11 14:58:16 +03:00
Aryan Salmanpour b1d8736f4a [hip][samples] fix a typo in module_api/defaultDriver sample (#1210) 2019-07-10 03:05:19 +00:00
Maneesh Gupta 3541d18528 Fix sample module_api_global for hip-clang (#1201)
module_api_global relies on a HCC only feature which allows host code
to write to device variables. This feature does not exist in CUDA
or hip-clang, which causes the sample not working in CUDA or hip-clang.

This patch fixes the sample by using standard features of CUDA and
hip-clang. The fixed sample works in HCC, CUDA and hip-clang.
2019-07-03 08:52:19 +00:00
Jatin Chaudhary 3cb87cdc46 Adding new unroll example (#1187) 2019-07-03 08:51:19 +00:00
Yaxun Sam Liu 98648828c0 Fix sample module_api_global for hip-clang
module_api_global relies on a HCC only feature which allows host code
to write to device variables. This feature does not exist in CUDA
or hip-clang, which causes the sample not working in CUDA or hip-clang.

This patch fixes the sample by using standard features of CUDA and
hip-clang. The fixed sample works in HCC, CUDA and hip-clang.
2019-07-02 16:36:53 -04:00
Maneesh Gupta 439b6e41d4 Merge pull request #1164 from gargrahul/cleanup_module_api_samples
Add cleanup code in module api samples
2019-06-20 05:57:46 +05:30
Rahul Garg 49ac54abde Add cleanup code in module api samples 2019-06-07 05:23:11 +05:30
Rahul Garg 4e6ad95403 Fix sample to use kernelargs for launch 2019-06-07 05:17:15 +05:30
Evgeny Mankov 9cb3e9aa5e [HIP][HIPIFY] Make hipMemcpyParam2D coherent with cuMemcpy2D
+ Makes hip_Memcpy2D struct compatible with CUDA_MEMCPY2D struct
+ Add hipMemcpyParam2D support in nvcc fallback path
+ Update hipify-clang, tests and docs accordingly
2019-05-22 18:31:39 +03:00
Evgeny Mankov 7664560f14 [HIP][tests] Copyright notice update 2019-05-20 18:46:23 +03:00
Nick Curtis fb92feae0e Markdown fixes & Whitespace cleanup for samples (#1096)
* Fix multiline code blocks in README's

* Whitespace cleanup
2019-05-12 19:27:44 +05:30
Rahul Garg 46346343af Test hipModuleGetGlobal 2019-03-15 04:08:03 +05:30
Rahul Garg 263e82a67a Add extension for kernel concurrency on same stream 2019-03-06 12:55:39 +05:30
Maneesh Gupta 4eff6bd09a Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
Fix hipBusBW sample for P2P bidirectional test
2019-02-28 07:14:38 +05:30
Rahul Garg 828e62fe4f Fix hipBusBW sample for P2P bidirectional test 2019-02-28 00:56:07 +05:30
Rahul Garg 4fef69afdc Fix hipBusBW overflow with setting beats/iterations 2019-02-27 00:18:52 +05:30
Rahul Garg 3e2f946d3f Fix error codes in P2P sample test 2019-02-05 06:30:57 +05:30
Rahul Garg 7951cd03b2 Fixed HIP error check in P2P sample test 2019-01-23 01:10:07 +05:30
Yaxun Sam Liu de907674d6 Fix sample bit_extract for hip-clang 2018-11-12 16:50:47 -05:00
Alex Voicu 59533b84e1 Guard new includes. 2018-10-18 12:29:27 +01:00
Alex Voicu 602280eb8b Re-sync with upstream. 2018-10-18 12:27:03 +01:00
Maneesh Gupta e1fe095471 Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Alex Voicu f480a134f4 Update samples. 2018-10-13 23:28:22 +01:00
Rahul Garg 632b6f6b39 Remove adipose extn from launchKernelHcc sample 2018-09-12 16:41:24 +05:30
Rahul Garg b6c5d7d71b Clean up module api samples 2018-08-08 22:28:13 +05:30
Rahul Garg 9707e9f563 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Rahul Garg 2184e7f6da Removed hidden args and hipLaunchParm from HIP/HCC path 2018-03-16 22:50:25 +05:30