Commit graph

4484 Commits

Author SHA1 Message Date
German Andryeyev 7975a07112 SWDEV-332512 - Add ROC_SIGNAL_POOL_SIZE
Default value is 32 HSA signals in the pool.

Change-Id: Icb69413d3ff6ef228d9a9e22fd024e72c6d8ebe4
2022-04-14 17:32:00 -04:00
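
A minimal sketch of how an environment-controlled setting with a default could be read. The variable name ROC_SIGNAL_POOL_SIZE and the default of 32 come from the commit message above. The helper names `envOrDefault` and `signalPoolSize` are hypothetical and do not reflect how ROCclr actually parses its flags.

```cpp
#include <cstdlib>

// Hypothetical helper, not ROCclr's actual flag machinery: returns the value
// of an unsigned integer environment variable, or `fallback` when the
// variable is unset, empty, or not a plain decimal number.
unsigned long envOrDefault(const char* name, unsigned long fallback) {
  const char* raw = std::getenv(name);
  if (raw == nullptr || *raw == '\0') return fallback;
  char* end = nullptr;
  unsigned long value = std::strtoul(raw, &end, 10);
  // Reject trailing garbage such as "32abc".
  return (*end == '\0') ? value : fallback;
}

// 32 is the default pool size stated in the commit message.
unsigned long signalPoolSize() {
  return envOrDefault("ROC_SIGNAL_POOL_SIZE", 32);
}
```

Under this sketch, running a program with `ROC_SIGNAL_POOL_SIZE=64` in the environment would make `signalPoolSize()` return 64, and leaving the variable unset would give 32.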
Chauncey Hui ac4ad0ab73 SWDEV-2 - Change OpenCL version number from 3446 to 3447 2022-04-13 03:00:06 -04:00
Saleel Kudchadker 8eeaa998c0 SWDEV-301667 - Add cache state for a device
- Add a global cache state for a device to indicate scopes of submitted
AQL packets
- Remove scopes for TS marker if hipEventReleaseToDevice is passed. Set
env ROC_EVENT_NO_FLUSH=1 to use NOP AQL for event records.
By default, caches are flushed with system scope release.
- finish() should check whether caches are flushed and, if not, queue a
marker

Change-Id: Ibbbdbb1cd7ac61cb35649169212142545be159e0
2022-04-12 12:27:31 -04:00
Chauncey Hui 857634241b SWDEV-2 - Change OpenCL version number from 3445 to 3446 2022-04-12 03:00:05 -04:00
haoyuan2 1fbc01a812 SWDEV-328274 - Move DLLMain from VDI layer to HIP/OCL layers
Change-Id: Idc84eb0db92d21a5ced8769fa1eae064b86c31b0
2022-04-11 16:55:59 -04:00
Maxime Chambonnet d45794e985 SWDEV-1 - ROC CLR typos
This is cherry-picked from this github issue:
https://github.com/ROCm-Developer-Tools/ROCclr/issues/28

Change-Id: I236f4f25a2dabe05883159af0fab0bad06ab0fd0
2022-04-11 14:24:39 -04:00
German Andryeyev abf088ea89 SWDEV-307184 - Hidden heap keyword was renamed
Change-Id: I0a72b0cc16bd3d637cbaa79a692640bc922f62c4
2022-04-11 09:26:14 -04:00
Chauncey Hui 66d404f707 SWDEV-2 - Change OpenCL version number from 3444 to 3445 2022-04-09 03:00:05 -04:00
German Andryeyev 2be0b1e612 SWDEV-307184 - Report 1 for unused dimensions
Remove assert for kernel arg size, because COv5 reports a value
bigger than the actual usage in most cases

Change-Id: I8e15bc45a9e21b58a5894f9977511ca84408ce61
2022-04-08 13:43:37 -04:00
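
The "report 1 for unused dimensions" idea can be illustrated with a small sketch. It assumes the reported values are per-dimension sizes of a 1D, 2D, or 3D launch, which the commit message does not spell out. The helper `padDimensions` is hypothetical and is not taken from the ROCclr sources.

```cpp
#include <array>
#include <cstddef>

// Hypothetical helper: expands a 1D/2D/3D size list to three entries,
// reporting 1 (not 0) for every dimension the launch does not use.
std::array<std::size_t, 3> padDimensions(const std::size_t* sizes,
                                         std::size_t dims) {
  std::array<std::size_t, 3> out{1, 1, 1};
  for (std::size_t i = 0; i < dims && i < 3; ++i) {
    out[i] = sizes[i];
  }
  return out;
}
```

Using 1 as the filler keeps the product of the three entries equal to the product of the dimensions actually in use, whereas a filler of 0 would collapse that product to zero.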
Jeremy Newton 00efdc1cd6 SWDEV-323669 - Improve arch detection
- Clean up detection by using Visual Studio macros to detect arch; I
  didn't list all possible ARM platforms (can be done later if desired)
- Fixed two incorrect uses of !defined(ATI_ARCH_ARM) to instead use
  defined(ATI_ARCH_X86), as they contain X86 specific code
- Fixed one use of __ARM_ARCH_7A__ to use ATI_ARCH_ARM instead

This is an improvement to the fixes in the last patch for SWDEV-323669

Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: I8568167293c34ad5331902105877f3ab6e25acb3
2022-04-08 12:21:58 -04:00
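
A minimal sketch of macro-based architecture detection along the lines this commit describes. `ATI_ARCH_X86` and `ATI_ARCH_ARM` are the names used in the commit message. `_M_X64`, `_M_IX86`, `_M_ARM`, and `_M_ARM64` are standard MSVC predefined macros, and the GCC/Clang equivalents are added here only so the sketch compiles elsewhere. The actual ROCclr headers may be organized differently, and `archName` is a hypothetical helper.

```cpp
// Sketch only: maps compiler-provided macros to the ATI_ARCH_* names.
#if defined(_M_X64) || defined(_M_IX86) || defined(__x86_64__) || defined(__i386__)
#define ATI_ARCH_X86 1
#elif defined(_M_ARM) || defined(_M_ARM64) || defined(__arm__) || defined(__aarch64__)
#define ATI_ARCH_ARM 1
#endif

// Hypothetical helper that reports which branch was selected at compile time.
const char* archName() {
#if defined(ATI_ARCH_X86)
  return "x86";
#elif defined(ATI_ARCH_ARM)
  return "arm";
#else
  return "unknown";
#endif
}
```

Guarding x86-specific code with `defined(ATI_ARCH_X86)` instead of `!defined(ATI_ARCH_ARM)`, as the commit does, avoids compiling that code on any third architecture that is neither x86 nor ARM.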
kjayapra-amd b3b88ef926 SWDEV-331104 - Size passed to fillBuffer should not be 0.
Change-Id: Ifbc6047fafa0e55b5ab956cf3b7254c7e20b1e88
2022-04-08 09:29:55 -04:00
Chauncey Hui f2e2984cb9 SWDEV-2 - Change OpenCL version number from 3443 to 3444 2022-04-08 03:00:04 -04:00
German Andryeyev caa6110c29 SWDEV-307184 - Move local size calculation
With COv5 local size calculation must occur before
runtime programs kernel arguments

Change-Id: I0726c6529bde69b8fcf5360aa83986cf84e04168
2022-04-05 11:19:51 -04:00
German Andryeyev e5a791217e SWDEV-1 - Fix compilation error with PAL staging
Replace compile assert with an error message during execution

Change-Id: I4c054c65845b14bb980e165855693705c097ff80
2022-04-05 11:19:51 -04:00
Chauncey Hui 7630e701f6 SWDEV-2 - Change OpenCL version number from 3442 to 3443 2022-04-02 03:00:06 -04:00
kjayapra-amd 378a427d8c SWDEV-325776 - Adding device release scope for kernel dispatch packet
Change-Id: I8ea763f4c0239c410143b748c05822e9f6694412
(cherry picked from commit ec4894f8a27a3330b895a0ded385ab96f5ef242d)
2022-04-01 08:17:29 -04:00
Chauncey Hui 258e981094 SWDEV-2 - Change OpenCL version number from 3441 to 3442 2022-04-01 03:00:06 -04:00
Christophe Paquot 867346520f SWDEV-322620 - Virtual Memory Management Part 1
Adding opaque data handle to memory. This is used to look up the HIP object associated with it.

Change-Id: I1bbb14a915bed79c6c3593a29a627778c7aaf13a
2022-03-31 21:12:26 -04:00
kjayapra-amd 7fb80a027a SWDEV-305527 - Changes to handle memset blit kernel that takes width, height and depth. This also fixes SWDEV-317261.
Change-Id: Ic85f63a95d9d8f48884fc8c7fd95cbb496dfbbca
2022-03-31 09:02:33 -04:00
Chauncey Hui 7a0e83c061 SWDEV-2 - Change OpenCL version number from 3440 to 3441 2022-03-31 03:00:06 -04:00
German Andryeyev 28597ec5b5 SWDEV-328670 - Enable arena for ROCr interops
Add ROCR memory detection and enable arena mem object for possible
access in HIP

Change-Id: Icf86ac789176bfee4ea8d36b0970a817d4c6a2f7
2022-03-30 16:46:36 -04:00
Chauncey Hui 3a8b5cb9c3 SWDEV-2 - Change OpenCL version number from 3439 to 3440 2022-03-30 03:00:04 -04:00
Saleel Kudchadker 61d0b999be SWDEV-301947 - Report regular CU count for OpenCL
Change-Id: I3ea058bba98f3c6554cbde37173bbd772f489cf5
2022-03-29 16:19:14 -04:00
Satyanvesh Dittakavi c1b95b09bf SWDEV-326397 - P2P copies to take SDMA path if there is no pending dispatch
Change-Id: I50cfb8d77f7882151a20a1de7aaf5219b1695b7d
2022-03-29 14:59:11 +00:00
Chauncey Hui 37e6cbd983 SWDEV-2 - Change OpenCL version number from 3438 to 3439 2022-03-29 03:00:03 -04:00
Saleel Kudchadker 802c2c8a9f SWDEV-322225 - Use numa_allocate_bitmask
- Fix a crash with AMD_CPU_AFFINITY=1, as numa_bitmask_alloc isn't the
right API to allocate a bitmask
- Do not set affinity for the ROCr thread. It worsens performance rather
than improving it.
- Fix regression from my previous change for event handler.

Change-Id: I3ea75adc2a6333f29752283eddd5b555e9b58cc5
2022-03-26 13:24:51 -04:00
Chauncey Hui f8a6099344 SWDEV-2 - Change OpenCL version number from 3437 to 3438 2022-03-26 03:00:06 -04:00
German Andryeyev a02ae1b851 SWDEV-311271 - Add a key to control memory pool feature
Change-Id: Ibd929592b802e65d0e1a4fd9689050bce5059e98
2022-03-25 19:07:14 -04:00
Ajay 35877b1b13 SWDEV-301667 - return void in getTime(). Avoid warning treated as error
Change-Id: I9445eec554e6f705fb8f248e6be7ff995f163f25
2022-03-25 13:39:55 -04:00
Chauncey Hui a43de338e3 SWDEV-2 - Change OpenCL version number from 3436 to 3437 2022-03-25 03:00:07 -04:00
Saleel Kudchadker 3c3c0ca4c5 SWDEV-301667 - Selectively queue handler
- Queue handler for hipEventRecord (aka marker_ts_) only if there is a
callback associated with it.

Change-Id: I8a9877ae0e342556053abbaacc9510744a8e772a
2022-03-24 19:46:28 -04:00
Saleel Kudchadker 9292abb2d8 SWDEV-328349 - Rate limit hostcall listener
Update timeout for hostcall wait for signal. If the timeout is small, it
checks frequently enough to affect performance for certain applications
which may be CPU bound.

Change-Id: I0a879559e4ad111b09a994a5b82a6faf6e4fea3f
2022-03-23 12:18:00 -04:00
Chauncey Hui 9085b482ad SWDEV-2 - Change OpenCL version number from 3435 to 3436 2022-03-23 03:06:16 -04:00
German Andryeyev 3af3fe10de SWDEV-307185 - Move memory allocation under device layer
It can be too early to allocate memory at the beginning of
Device::create() under PAL

Change-Id: I4bd76db7be3f6fb246243ea68022d8b0f860471d
2022-03-21 16:17:22 -04:00
Chauncey Hui 54771e2ed7 SWDEV-2 - Change OpenCL version number from 3434 to 3435 2022-03-18 03:00:08 -04:00
sdashmiz 6e506da36d SWDEV-307107 - HipPrintf failure for XGMI on Windows
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Iab2740364bb673029f0a80b419d80d5b726b9082
2022-03-17 10:03:56 -04:00
Chauncey Hui c36b593657 SWDEV-2 - Change OpenCL version number from 3433 to 3434 2022-03-17 03:00:10 -04:00
Sarbojit Sarkar 3c2dc1f646 SWDEV-325708 - Query for FineGrained support
Change-Id: Idd20a71467595ab6577bf47c081c437a4b166988
2022-03-16 05:09:42 -04:00
Chauncey Hui 3b730facae SWDEV-2 - Change OpenCL version number from 3432 to 3433 2022-03-16 03:00:06 -04:00
Jason Tang 7356d2ecb1 SWDEV-316665 - Use CMAKE_SIZEOF_VOID_P to detect 32bit build
Change-Id: If959b0d012caa55afbe54f0fe7da8ad3cca6d452
2022-03-15 10:56:26 -04:00
Chauncey Hui c0ada20664 SWDEV-2 - Change OpenCL version number from 3431 to 3432 2022-03-15 03:00:06 -04:00
German Andryeyev 43ea9a8e31 SWDEV-286150 - Switch PAL version to 711
Change-Id: Iaf07946d6327f92ed734223a6b38c433ce368c4a
2022-03-14 13:00:40 -04:00
Chauncey Hui e8501ecda9 SWDEV-2 - Change OpenCL version number from 3430 to 3431 2022-03-12 03:00:10 -05:00
Jeremy Newton f2e5ef5617 SWDEV-323669 - Fix Linux arch detection
CMake assumes we're bundling on x86, but for GNU compatible compilers,
we should rely on the compiler target to set the build arch.

For non-GNU compilers, just fall back to assuming x86 (no change).

Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: Iee9794e6f7c3973c781ddaf740ded77f34712c4f
2022-03-11 09:53:07 -05:00
Chauncey Hui c1858dc39e SWDEV-2 - Change OpenCL version number from 3429 to 3430 2022-03-10 03:00:14 -05:00
Saleel Kudchadker 3bf1d5ac97 SWDEV-301667 - Remove guarantee
Remove guarantee from AddMemObj as one can call it multiple times for
different devices

Change-Id: I49dd76068b3c4c709f17541159052302dcdb374d
2022-03-08 16:20:10 -08:00
Chauncey Hui ed42eda11d SWDEV-2 - Change OpenCL version number from 3428 to 3429 2022-03-08 03:00:12 -05:00
German Andryeyev 7b114a2b8b SWDEV-307185 - Create heap for device memory allocator
Pass the allocated heap with the kernel arguments

Change-Id: Icdec09b7f937845c39e21cbca7071dc3ba791af9
2022-03-04 00:44:41 -05:00
Chauncey Hui a6bcb4435a SWDEV-2 - Change OpenCL version number from 3427 to 3428 2022-03-02 03:00:06 -05:00
haoyuan2 58e1867079 SWDEV-298331 - Fix FreeLibrary throw exception issue on Windows
Release HIP resources at the DLL_PROCESS_DETACH point

Change-Id: I884ab028eeec1aaa2f655746f7eeae130fd009e4
2022-03-01 12:06:05 -05:00