German Andryeyev
7975a07112
SWDEV-332512 - Add ROC_SIGNAL_POOL_SIZE
...
Default value is 32 HSA signals in the pool.
Change-Id: Icb69413d3ff6ef228d9a9e22fd024e72c6d8ebe4
2022-04-14 17:32:00 -04:00
Chauncey Hui
ac4ad0ab73
SWDEV-2 - Change OpenCL version number from 3446 to 3447
2022-04-13 03:00:06 -04:00
Saleel Kudchadker
8eeaa998c0
SWDEV-301667 - Add cache state for a device
...
- Add a global cache state for a device to indicate scopes of submitted
AQL packets
- Remove scopes for TS marker if hipEventReleaseToDevice is passed. Set
env ROC_EVENT_NO_FLUSH=1 to use NOP AQL for event records.
By default, event records flush caches with a system-scope release.
- Calling finish() should check whether caches are flushed and, if not,
queue a marker
Change-Id: Ibbbdbb1cd7ac61cb35649169212142545be159e0
2022-04-12 12:27:31 -04:00
Chauncey Hui
857634241b
SWDEV-2 - Change OpenCL version number from 3445 to 3446
2022-04-12 03:00:05 -04:00
haoyuan2
1fbc01a812
SWDEV-328274 - Move DLLMain from VDI layer to HIP/OCL layers
...
Change-Id: Idc84eb0db92d21a5ced8769fa1eae064b86c31b0
2022-04-11 16:55:59 -04:00
Maxime Chambonnet
d45794e985
SWDEV-1 - ROC CLR typos
...
This is cherry-picked from this GitHub issue:
https://github.com/ROCm-Developer-Tools/ROCclr/issues/28
Change-Id: I236f4f25a2dabe05883159af0fab0bad06ab0fd0
2022-04-11 14:24:39 -04:00
German Andryeyev
abf088ea89
SWDEV-307184 - Hidden heap keyword was renamed
...
Change-Id: I0a72b0cc16bd3d637cbaa79a692640bc922f62c4
2022-04-11 09:26:14 -04:00
Chauncey Hui
66d404f707
SWDEV-2 - Change OpenCL version number from 3444 to 3445
2022-04-09 03:00:05 -04:00
German Andryeyev
2be0b1e612
SWDEV-307184 - Report 1 for unused dimensions
...
Remove assert for kernel arg size, because COv5 reports a value
bigger than the actual usage in most cases
Change-Id: I8e15bc45a9e21b58a5894f9977511ca84408ce61
2022-04-08 13:43:37 -04:00
Jeremy Newton
00efdc1cd6
SWDEV-323669 - Improve arch detection
...
- Clean up detection by using Visual Studio macros to detect arch; I
didn't list all possible ARM platforms (can be done later if desired)
- Fixed two incorrect uses of !defined(ATI_ARCH_ARM) to instead use
defined(ATI_ARCH_X86), as they contain X86 specific code
- Fixed one use of __ARM_ARCH_7A__ to use ATI_ARCH_ARM instead
This is an improvement to the fixes in the last patch for SWDEV-323669
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: I8568167293c34ad5331902105877f3ab6e25acb3
2022-04-08 12:21:58 -04:00
kjayapra-amd
b3b88ef926
SWDEV-331104 - Size passed to fillBuffer should not be 0.
...
Change-Id: Ifbc6047fafa0e55b5ab956cf3b7254c7e20b1e88
2022-04-08 09:29:55 -04:00
Chauncey Hui
f2e2984cb9
SWDEV-2 - Change OpenCL version number from 3443 to 3444
2022-04-08 03:00:04 -04:00
German Andryeyev
caa6110c29
SWDEV-307184 - Move local size calculation
...
With COv5 local size calculation must occur before
runtime programs kernel arguments
Change-Id: I0726c6529bde69b8fcf5360aa83986cf84e04168
2022-04-05 11:19:51 -04:00
German Andryeyev
e5a791217e
SWDEV-1 - Fix compilation error with PAL staging
...
Replace compile assert with an error message during execution
Change-Id: I4c054c65845b14bb980e165855693705c097ff80
2022-04-05 11:19:51 -04:00
Chauncey Hui
7630e701f6
SWDEV-2 - Change OpenCL version number from 3442 to 3443
2022-04-02 03:00:06 -04:00
kjayapra-amd
378a427d8c
SWDEV-325776 - Adding device release scope for kernel dispatch packet
...
Change-Id: I8ea763f4c0239c410143b748c05822e9f6694412
(cherry picked from commit ec4894f8a27a3330b895a0ded385ab96f5ef242d)
2022-04-01 08:17:29 -04:00
Chauncey Hui
258e981094
SWDEV-2 - Change OpenCL version number from 3441 to 3442
2022-04-01 03:00:06 -04:00
Christophe Paquot
867346520f
SWDEV-322620 - Virtual Memory Management Part 1
...
Adding opaque data handle to memory. This is used to look up the HIP object associated with it.
Change-Id: I1bbb14a915bed79c6c3593a29a627778c7aaf13a
2022-03-31 21:12:26 -04:00
kjayapra-amd
7fb80a027a
SWDEV-305527 - Changes to handle memset blit kernel that takes width, height and depth. This also fixes SWDEV-317261.
...
Change-Id: Ic85f63a95d9d8f48884fc8c7fd95cbb496dfbbca
2022-03-31 09:02:33 -04:00
Chauncey Hui
7a0e83c061
SWDEV-2 - Change OpenCL version number from 3440 to 3441
2022-03-31 03:00:06 -04:00
German Andryeyev
28597ec5b5
SWDEV-328670 - Enable arena for ROCr interops
...
Add ROCR memory detection and enable arena mem object for possible
access in HIP
Change-Id: Icf86ac789176bfee4ea8d36b0970a817d4c6a2f7
2022-03-30 16:46:36 -04:00
Chauncey Hui
3a8b5cb9c3
SWDEV-2 - Change OpenCL version number from 3439 to 3440
2022-03-30 03:00:04 -04:00
Saleel Kudchadker
61d0b999be
SWDEV-301947 - Report regular CU count for OpenCL
...
Change-Id: I3ea058bba98f3c6554cbde37173bbd772f489cf5
2022-03-29 16:19:14 -04:00
Satyanvesh Dittakavi
c1b95b09bf
SWDEV-326397 - P2P copies to take SDMA path if there is no pending dispatch
...
Change-Id: I50cfb8d77f7882151a20a1de7aaf5219b1695b7d
2022-03-29 14:59:11 +00:00
Chauncey Hui
37e6cbd983
SWDEV-2 - Change OpenCL version number from 3438 to 3439
2022-03-29 03:00:03 -04:00
Saleel Kudchadker
802c2c8a9f
SWDEV-322225 - Use numa_allocate_bitmask
...
- Fix a crash with AMD_CPU_AFFINITY=1, as numa_bitmask_alloc isn't the
right API to allocate a bitmask
- Do not set affinity for ROCr thread. It worsens performance rather
than improving it.
- Fix regression from my previous change for event handler.
Change-Id: I3ea75adc2a6333f29752283eddd5b555e9b58cc5
2022-03-26 13:24:51 -04:00
Chauncey Hui
f8a6099344
SWDEV-2 - Change OpenCL version number from 3437 to 3438
2022-03-26 03:00:06 -04:00
German Andryeyev
a02ae1b851
SWDEV-311271 - Add a key to control memory pool feature
...
Change-Id: Ibd929592b802e65d0e1a4fd9689050bce5059e98
2022-03-25 19:07:14 -04:00
Ajay
35877b1b13
SWDEV-301667 - return void in getTime(). Avoid warning treated as error
...
Change-Id: I9445eec554e6f705fb8f248e6be7ff995f163f25
2022-03-25 13:39:55 -04:00
Chauncey Hui
a43de338e3
SWDEV-2 - Change OpenCL version number from 3436 to 3437
2022-03-25 03:00:07 -04:00
Saleel Kudchadker
3c3c0ca4c5
SWDEV-301667 - Selectively queue handler
...
- Queue handler for hipEventRecord (aka marker_ts_) only if there is a
callback associated with it.
Change-Id: I8a9877ae0e342556053abbaacc9510744a8e772a
2022-03-24 19:46:28 -04:00
Saleel Kudchadker
9292abb2d8
SWDEV-328349 - Rate limit hostcall listener
...
Update timeout for hostcall wait for signal. If the timeout is small, it
checks frequently enough to affect performance for certain applications
which may be CPU bound.
Change-Id: I0a879559e4ad111b09a994a5b82a6faf6e4fea3f
2022-03-23 12:18:00 -04:00
Chauncey Hui
9085b482ad
SWDEV-2 - Change OpenCL version number from 3435 to 3436
2022-03-23 03:06:16 -04:00
German Andryeyev
3af3fe10de
SWDEV-307185 - Move memory allocation under device layer
...
It can be too early to allocate memory at the beginning of
Device::create() under PAL
Change-Id: I4bd76db7be3f6fb246243ea68022d8b0f860471d
2022-03-21 16:17:22 -04:00
Chauncey Hui
54771e2ed7
SWDEV-2 - Change OpenCL version number from 3434 to 3435
2022-03-18 03:00:08 -04:00
sdashmiz
6e506da36d
SWDEV-307107 - HipPrintf failure for XGMI on windows
...
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Iab2740364bb673029f0a80b419d80d5b726b9082
2022-03-17 10:03:56 -04:00
Chauncey Hui
c36b593657
SWDEV-2 - Change OpenCL version number from 3433 to 3434
2022-03-17 03:00:10 -04:00
Sarbojit Sarkar
3c2dc1f646
SWDEV-325708 - Query for FineGrained support
...
Change-Id: Idd20a71467595ab6577bf47c081c437a4b166988
2022-03-16 05:09:42 -04:00
Chauncey Hui
3b730facae
SWDEV-2 - Change OpenCL version number from 3432 to 3433
2022-03-16 03:00:06 -04:00
Jason Tang
7356d2ecb1
SWDEV-316665 - Use CMAKE_SIZEOF_VOID_P to detect 32bit build
...
Change-Id: If959b0d012caa55afbe54f0fe7da8ad3cca6d452
2022-03-15 10:56:26 -04:00
Chauncey Hui
c0ada20664
SWDEV-2 - Change OpenCL version number from 3431 to 3432
2022-03-15 03:00:06 -04:00
German Andryeyev
43ea9a8e31
SWDEV-286150 - Switch PAL version to 711
...
Change-Id: Iaf07946d6327f92ed734223a6b38c433ce368c4a
2022-03-14 13:00:40 -04:00
Chauncey Hui
e8501ecda9
SWDEV-2 - Change OpenCL version number from 3430 to 3431
2022-03-12 03:00:10 -05:00
Jeremy Newton
f2e5ef5617
SWDEV-323669 - Fix linux arch detection
...
CMake assumes we're bundling on x86, but for GNU compatible compilers,
we should rely on the compiler target to set the build arch.
For non-GNU compilers, just fall back to assuming x86 (no change).
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: Iee9794e6f7c3973c781ddaf740ded77f34712c4f
2022-03-11 09:53:07 -05:00
Chauncey Hui
c1858dc39e
SWDEV-2 - Change OpenCL version number from 3429 to 3430
2022-03-10 03:00:14 -05:00
Saleel Kudchadker
3bf1d5ac97
SWDEV-301667 - Remove guarantee
...
Remove guarantee from AddMemObj as one can call it multiple times for
different devices
Change-Id: I49dd76068b3c4c709f17541159052302dcdb374d
2022-03-08 16:20:10 -08:00
Chauncey Hui
ed42eda11d
SWDEV-2 - Change OpenCL version number from 3428 to 3429
2022-03-08 03:00:12 -05:00
German Andryeyev
7b114a2b8b
SWDEV-307185 - Create heap for device memory allocator
...
Pass the allocated heap with the kernel arguments
Change-Id: Icdec09b7f937845c39e21cbca7071dc3ba791af9
2022-03-04 00:44:41 -05:00
Chauncey Hui
a6bcb4435a
SWDEV-2 - Change OpenCL version number from 3427 to 3428
2022-03-02 03:00:06 -05:00
haoyuan2
58e1867079
SWDEV-298331 - Fix FreeLibrary throw exception issue on Windows
...
Release HIP resources at the DLL_PROCESS_DETACH point
Change-Id: I884ab028eeec1aaa2f655746f7eeae130fd009e4
2022-03-01 12:06:05 -05:00