Satyanvesh Dittakavi
b5799c4dbe
SWDEV-363536 - HIP_VISIBLE_DEVICES= should disable the device
...
Setting HIP_VISIBLE_DEVICES= (an empty value) should be treated as an invalid
device, which makes all the devices invisible to the app. This matches
the CUDA behavior.
Change-Id: I937ac4c0b7dacff776cdbe692d4576c81b86ee2d
2022-11-10 01:25:46 -05:00
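The visibility rule described in commit b5799c4dbe can be sketched roughly as follows. This is an illustration only, not the runtime's code: the function name `visible_devices` and its parameters are hypothetical, and the rule that parsing stops at the first invalid entry is an assumption modeled on how CUDA documents CUDA_VISIBLE_DEVICES.

```python
from typing import List, Mapping


def visible_devices(env: Mapping[str, str], device_count: int) -> List[int]:
    """Return the device indices an app may see (hypothetical helper)."""
    raw = env.get("HIP_VISIBLE_DEVICES")
    if raw is None:
        # Variable not set at all: every device stays visible.
        return list(range(device_count))

    visible: List[int] = []
    for token in raw.split(","):
        try:
            index = int(token)
        except ValueError:
            # Assumption: an unparsable entry (including the single empty
            # entry produced by "HIP_VISIBLE_DEVICES=") is an invalid device
            # and ends the list.
            break
        if not 0 <= index < device_count:
            # Assumption: an out-of-range index is also invalid.
            break
        visible.append(index)
    return visible
```

Under these assumptions, `visible_devices(os.environ, 4)` (after `import os`) would yield an empty list when the variable is set to an empty string, so the app sees no devices, while leaving the variable unset keeps all four devices visible.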
Maneesh Gupta
a52f5bda8f
SWDEV-299940 - Query global memory available on device with HSA attribute HSA_AMD_AGENT_INFO_MEMORY_AVAIL
...
This reverts commit 4afca0647e.
Reason for revert: ROCr query should now be usable in upcoming release.
Change-Id: I2207761ca6af5d585d090bae1af09eb9a8e9bad6
2022-11-09 10:48:47 -05:00
Chauncey Hui
0b4465efdb
SWDEV-2 - Change OpenCL version number from 3525 to 3526
2022-11-09 03:00:07 -05:00
Juan Manuel MARTINEZ CAAMAÑO
40f75306d5
SWDEV-286150 - [NFC] Refactor repeated option parsing code into function
...
Change-Id: I606dc1cd48d880974142e523d16f5d9ac6f3aff1
2022-11-08 10:29:13 -05:00
Juan Manuel MARTINEZ CAAMAÑO
f5e982cc07
SWDEV-286150 - [NFC] Avoid copying string when calling HashString
...
Change-Id: Ic2f4d469ca319f7d31814b3955430dbbfdad5bb2
2022-11-08 07:07:06 -05:00
Chauncey Hui
773d85e3d9
SWDEV-2 - Change OpenCL version number from 3524 to 3525
2022-11-08 03:00:07 -05:00
Julia Jiang
9d616ad0f4
SWDEV-360748 - Update return value for hipExtGetLinkTypeAndHopCount_Positive_Basic test on Windows
...
Change-Id: I0eb984da6e96c2409e1b1bbda61ae694c8bebe94
2022-11-07 10:00:32 -05:00
German
c1b22840fa
SWDEV-286150 - Switch PAL to 775 version
...
Change-Id: I8e64cbe0fea273d4bd6f7d3ebd66feb628fac6b9
2022-11-07 09:54:22 -05:00
Chauncey Hui
d0a57e4905
SWDEV-2 - Change OpenCL version number from 3523 to 3524
2022-11-05 03:00:05 -04:00
Saleel Kudchadker
feca11d5e3
SWDEV-301667 - Improve logging
...
Change-Id: Ifa6da876b85cb503967cf09aac6d477b10db8e63
2022-11-04 18:23:18 -04:00
Saleel Kudchadker
3e465c5ff8
SWDEV-364052 - Print free memory on queue callback
...
Change-Id: I7d15b6f6277326d5f3e9b784a3443f18ec6ba58a
2022-11-04 13:19:33 -04:00
Todd tiantuo Li
246429bd68
SWDEV-302996 - add gfx1103 Phoenix support - OCL runtime
...
Change-Id: Ib6b7efa085b734bda6b5b50c720fe9d93e6dbf95
2022-11-04 12:07:22 -04:00
Chauncey Hui
cfd9ced308
SWDEV-2 - Change OpenCL version number from 3522 to 3523
2022-11-04 03:00:06 -04:00
kjayapra-amd
ea0ce45ad3
SWDEV-329423 - Enable NV3x target ID related changes on mainline.
...
Change-Id: I54fafb371976aaa16a452ec65b0c340c5682c264
2022-11-03 21:11:24 +00:00
German
473621c008
SWDEV-365121 - Use CP DMA for tiny transfers
...
Sync between compute and SDMA engines can be very expensive under Windows.
Use CP DMA for tiny transfers (< 1KiB) to avoid syncs and improve performance.
Change-Id: I9db39a2199f7b9e337ed08fd36d9cbc150502f1f
2022-11-03 10:04:53 -04:00
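The size-based choice described in commit 473621c008 can be pictured with a minimal sketch. Only the 1 KiB threshold comes from the commit message; the names `TINY_TRANSFER_LIMIT` and `pick_copy_engine` are hypothetical, and the real runtime presumably weighs more factors than transfer size.

```python
# Threshold taken from the commit message: transfers below 1 KiB are "tiny".
TINY_TRANSFER_LIMIT = 1024


def pick_copy_engine(size_bytes: int) -> str:
    """Pick a copy engine for a transfer (hypothetical helper)."""
    if size_bytes < TINY_TRANSFER_LIMIT:
        # Tiny copy: stay on the compute queue via CP DMA, so no
        # compute <-> SDMA synchronization is needed.
        return "CP_DMA"
    # Larger copy: hand it to the SDMA engine.
    return "SDMA"
```

The point of the sketch is the trade-off the commit names: for tiny copies the cross-engine sync can cost more than the copy itself, so keeping them on the compute side avoids it.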
Tao Sang
06573ac92f
SWDEV-363140 - Support performance counter on gfx11
...
Change-Id: I2216f71f4d4db6dd3766333b0c821cb3d35d7851
2022-11-03 07:59:24 -04:00
Chauncey Hui
4b3ff92abd
SWDEV-2 - Change OpenCL version number from 3521 to 3522
2022-11-03 03:00:09 -04:00
Laurent Morichetti
b043b4f5a2
SWDEV-351980 - Store the correlation ID in the dispatch packet
...
The profiler can retrieve this correlation ID to attribute waves to
specific dispatch locations.
Change-Id: I700e8a91219d612f6a2028c0dda0c92753f3526a
2022-11-02 23:40:10 -04:00
Chauncey Hui
b070ba9cc3
SWDEV-2 - Change OpenCL version number from 3520 to 3521
2022-11-01 03:00:05 -04:00
jatang
166ef6b622
SWDEV-293816 - Only dispatch Counter Aql packet when it is created successfully.
...
Change-Id: Ic8a7374d19012754c6de5d483ec5d07a56661d1b
2022-10-31 15:11:41 -04:00
Saleel Kudchadker
cf1fd98074
SWDEV-301667 - Use queue base address for AQL log
...
Change-Id: Ie525991b341deeb93ab3cb2d77550e8e1d87ee20
2022-10-31 13:54:49 -04:00
German
ff6b4db70b
SWDEV-363074 - Clean-up sync between SDMA and compute
...
HIP can't rely on the resource tracking used in OCL and requires a different, explicit sync.
Make sure ROCclr syncs compute only when SDMA is used and vice versa.
The new logic allows CP DMA to be enabled without unnecessary waits.
Change-Id: Ib9d1788cfd5afa5ea2fec4c96a37d8b9c4d0059d
2022-10-31 10:02:01 -04:00
Chauncey Hui
3bce4df27d
SWDEV-2 - Change OpenCL version number from 3519 to 3520
2022-10-29 03:00:09 -04:00
German
dc4ad8c99c
SWDEV-363074 - Enable resource cache for SVM
...
Blender creates and destroys big allocations during the benchmark.
That causes big delays, because vidmm has to page-in/page-out memory.
Change-Id: I2baf4545807127406e3d2870a7581ff9ae7bcdb5
2022-10-27 16:39:29 -04:00
Chauncey Hui
cff74ae551
SWDEV-2 - Change OpenCL version number from 3518 to 3519
2022-10-25 03:00:08 -04:00
German
e223b0f678
SWDEV-352487 - Don't add notifications as the last command
...
Change-Id: Ifed34485839ef2c9491e8e8f6bb3569932160b1c
2022-10-24 09:39:03 -04:00
Chauncey Hui
a65ed69c1b
SWDEV-2 - Change OpenCL version number from 3517 to 3518
2022-10-22 03:00:06 -04:00
Ajay
261620a2ef
SWDEV-357207 - Linux: enable StreamWrite/Wait APIs in deviceAttribute
...
Change-Id: I593df2dbb7a0233dd42f8d873510d85bbb27feff
2022-10-21 12:44:35 -04:00
German
c1c5127875
SWDEV-360183 - Finish the trace if queue was destroyed
...
- Make sure the SQTT trace is captured for the RGP server if the queue is destroyed before normal capture is done.
- Remove the prepare queue from the logic. It's not really used for any HW capture and can cause the RGP server to abort if it is destroyed before capture has even started (delayed capture).
Change-Id: I6eb19963190a5769c6477a5496c1b831a6d59b89
2022-10-21 10:44:10 -04:00
Alex (Bin) Xie
06867f0c04
SWDEV-361591 - clean up header files
...
Change-Id: I25ca37a94524da0521f5b856a9fa9bab73f103c1
2022-10-21 10:23:55 -04:00
Chauncey Hui
904f3e3d75
SWDEV-2 - Change OpenCL version number from 3516 to 3517
2022-10-21 03:00:52 -04:00
Tao Sang
3d281114fb
SWDEV-360706 - Fix WriteAqlArgAt() crash in X86
...
On x86 Windows, sizeof(size_t) is 4, but the size is 8 for
amd::KernelParameterDescriptor::HiddenGlobalOffsetX/Y/Z items.
Loosen the condition to prevent the crash.
Change-Id: I2216f71f4d4fd6dd3766023b1c821cb3d35d7848
2022-10-20 18:04:49 -04:00
Sourabh Betigeri
b2665ccc4b
SWDEV-361730 - Fix to match the kernel arg allocation sizes between gpu visible mem and host/runtime mem
...
Change-Id: I6495aa57e97b46373c1d90cc230435780fdcf3ef
2022-10-20 14:20:37 -04:00
Ajay
a5b550e828
SWDEV-360944 - palBlit setArgument desc.size_ --> argSize
...
Fixes AMF tests that were impacted on Windows
Change-Id: I0326e794ec5a34fdec7a5ff4ed79d5bd055997d2
2022-10-20 12:44:15 -04:00
Laurent Morichetti
9a82118c85
SWDEV-362046 - Report HIP_OPS activities using the ROCr driver_node_id instead of the device's index
...
The ROCclr assigns zero-based IDs to GPUs in the order they are
discovered. That zero-based ID is what is used to identify the GPU
on which the HIP_OPS activity took place.
When multiple ranks are used, each rank's first logical device always
has GPU ID 0, regardless of which physical device is selected with
CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from
multiple ranks, GPU IDs from different processes may overlap.
The long term solution is to use the KFD's gpu_id which is stable
across APIs and processes. Unfortunately the gpu_id is not yet exposed
by the ROCr, so for now use the driver's node id.
Change-Id: Ib78854527d600d175bb76e2df0747c33f898c615
2022-10-20 12:31:30 -04:00
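The ID overlap that commit 9a82118c85 describes can be illustrated with a toy model. Everything here is hypothetical: the function names, the rank layout, and the node ID values are made up for illustration and are not the ROCclr or ROCr API.

```python
from typing import Dict, List, Tuple


def reported_ids(rank_to_nodes: Dict[int, List[int]],
                 use_node_id: bool) -> List[Tuple[int, int]]:
    """Return (rank, reported GPU ID) pairs for every visible device.

    rank_to_nodes maps a rank to the driver node IDs of the physical
    devices it can see (hypothetical values).
    """
    records = []
    for rank, nodes in sorted(rank_to_nodes.items()):
        for logical_index, node_id in enumerate(nodes):
            # Old scheme: zero-based index in discovery order, per process.
            # New scheme: the driver's node ID of the physical device.
            records.append((rank, node_id if use_node_id else logical_index))
    return records


def ids_overlap(records: List[Tuple[int, int]]) -> bool:
    """True if two records from the merged trace share a GPU ID."""
    ids = [gpu_id for _, gpu_id in records]
    return len(ids) != len(set(ids))
```

With two ranks that each see one distinct physical device, for example `{0: [2], 1: [3]}`, the zero-based scheme reports GPU ID 0 for both ranks, so the merged trace overlaps; reporting the node ID keeps the two devices distinct.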
Julia Jiang
dacd55f3d7
SWDEV-357122 - fix failure in vdi so as to handle unreasonable input size in MallocManagedNegativeTest
...
Change-Id: I1ed1916b652afc67327b0935c3c60fc2a404df30
2022-10-20 12:19:22 -04:00
Chauncey Hui
05b2bd7995
SWDEV-2 - Change OpenCL version number from 3515 to 3516
2022-10-20 03:00:06 -04:00
Christophe Paquot
470c4d7335
SWDEV-322620 - Virtual Memory Management
...
Hooked up the start VA hint address to PAL.
Change-Id: I4301d3eaaabcc85faf63fb25291291dcc3e8ebd1
2022-10-19 10:50:07 -04:00
Chauncey Hui
5cad630425
SWDEV-2 - Change OpenCL version number from 3514 to 3515
2022-10-19 03:00:13 -04:00
kjayapra-amd
951a5b9e7d
SWDEV-307411 - Require comment/message string in guarantee().
...
Change-Id: I55a699c5366f6a7d167ede4c9be2ec5b15cea9d0
2022-10-13 16:21:21 -04:00
Chauncey Hui
4e0e0666d6
SWDEV-2 - Change OpenCL version number from 3513 to 3514
2022-10-13 03:00:09 -04:00
Saleel Kudchadker
6405b6cdba
SWDEV-359548 - Detect fence optimization
...
- Use a dirty flag to determine fence optimization.
- If the fence is dirty, submit a marker at top level to sync.
Change-Id: I53fb19b5bb05b7c7b37c41637a6c7aaf870b639a
2022-10-10 14:15:16 -04:00
Chauncey Hui
621faa03ec
SWDEV-2 - Change OpenCL version number from 3512 to 3513
2022-10-08 03:00:09 -04:00
Saleel Kudchadker
eef2f99513
SWDEV-356482 - Improve logging
...
Change-Id: I0c6286fcdb9d94398ed66262e95e8837898ff9fe
2022-10-06 11:13:24 -04:00
Chauncey Hui
3e3fa68657
SWDEV-2 - Change OpenCL version number from 3511 to 3512
2022-10-06 03:00:06 -04:00
Sourabh Betigeri
84fbb30b7c
SWDEV-357246 - Adds a missing return statement
...
Change-Id: I2216f71f4d4fb6dd3766023b0c821cb3d35d7849
2022-10-05 16:29:32 -04:00
Laurent Morichetti
e00965df50
SWDEV-351980 - Add FillBuffer byte count to the record
...
Change-Id: I90c791f5810b8a3f6b1d6a9e81c165b1a7515c92
2022-09-30 21:20:14 -07:00
Chauncey Hui
800fe4b239
SWDEV-2 - Change OpenCL version number from 3510 to 3511
2022-09-30 03:00:10 -04:00
Jeremy Newton
65fd4f5f07
SWDEV-1 - Fix typo in compiler utils
...
Found by the Debian maintainers:
https://github.com/ROCm-Developer-Tools/ROCclr/issues/28
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: I8c09913f7b7f4ad02f12a0630f15e66bdd5360a7
2022-09-29 18:14:29 -04:00
Chauncey Hui
f93d921c4f
SWDEV-2 - Change OpenCL version number from 3509 to 3510
2022-09-23 03:00:11 -04:00