Saleel Kudchadker
8a538b5639
SWDEV-384557 - Reset mask at in the constructor
...
Change-Id: I15dafd281213c03590bcae4bf4544e33c4a7e277
2023-08-11 17:22:27 -04:00
Saleel Kudchadker
aa6eb555e2
SWDEV-384557 - Enable SDMA query
...
Change-Id: Ibb0a8d131f799985a4d4adbf753261e58c04157f
2023-08-01 18:41:23 -04:00
Alex Xie
7912f3af89
SWDEV-409299 - Vega clinfo is not working
...
Change-Id: Ia48bc6f130bd102dff210b105de6f9c02ebbe012
2023-07-10 09:53:50 -04:00
German
ad2c1e899a
SWDEV-408650 - Report CL_DEVICE_HALF_FP_CONFIG query
...
If cl_khr_fp16 extension is enabled, then OCL runtime should report CL_DEVICE_HALF_FP_CONFIG.
Change-Id: I7c4ac48387f80bc704a475c57e5b52a462090d1b
2023-07-06 11:58:16 -04:00
kjayapra-amd
aca7d9e14a
FEAT-47686 - Changes to support new uncached memory segment in ROCr.
...
Change-Id: I0ba8769d6737cdf1cc8a8644b2e82109f584a430
2023-07-05 20:47:51 -04:00
Rakesh Roy
8c1232124e
SWDEV-405329 - Fix cuMask issue for WGP mode
...
- Enable CUs adjacent pairwise for WGP mode
- In HostQueue::terminate() do not segfault if virtual device hasn't been created
Change-Id: I94402ff333308af5824878086cc238b3993d534d
2023-06-30 01:09:01 -04:00
Saleel Kudchadker
770b2a4711
SWDEV-384557 - Rename env var
...
- Rename HIP_USE_SDMA_QUERY to DEBUG_CLR_USE_SDMA_QUERY as this is
supposed to be a temporary env var for debug purposes only.
Change-Id: If6ebd52ab87624375a3df24ceccdcc05c60a65af
2023-06-29 13:54:55 -04:00
ajay
d6946ffcbc
SWDEV-406687 - combining rocblitcl and palblitcl blit kernel defs
...
Change-Id: Ia312d73584a03491e8d574f424295b64df6de174
2023-06-23 18:38:36 -04:00
German
9aa6f25f1c
SWDEV-311270 - Clean-up IPC path in ROCclr
...
- Make IPC interfaces generic between devices and rely on the IPC buffer
for attach/detach logic
Change-Id: Id3c18d122030329b7ee532bbb6317de9dd6a0bbe
2023-06-20 15:18:21 -04:00
Saleel Kudchadker
4510caed26
SWDEV-384557 - Include query under env var
...
Change-Id: Ibe0335168f9ad0cb72abb2ed69a5e40d33ce7c1c
2023-06-16 15:09:46 -07:00
Saleel Kudchadker
5436d362b1
SWDEV-301667 - Add a flag for gpuvm kernargs
...
HIP_FORCE_DEV_KERNARG=1 will create a device allocation for kernel arg
segment. Flag is 0 by default.
Change-Id: Iaaf5a149f3be8596568878d5d272268baf067c60
2023-05-22 11:23:48 -04:00
Ioannis Assiouras
a07e38be2f
SWDEV-400119 - Return error from globalFreeMemory() if HSA available memory query fails
...
Change-Id: I33c069fa1e0d31891f64dc8167a24ef9d7ed1426
2023-05-15 11:54:25 +01:00
Saleel Kudchadker
0b475284e9
SWDEV-398151 - Partly relax static engine allocation
...
Change-Id: I4903b51a34b597a2e84d771b52cf629f877dba05
2023-05-11 00:52:18 -04:00
Jaydeep Patel
ffa1deef27
SWDEV-398047 - Disable arena for XNACK.
...
Change-Id: If6c98490bd5b8146a357f45ed132f687def0877f
2023-05-10 23:28:05 -04:00
Saleel Kudchadker
5865c642d4
SWDEV-384557 - Fix engine status query
...
- Maintain a map of SDMA engine# to stream allocated following a greedy
approach
- Anything past that will query SDMA engine status always and go with a
SDMA or Blit copy path
Change-Id: Ibfaed7f951ab84d80cb0430596a4d11b5aec9202
2023-04-21 00:57:26 -04:00
Xiaomeng Hou
35a0855e4e
SWDEV-373400 - Lower the threshold of single alloc percent for APUs w/ small VRAM
...
Change-Id: Ie8e3504a0c468bc44633991cdfde66bcf0b74f06
2023-03-06 10:12:01 -05:00
Saleel Kudchadker
d429314b21
SWDEV-380035 - Do not create Arena Memobj for pinned memory
...
Change-Id: I4da281f0b7139efb4bb46b47f812358a0212445a
2023-02-24 12:31:25 -08:00
Anusha GodavarthySurya
f98dcf9d7f
SWDEV-366636 - Fix performance drop in TF-RCCL models
...
Change-Id: Idc845bb0dab858b94b9d2720cae8308cac2e7328
2023-02-17 00:48:13 -05:00
Maneesh Gupta
5dc104b3ea
SWDEV-368235 - Revert "Remove obsolete env variables"
...
This reverts commit 7b50c935f8 .
Reason for revert: Deferred to a future release.
Change-Id: Ia66c37f0ab9734dee73c930d10d7469d5fd57254
2023-02-15 07:25:00 +00:00
Anusha GodavarthySurya
fa35239b4b
SWDEV-380035 - Check for agent and ptr match for hsa LOCKED ptr
...
Change-Id: I2503aa05512aebc3535963e188ca8d1dbfab08f0
2023-02-15 00:12:42 -05:00
David Yat Sin
fa1b7925cf
SWDEV-365908 - Do not align mem size for IPC
...
Remove alignment to granularity for IPC handles as ROCr has a patch that
will internally validate pointer sizes against requested size during
allocation instead of size aligned to page size. This patch is needed
together with this patch from ROCr:
f8a42a3a:Use user requested size for memory fragments
Change-Id: I28b25558ea03c836b44fafdb34b7330cf6887424
2023-02-13 17:32:02 -05:00
German Andryeyev
67af16bdf3
SWDEV-381633 - Use blit manager on app's queue
...
Heap initialization used device queue, but it shoudl be used for
cooperative launches only. Heap initialization must use the same queue
as the current dispatch.
Change-Id: I856621bf82bbdeb1c2d0fbc4970e90d09af805cb
2023-02-09 09:43:13 -05:00
Saleel Kudchadker
7619b53e9b
SWDEV-381633 - Better log
...
Change-Id: Ie151d73aa4b1dae7d0d9acd0457e7dbdf35f21ee
2023-02-08 17:54:11 -05:00
jatang
b798c85272
SWDEV-380792 - Fix floating point exception when maxEngineClockFrequency_ is 0
...
Change-Id: Ic443ceae586c4c84995ed2abef9bd7f32f8b60f9
2023-02-07 11:43:10 -05:00
German
7b50c935f8
SWDEV-368235 - Remove obsolete env variables
...
Change-Id: I7e14d53297e79e2f68b3a6cc40251ad7db9eb5ab
2023-02-03 13:44:24 -05:00
David Yat Sin
08d72c4f98
SWDEV-380258 - Fix invalid iterator after erase
...
Fix rare segfault due to invalid iterator after erase
Change-Id: Id5b54d6cf10075deff0d613fec12af249c6c55a3
2023-01-31 17:46:15 +00:00
Jiabao Xie
3e88081961
SWDEV-366886 - Revert "SWDEV-366886 - force svm alloc for rocm"
...
This reverts commit 8278e86918 .
Reason for revert: performance drop in Quicksilver app
Change-Id: I2bdf42ad0a235a74e2bb4d38f86471e9affbd7a6
2023-01-25 15:54:53 -05:00
German
6dad2fc306
SWDEV-377991 - Remove liquidflash support
...
Remove amdgpu-pro interface for persistent memory,
used in Liquidflash
Change-Id: I7d1720ad0875a62ebb2d7f96cba39601d560a5df
2023-01-23 10:40:57 -05:00
jiabaxie
8278e86918
SWDEV-366886 - force svm alloc for rocm
...
Change-Id: If85a6aaeb5861ff1550a0bc3f0a25073ff8145e7
2023-01-18 16:26:34 -05:00
German
c8927cd84e
SWDEV-377991 - Remove Liquidflash extension
...
Initial check-in to untie dependencies with HIP and OCL repos
Change-Id: I363b63954c3f118f40a6ed893545d6a4ac44144c
2023-01-18 13:16:20 -05:00
Anusha GodavarthySurya
2b50636809
SWDEV-373173 - Correct max VGPRs and VGPR Granularity in Occupancy calculation for different asics
...
Change-Id: I6f595428ed31733cdc04e65e7c3e383f454ab72b
2023-01-18 00:22:02 -05:00
Anusha GodavarthySurya
274f2de391
SWDEV-364576 - initialize device malloc heap state using blit kernel
...
Change-Id: I5d0172aff7d2c04b322a4d828b8a2b438158b80f
2023-01-07 06:53:53 +00:00
jatang
e8cd1e93e8
SWDEV-1 - Not to truncate size_t in print when it's 64bit
...
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad
2022-12-13 09:13:33 -05:00
kjayapra-amd
e56a611b92
SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
...
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e
2022-12-11 08:15:17 -05:00
pghafari
402504f548
SWDEV-366279 - updating p2p error msg
...
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926
2022-12-09 00:05:04 -05:00
Ioannis Assiouras
b445717f72
SWDEV-369547 - Ensure that ipc_mem_detach is not called for non-ipcShared MemObj.
...
Change-Id: Ia73b60b9f08d593301879e0f72c395edbc215112
2022-11-30 02:16:13 -05:00
Anusha GodavarthySurya
446a3fc688
SWDEV-354074 - cache hsa queue and delete when device is destroyed
...
Change-Id: I26365521d785f0bc612e32bdcdb6caacb3af9a11
2022-11-23 07:17:45 +00:00
German
b40823ccc1
SWDEV-328670 - Enable ROCr interop
...
Change-Id: I932afa2a0e75b1f1771c4be1e984b32966636afb
2022-11-17 09:40:34 -05:00
Maneesh Gupta
a52f5bda8f
SWDEV-299940 - Query global memory available on device with HSA attribute HSA_AMD_AGENT_INFO_MEMORY_AVAIL
...
This reverts commit 4afca0647e .
Reason for revert: ROCr query should now be usable in upcoming release.
Change-Id: I2207761ca6af5d585d090bae1af09eb9a8e9bad6
2022-11-09 10:48:47 -05:00
Saleel Kudchadker
3e465c5ff8
SWDEV-364052 - Print free memory on queue callback
...
Change-Id: I7d15b6f6277326d5f3e9b784a3443f18ec6ba58a
2022-11-04 13:19:33 -04:00
Ajay
261620a2ef
SWDEV-357207 - Linux: enable StreamWrite/Wait APIs in deviceAttribute
...
Change-Id: I593df2dbb7a0233dd42f8d873510d85bbb27feff
2022-10-21 12:44:35 -04:00
Laurent Morichetti
9a82118c85
SWDEV-362046 - Report HIP_OPS activities using the ROCr driver_node_id instead of the device's index
...
The ROCclr assigns zero-based IDs to GPUs in the order they are
discovered. That zero-based ID is what is used to identify the GPU
on which the HIP_OPS activity took place.
When multiple ranks are used, each rank's first logical device always
has GPU ID 0, regardless of which physical device is selected with
CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from
multiple ranks, GPU IDs from different processes may overlap.
The long term solution is to use the KFD's gpu_id which is stable
across APIs and processes. Unfortunately the gpu_id is not yet exposed
by the ROCr, so for now use the driver's node id.
Change-Id: Ib78854527d600d175bb76e2df0747c33f898c615
2022-10-20 12:31:30 -04:00
Julia Jiang
dacd55f3d7
SWDEV-357122 - fix failure in vdi so as to handle unreasonable input size in MallocManagedNegativeTest
...
Change-Id: I1ed1916b652afc67327b0935c3c60fc2a404df30
2022-10-20 12:19:22 -04:00
kjayapra-amd
951a5b9e7d
SWDEV-307411 - Require comment/message string in guarantee().
...
Change-Id: I55a699c5366f6a7d167ede4c9be2ec5b15cea9d0
2022-10-13 16:21:21 -04:00
Saleel Kudchadker
9b5cbd37a2
SWDEV-352001 - Store last scopes for dispatch
...
- Store last fence scopes and use the last value to determine if we need a cache flush again. This helps cases where hipExtLaunchKernel API is
used.
- Purge code for ROC_EVENT_NO_FLUSH
Change-Id: I531cf9c9c60d5e2b3a9e265d0f52f79ed2fa8a8c
2022-09-22 11:34:10 -04:00
Maneesh Gupta
4afca0647e
SWDEV-352502 - SWDEV-355630 - Revert "SWDEV-299940 - Query global memory available on device with HSA attribute HSA_AMD_AGENT_INFO_MEMORY_AVAIL"
...
This reverts commit 73021582d3 .
Change-Id: I86175ccf2b543c3ff3dffec6ccae410976972f22
2022-09-12 07:34:32 -04:00
Sourabh Betigeri
b15b3173f4
SWDEV-326798 - Avoids waiting for the signal with a timeout and instead queries the current status
...
Change-Id: I8b4052d5439ca930ceb924da30cb0335f0826a95
2022-09-06 12:49:46 -04:00
sdashmiz
50e0ddb055
SWDEV-350569 - add proper indexing for attribute
...
- rocr attribute needs to be updated after each iteration
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com >
Change-Id: I3afb2d7954ef3de37f5f5f9d3cc7757fdacffcec
2022-08-18 09:59:42 -04:00
Sarbojit Sarkar
a28b22d9b4
SWDEV-344210 - Fixed page fault when mempool accessed from remote device
...
Change-Id: Ie41b0c0499f7733d4541ccd50b3d0d976c6431c9
2022-08-09 23:26:27 -04:00
Anusha Godavarthy Surya
73021582d3
SWDEV-299940 - Query global memory available on device with HSA attribute HSA_AMD_AGENT_INFO_MEMORY_AVAIL
...
Change-Id: Ifcfb95f59d110b3b037a7131be21d5348601d2e2
2022-08-05 06:55:12 -04:00