Граф коммитов

12735 Коммитов

Автор SHA1 Сообщение Дата
Brzak, Branislav d4275741ba SWDEV-525653 - Make hipGetDeviceProperties and hipChooseDevice use the new API (#159)
[ROCm/clr commit: b006380ff6]
2025-04-08 18:54:05 +02:00
Patel, Jaydeepkumar 2f3bc7f01c SWDEV-521011 - Allow max stack size as per ISA. (#73)
[ROCm/clr commit: 9e7248aa36]
2025-04-08 10:15:38 +05:30
Andryeyev, German 4c9cc6ba30 SWDEV-497841 - Add VmHeapArray support (#76)
Add VmHeapArray class to reduce the pressure on VA reservation, since
multiple memory pools can be active at the same time.

[ROCm/clr commit: e974f7fde1]
2025-04-03 21:04:18 +05:30
Andryeyev, German 3ceab5ba02 SWDEV-524849 - Fix HIP error returned during capture (#141)
Always use the latest dependent nodes during hipEventRecord capture

[ROCm/clr commit: 3514f45544]
2025-04-03 20:08:25 +05:30
Betigeri, Sourabh 487ede31a9 SWDEV-523281 - [clr] Implementation of hipLaunchKernelExC and hipDrvLaunchKernelEx API with support for cooperative launch (#92)
[ROCm/clr commit: 8c6b90996e]
2025-04-03 20:10:05 +09:00
Arandjelovic, Marko 1c83314659 SWDEV-517867 - Remove invalid assert (#55)
* Remove invalid assert

* Retrigger CI

* Rebase

[ROCm/clr commit: 8fcaa1ca93]
2025-04-03 11:14:32 +02:00
Patel, Jaydeepkumar b217d3a4e6 SWDEV-508632 - Align address to 2 MBs for hidden heap allocation. (#29)
[ROCm/clr commit: b5c9cbc236]
2025-04-02 16:33:29 +05:30
Mallya, Ameya Keshava 29be7230eb fixed syntax to mainline
[ROCm/clr commit: 98f1db181c]
2025-04-01 09:51:41 -07:00
Mallya, Ameya Keshava f117699bef !verify functionality
[ROCm/clr commit: ae1d0ef8a1]
2025-03-31 13:14:08 -07:00
Mallya, Ameya Keshava 594c7e6704 Adding KWS check for amd-mainline
[ROCm/clr commit: 24184e151c]
2025-03-28 08:05:47 -07:00
MartinezFernandez, Juan 966157cd5b Remove PCH code: the code related to PCH is dead and not used (#66)
cherry-pick of compute/ec/clr/+/1184122

Co-authored-by: Juan Manuel Martinez Caamaño <juamarti@amd.com>

[ROCm/clr commit: f580632174]
2025-03-28 10:36:19 +01:00
Sang, Tao d49a2a51d6 SWDEV-508863 - Support generic target in compressed fatbin (#44)
[ROCm/clr commit: 8d90b44a1b]
2025-03-27 20:13:51 +05:30
GunaShekar, Ajay aaba454bfc SWDEV-523853 - Use RecordRenderOps instead of RecordRenderOp (#97)
[ROCm/clr commit: 686dd56a4e]
2025-03-26 09:28:40 +05:30
Belton-Schure, Aidan e27e3eb66a SWDEV-515426 - Use RAII classes for comgr (#28)
Change-Id: I9f6005542cc88f1e16e22741dcc0ce904fdaa2b0

[ROCm/clr commit: ded41058a0]
2025-03-25 20:10:44 +05:30
Dittakavi, Satyanvesh 0506822f4d SWDEV-516595 - Add __shfl functions with __hip_bfloat16 datatype (#42)
Also removes asserts in cooperative groups shfl functions since
__hip_bfloat16 shfl is present now

Change-Id: I57578b6e68dccc10c2ddcd194e9cc18bc7732ce1

[ROCm/clr commit: 376f23b86a]
2025-03-25 15:38:01 +05:30
Gupta, Maneesh 781da97ff6 Update CODEOWNERS (#77)
[ROCm/clr commit: d9abcdd999]
2025-03-20 15:40:50 +05:30
Arandjelovic, Marko cc5124241b Revert SWDEV-512344 - Unmap all subbuffers (#26)
This reverts commit 0b69120cfcb5b4689d9f2037b1a01e274d85c20f.

[ROCm/clr commit: e7ada4effe]
2025-03-19 21:17:36 +05:30
Godavarthy Surya, Anusha 35a15ca3b2 Revert "SWDEV-492049 - Remove the handle of Phy Mem from Memobj" (#72)
This reverts commit dabe311bd8.

[ROCm/clr commit: 2259a8c01c]
2025-03-19 21:16:51 +05:30
Andryeyev, German 5c7c86f66d SWDEV-517481 - Add dynamic queue management (#37)
Enabled by defaulty. DEBUG_HIP_DYNAMIC_QUEUES controls the feature

[ROCm/clr commit: 28967982b2]
2025-03-19 11:22:50 -04:00
Andryeyev, German 00e77db856 SWDEV-497841 - Avoid access to the null stream on mempool alloc
Null stream isn't created during the device creation

[ROCm/clr commit: 392ed53c3c]
2025-03-17 11:40:14 -04:00
Mallya, Ameya Keshava 61a6b9bf60 Added KWS check
[ROCm/clr commit: cde722ad71]
2025-03-12 10:12:06 -07:00
Mallya, Ameya Keshava 5c27fe7c8e Added rocm-ci-caller
[ROCm/clr commit: 35dcd43c59]
2025-03-12 10:05:57 -07:00
Gerardo Hernandez 8c9aa07b3a SWDEV-420237 - Add __reduce_add_sync()
Change-Id: Ic8e4fab6b7aeb879d40b2c1419b30d1355a2bbdc


[ROCm/clr commit: 340d6bb69f]
2025-03-12 03:20:49 -04:00
agunashe 4a5c6b8d82 SWDEV-513810 - APU: memory allocations threshold 0.75-->1
Needs further debugging but for now can test the change

Need to verify if this fixes all the below issues-
SWDEV-512754, SWDEV-511675, SWDEV-511055, SWDEV-504085, SWDEV-499503
Also verify original issues
SWDEV-471863, SWDEV-490991

Change-Id: Ic845f851de1b98e8ed9aa0f07afddec3858119e9


[ROCm/clr commit: f1b8ee7b7f]
2025-03-11 05:30:43 -04:00
Saleel Kudchadker c94c02a2e6 SWDEV-519596 - Avoid passing dep signal to SDMA
- For D2H cases avoid passing dependent signals to SDMA, the signals
  take a while to resolve on SDMA engine

Change-Id: I569635228af977847f201c82ca897002f8f2f4a8


[ROCm/clr commit: 78d0ff2dbc]
2025-03-07 17:37:21 -05:00
Pengda Xie 021ca96766 SWDEV-497619 - Ensure suballocSize is integer multiple of 4096
Change-Id: Iefc452d73566f58cfb63391a68c836f30d77dd6c


[ROCm/clr commit: b02b1858c0]
2025-03-07 15:36:57 -05:00
Rakesh Roy 4dd97ee92c Revert "SWDEV-508982 - [6.4 Preview] - Handle hipMemPoolCreate, hipMemPoolDestory & hipDeviceSetMemPool during stream capture."
This reverts commit f674ba58f0.

Reason for revert: 6.4 Preview changes need not be merged to amd-staging as of now

Change-Id: I86452adfed14655f72d90440a486089743cc6587


[ROCm/clr commit: 5da8ce45ab]
2025-03-07 06:43:24 -05:00
Rakesh Roy d35aed5012 Revert "SWDEV-510271 - [6.4 Preview] fix hipCreateSurfaceObject & hipDestroySurfaceObject during stream capture"
This reverts commit 82f78ce187.

Reason for revert: 6.4 Preview changes need not be merged to amd-staging as of now

Change-Id: Ifba0c8a248bc40deaa9c59b7f2901531300e5ea4


[ROCm/clr commit: 4206405514]
2025-03-07 06:42:12 -05:00
Rakesh Roy ff16c1a755 Revert "SWDEV-508980 - [6.4 Preview] fix hipDeviceSetCacheConfig during stream capture"
This reverts commit 04dc7ca51f.

Reason for revert: 6.4 Preview changes need not be merged to amd-staging as of now

Change-Id: I04af8603053338f08c396e78ff8a6715e641ca19


[ROCm/clr commit: 3fa6049c46]
2025-03-07 06:40:53 -05:00
Ioannis Assiouras df9f0a0eec SWDEV-511813 - Fix linkage of hipRTC-header.o into libhiprtc.a
Using target_link_libraries does not properly link the hipRTC-header.o
into libhiprtc for static build. Change to use target_sources instead.
This does not affect the linkage in the shared build.

Change-Id: I626f9eacc1637b792a50e7ddddb5db09e704ac4a


[ROCm/clr commit: 8f54aeb765]
2025-03-06 16:29:57 -05:00
Jacob Lambert 26345a1431 SWDEV-518221 Fix major/minor Comgr version check
Change-Id: I2210aadafcae984dafc68c3fe16508bb2b409077


[ROCm/clr commit: 2e2b6b3592]
2025-03-06 13:02:34 -05:00
Julia Jiang 7c813db874 SWDEV - 508961 - Update requestedHanleTye in CLR repos
Change-Id: I6949a36c5b0bb8e88a2a33ed13ae8f278a5b19c7


[ROCm/clr commit: e5425393b4]
2025-03-06 11:37:31 -05:00
taosang2 dae18cb473 SWDEV-512613 - Improve device atomics functions
Also part of SWDEV-510994.
1. Fix atomicMin/Max_system() for float and double.
2. Remove logics of gfx941 which isn't supported.

Change-Id: Iacfdc1bc13e8da2f5df8751bb315b37d33cea667


[ROCm/clr commit: d91e1f19d0]
2025-03-06 10:05:59 -05:00
Ioannis Assiouras 844ba771ec SWDEV-517715 - Remove dependency on non-static hipcc from hip-static-devel
Change-Id: I1184680949fa73d7dc0957062292e6682179b203


[ROCm/clr commit: e963d30b5d]
2025-03-06 10:01:58 -05:00
Saleel Kudchadker 5070d4b654 SWDEV-508004 - Improve hipStreamWaitEvent & Fix typo
- hipStreamWaitEvent may not resolve streams
- Correct usage of flag passed to streamWait function

Change-Id: I2ee163615d303b98937c1035d60da283cce6f677


[ROCm/clr commit: 940347ad42]
2025-03-05 11:56:01 -05:00
Pengda Xie cd9bcb7e53 SWDEV-518317 - Don't attempt to remove managedVars when map is empty
Change-Id: I25c33487dc08f96c087b6acc1abe42a4a666a609


[ROCm/clr commit: ae3b053ddf]
2025-03-05 11:53:18 -05:00
Branislav Brzak c81185cf26 SWDEV-516564 - SWDEV-512817 - Remove mentions of gfx940 and gfx941
Change-Id: Ia069fcb9c6948c3fc9a00961593c9dcc59609375


[ROCm/clr commit: c2d1776ebd]
2025-03-05 04:26:07 -05:00
Saleel Kudchadker c8f39ec2b0 SWDEV-502365 - Track last used command
- This change tries to save extra synchronization packets we may insert
  as we didnt track the completion signals for every command. We track
the current enqueued command until it exits the enqueue stage. We also
record the exit scope to know if we flushed the caches
- Handle correct release scopes and store completion signal as HW events
- Use a new finishCommand implementation to only wait for the command
  passed as the argument

Change-Id: Ie4350c5dd24f5d48dfa6ccbabd892f0544caadcc


[ROCm/clr commit: e03e4f3b5d]
2025-03-04 16:05:02 -05:00
German Andryeyev 77840f1cb9 SWDEV-518474 - Add comgr debug mask
Move prints from CO processing under COMGR debug mask.

Change-Id: I2a417e42a1f4e2922a34eb104c69e4db10b5f1c6


[ROCm/clr commit: cece301fd4]
2025-03-04 14:37:08 -05:00
Julia Jiang 6fe0eb91ee SWDEV-509855 - Update hipDeviceAttributePciDomainID in CLR
Change-Id: I79939b333ef6114b97009ca4bfb67f63a9a22784


[ROCm/clr commit: 81db54d3f9]
2025-03-04 14:08:08 -05:00
Marko Arandjelovic 861491b824 SWDEV-512344 - Unmap all subbuffers
Since hipMemMap can be called for multiple device handles on the same virtual memory, the same is true for hipMemUnmap, meaning that virtual memory can be "partially unmapped".

This means that the unmap function can be called for a specific part of the reserved address, meaning that only the designated subbuffer should be released. If unmap is called on the entire reserved memory, then all subbuffers should be released.

The main point is that for every hsa_amd_vmem_map, there should be a corresponding hsa_amd_vmem_unmap. Otherwise, if entire memory is unmapped by a single unmap call, then HSA will report the memory as "in use" if an attempt is made to delete it.

Change-Id: I039308eafb820decfb1c09f60347f26cdad1a362


[ROCm/clr commit: 3ec1d2d2f1]
2025-03-02 13:41:48 -05:00
Todd tiantuo Li 4e4c2038eb SWDEV-1 - skip OCLMemoryInfo test for APUs
Change-Id: I5afc244f68115a538eb3d989c3aa119e0f279deb


[ROCm/clr commit: 0558a8cd8a]
2025-02-28 13:10:40 -05:00
Saleel Kudchadker d0a7ae02cf SWDEV-513197 - Unify getBuffer implementation
- Use getBuffer/releaseBuffer in BlitManager
- Cleanup XferBuffer as we use ManagedBuffer for both reads/writes

Change-Id: I2661b85dd012763b17a38a743fec1b1d79125f67


[ROCm/clr commit: 37d606d193]
2025-02-28 12:47:51 -05:00
Saleel Kudchadker ef505c7cd8 SWDEV-513197 - Improve launch perf for Device Heap kernels
- If any kernel uses device heap, the launch needs to be preceeded by an
  init kernel, Save on the extra barrier packet launch/flush between the
init heap kernel and user kernel

Change-Id: I8ebc6246188200e5f673dc464bc76a53bcb8b7c6


[ROCm/clr commit: ca530c660b]
2025-02-27 19:17:51 -05:00
Ioannis Assiouras efd1f3f012 SWDEV-509788 - Code cleanups in Event class
Change-Id: I4163ce6c1dabeaab92de13b51b6a46b7be83e2bd


[ROCm/clr commit: e9b33af45a]
2025-02-27 17:16:50 -05:00
Alex Xie 004fc8f4a7 SWDEV-514060 - Use newer OpenCL API in test using AMD extension API
Change-Id: Ifdbb3c45628116e48f68ae29324e4cc554c9f2d5


[ROCm/clr commit: 375a147e51]
2025-02-27 11:20:37 -05:00
Rahul Manocha 90337103ac SWDEV-510849 - Restore pinned memory copy path
1) Create getBuffer method to return pinned host memory or staging buffer
2) for D2H path use managed buffer instead of static buffer
3) use staging buffer copy for 16KB < size < 1MB
4) use pinned memory copy for size > 1MB

Change-Id: I13d4d6ab60691bc6c7724239db1e11e23f0f3dc2


[ROCm/clr commit: 4bf634dfca]
2025-02-26 11:25:02 -05:00
Jacob Lambert 7da5718afb SWDEV-360440 - Prepare CLR dynamic Comgr load for V3
Change-Id: Iffd835056740ff10711468cc752b7b55078b9f8f


[ROCm/clr commit: 187648429b]
2025-02-26 11:22:18 -05:00
Pengda Xie e591f1f7f7 SWDEV-512044 - Fix logic error in texture size validation
Change-Id: I6aefcfed25b099c17bf0856d621081c0a5ce46c5


[ROCm/clr commit: ade704dd2f]
2025-02-26 11:20:58 -05:00
Ioannis Assiouras eb825325d2 SWDEV-516994 - Fix race condition in the implementation of graph AutoFreeOnLaunch on Windows
Change-Id: I3c98d0d4bffe2a9e0aa5cfa24b6c8e9a8087da29


[ROCm/clr commit: a8f309049d]
2025-02-26 02:36:31 -05:00