rocm-systems

Autor(a)	SHA1	Mensagem	Data
Ioannis Assiouras	844ba771ec	SWDEV-517715 - Remove dependency on non-static hipcc from hip-static-devel Change-Id: I1184680949fa73d7dc0957062292e6682179b203 [ROCm/clr commit: `e963d30b5d`]	2025-03-06 10:01:58 -05:00
Saleel Kudchadker	5070d4b654	SWDEV-508004 - Improve hipStreamWaitEvent & Fix typo - hipStreamWaitEvent may not resolve streams - Correct usage of flag passed to streamWait function Change-Id: I2ee163615d303b98937c1035d60da283cce6f677 [ROCm/clr commit: `940347ad42`]	2025-03-05 11:56:01 -05:00
Pengda Xie	cd9bcb7e53	SWDEV-518317 - Don't attempt to remove managedVars when map is empty Change-Id: I25c33487dc08f96c087b6acc1abe42a4a666a609 [ROCm/clr commit: `ae3b053ddf`]	2025-03-05 11:53:18 -05:00
Branislav Brzak	c81185cf26	SWDEV-516564 - SWDEV-512817 - Remove mentions of gfx940 and gfx941 Change-Id: Ia069fcb9c6948c3fc9a00961593c9dcc59609375 [ROCm/clr commit: `c2d1776ebd`]	2025-03-05 04:26:07 -05:00
Saleel Kudchadker	c8f39ec2b0	SWDEV-502365 - Track last used command - This change tries to save extra synchronization packets we may insert as we didnt track the completion signals for every command. We track the current enqueued command until it exits the enqueue stage. We also record the exit scope to know if we flushed the caches - Handle correct release scopes and store completion signal as HW events - Use a new finishCommand implementation to only wait for the command passed as the argument Change-Id: Ie4350c5dd24f5d48dfa6ccbabd892f0544caadcc [ROCm/clr commit: `e03e4f3b5d`]	2025-03-04 16:05:02 -05:00
German Andryeyev	77840f1cb9	SWDEV-518474 - Add comgr debug mask Move prints from CO processing under COMGR debug mask. Change-Id: I2a417e42a1f4e2922a34eb104c69e4db10b5f1c6 [ROCm/clr commit: `cece301fd4`]	2025-03-04 14:37:08 -05:00
Julia Jiang	6fe0eb91ee	SWDEV-509855 - Update hipDeviceAttributePciDomainID in CLR Change-Id: I79939b333ef6114b97009ca4bfb67f63a9a22784 [ROCm/clr commit: `81db54d3f9`]	2025-03-04 14:08:08 -05:00
Marko Arandjelovic	861491b824	SWDEV-512344 - Unmap all subbuffers Since hipMemMap can be called for multiple device handles on the same virtual memory, the same is true for hipMemUnmap, meaning that virtual memory can be "partially unmapped". This means that the unmap function can be called for a specific part of the reserved address, meaning that only the designated subbuffer should be released. If unmap is called on the entire reserved memory, then all subbuffers should be released. The main point is that for every hsa_amd_vmem_map, there should be a corresponding hsa_amd_vmem_unmap. Otherwise, if entire memory is unmapped by a single unmap call, then HSA will report the memory as "in use" if an attempt is made to delete it. Change-Id: I039308eafb820decfb1c09f60347f26cdad1a362 [ROCm/clr commit: `3ec1d2d2f1`]	2025-03-02 13:41:48 -05:00
Todd tiantuo Li	4e4c2038eb	SWDEV-1 - skip OCLMemoryInfo test for APUs Change-Id: I5afc244f68115a538eb3d989c3aa119e0f279deb [ROCm/clr commit: `0558a8cd8a`]	2025-02-28 13:10:40 -05:00
Saleel Kudchadker	d0a7ae02cf	SWDEV-513197 - Unify getBuffer implementation - Use getBuffer/releaseBuffer in BlitManager - Cleanup XferBuffer as we use ManagedBuffer for both reads/writes Change-Id: I2661b85dd012763b17a38a743fec1b1d79125f67 [ROCm/clr commit: `37d606d193`]	2025-02-28 12:47:51 -05:00
Saleel Kudchadker	ef505c7cd8	SWDEV-513197 - Improve launch perf for Device Heap kernels - If any kernel uses device heap, the launch needs to be preceeded by an init kernel, Save on the extra barrier packet launch/flush between the init heap kernel and user kernel Change-Id: I8ebc6246188200e5f673dc464bc76a53bcb8b7c6 [ROCm/clr commit: `ca530c660b`]	2025-02-27 19:17:51 -05:00
Ioannis Assiouras	efd1f3f012	SWDEV-509788 - Code cleanups in Event class Change-Id: I4163ce6c1dabeaab92de13b51b6a46b7be83e2bd [ROCm/clr commit: `e9b33af45a`]	2025-02-27 17:16:50 -05:00
Alex Xie	004fc8f4a7	SWDEV-514060 - Use newer OpenCL API in test using AMD extension API Change-Id: Ifdbb3c45628116e48f68ae29324e4cc554c9f2d5 [ROCm/clr commit: `375a147e51`]	2025-02-27 11:20:37 -05:00
Rahul Manocha	90337103ac	SWDEV-510849 - Restore pinned memory copy path 1) Create getBuffer method to return pinned host memory or staging buffer 2) for D2H path use managed buffer instead of static buffer 3) use staging buffer copy for 16KB < size < 1MB 4) use pinned memory copy for size > 1MB Change-Id: I13d4d6ab60691bc6c7724239db1e11e23f0f3dc2 [ROCm/clr commit: `4bf634dfca`]	2025-02-26 11:25:02 -05:00
Jacob Lambert	7da5718afb	SWDEV-360440 - Prepare CLR dynamic Comgr load for V3 Change-Id: Iffd835056740ff10711468cc752b7b55078b9f8f [ROCm/clr commit: `187648429b`]	2025-02-26 11:22:18 -05:00
Pengda Xie	e591f1f7f7	SWDEV-512044 - Fix logic error in texture size validation Change-Id: I6aefcfed25b099c17bf0856d621081c0a5ce46c5 [ROCm/clr commit: `ade704dd2f`]	2025-02-26 11:20:58 -05:00
Ioannis Assiouras	eb825325d2	SWDEV-516994 - Fix race condition in the implementation of graph AutoFreeOnLaunch on Windows Change-Id: I3c98d0d4bffe2a9e0aa5cfa24b6c8e9a8087da29 [ROCm/clr commit: `a8f309049d`]	2025-02-26 02:36:31 -05:00
Rahul Manocha	75d53e5c22	SWDEV-489106 - Linker API addition to runtime 1) Add Linker APIs to runtime to support SPIRV linking 2) Migrate Internal implementations to runtime and share with rtc 3) Add Support to bundled and unbundled SPIRV Code object linking. Change-Id: Ic1fd4431f842a208a2468e8aec54a65b5fa6b0e3 [ROCm/clr commit: `5930f047bb`]	2025-02-22 13:39:23 -05:00
Li, Todd tiantuo	04dc7ca51f	SWDEV-508980 - [6.4 Preview] fix hipDeviceSetCacheConfig during stream capture Change-Id: I8e89774a8163fdc120155f742606ee2c0aa7103b [ROCm/clr commit: `9faaf20aae`]	2025-02-22 01:05:28 -05:00
Li, Todd tiantuo	82f78ce187	SWDEV-510271 - [6.4 Preview] fix hipCreateSurfaceObject & hipDestroySurfaceObject during stream capture Change-Id: I19e149549c271d847f52b72e04cb2427ca194b24 [ROCm/clr commit: `c07468e53c`]	2025-02-22 01:04:35 -05:00
Ioannis Assiouras	8d29fb9e6d	SWDEV-509788 - Use stream memory operation in hipStreamWaitEvent This change removes the stream callback from hipStreamWaitEvent and uses a stream memory wait operation instead. This allows the hipStreamWaitEvent to be non-blocking on the host. Change-Id: Ie5530febda5a5bcb5daa0db8a01249d6b137fd43 [ROCm/clr commit: `721c5800ca`]	2025-02-21 11:46:09 -05:00
Julia Jiang	1495cc77eb	SWDEV-513294 - fix regression on SVM sub-test failure in Conformance Change-Id: Ic2449dd34a9cd2b623d5f8fbe89fd042566a56e3 [ROCm/clr commit: `b7eaec76fc`]	2025-02-20 15:40:23 -05:00
kjayapra-amd	010253430f	SWDEV-516303 - Remove SDMA retainer logic to select the engine. Change-Id: I818129444131825cdb87e06cb495afa3e5cdb683 [ROCm/clr commit: `1f583a6870`]	2025-02-20 11:34:38 -05:00
German Andryeyev	a7f3ad7867	SWDEV-515356 - Make the round-robin queue selection - Add custom compare to the map of queues, which will help with the round-robin selection Change-Id: Ie67a820bfb1a5b484a1b3edced967eed94228bb8 [ROCm/clr commit: `ba8e740be4`]	2025-02-20 11:09:54 -05:00
German Andryeyev	f9d9b2c441	SWDEV-497841 - Add virtual memory heap Add initial implementation of virtual memory heap with dynamic virtual memory mapping support for memory pools. DEBUG_HIP_MEM_POOL_VMHEAP controls the new method. Change-Id: I8dc5be2e0f34ab472f1800f43bb6243639a5e500 [ROCm/clr commit: `296dce5570`]	2025-02-20 10:55:49 -05:00
German Andryeyev	6f2a603277	SWDEV-497619 - Allocate extra space in CB Compute doesn't support IB chaining, but RGP may collect perf counters, which require more space in CB. Increase CB size if RGP is enabled. Change-Id: Iaa0a620ead8541a679b0dfe5e5711af5afdba545 [ROCm/clr commit: `63cf3057ba`]	2025-02-20 10:40:09 -05:00
Jimbo Xie	8a42a52d0f	SWDEV-477219 - implement hipEventRecordWithFlags Change-Id: Icf07e85fc8c15f921f6e7c9fbd31dd3856dc988b [ROCm/clr commit: `7a4a22d454`]	2025-02-19 13:53:00 -05:00
Jatin Chaudhary	16f9dbff6c	SWDEV-511239 - make fp8 standalone host compileable - Use correct header in device_library_decl - use std:: instead of __hip_internal:: for host compilation - hide device specific stuff behind __clang__ and __HIP__ check Change-Id: I2f3647e00555ed0e79f9954a459c41394c3cd49b [ROCm/clr commit: `c3f49c8788`]	2025-02-18 19:07:45 -05:00
Jatin Chaudhary	508d043176	SWDEV-515255 - do not free bitcode object before code gen - Also add a cache, which allows compiled code objects to be reused instead of compiling again. This should improve performance on multigpu systems. Change-Id: Ib135d616c076b77f8aaf28de275d408b38021d89 [ROCm/clr commit: `0391aec14a`]	2025-02-18 12:39:31 -05:00
Tim Gu	8fcbc2acfe	SWDEV-502248 - Parse file path with space characters Signed-off-by: Tim Gu <Tim.Gu@amd.com> Change-Id: I67fb9cf5559c9c06f24627a1b25fec3e89b2d1cf [ROCm/clr commit: `84a867fb73`]	2025-02-18 10:31:21 -05:00
agunashe	52a1f5dbf7	SWDEV-507967 - Deprecate gfx9, gfx8, gfx7 on Windows PAL_CLIENT_INTERFACE_MAJOR_VERSION from 872 --> 910 Change-Id: I03dfa2924ccdae4c2f13f09d5f34ee58298e1343 [ROCm/clr commit: `ea804e16f8`]	2025-02-17 02:59:41 -05:00
Anusha GodavarthySurya	c6bea0ea59	SWDEV-469422 - hipgraph remove static typecast to parent Change-Id: I339250cfd26a7c04543722a82301acbb41c7d5d7 [ROCm/clr commit: `199e464402`]	2025-02-14 11:09:32 -05:00
David Salinas	e2da5772ff	Deprecate roc-obj* tooling - make Perl packages RECOMENDS/SUGGESTS for hip-dev - update CHANGE log SWDEV-511528 - TECH Remove ROCM Perl dependency - hip-dev SWDEV-333176 - Shift functionality of 'roc-obj-*' perl scripts into llvm-objdump Change-Id: Iec3ba245848781f95c825f0d37aff4b4fb54f5e4 [ROCm/clr commit: `c942833b34`]	2025-02-13 11:42:57 -05:00
Vladana Stojiljkovic	7078aab436	SWDEV-510059 - Format CU mask properly Change-Id: I80e94b4f3ea25f6988fc06d83aeb398e81ccddd1 [ROCm/clr commit: `061c5d877f`]	2025-02-13 11:02:56 -05:00
harkgill-amd	cac2e94141	Specify C++ language mode for warning post amdgpu-arch failure Change-Id: I55bf6734a1e8dc06dd0a1ee12086b7667332206f [ROCm/clr commit: `935b538261`]	2025-02-13 09:40:13 -05:00
Aidan Belton-Schure	4b4a35b86b	SWDEV-508279 - Improve HIP event profiling There are 2 functional changes to this patch: * Use GPU timing for internal markers for HIP. * Measure CPU time closer to GPU timer, to reduce delta between GPU/CPU timestamp measurements. There are some smaller non-functional updates: * waifForFence -> waitForFence typo * Remove unused drmProfiling Change-Id: I4c5fa600a842ab60e454888779edcac8449a902a [ROCm/clr commit: `179801a750`]	2025-02-13 04:15:40 -05:00
Jatin Chaudhary	5725b99619	SWDEV-474146 - use __bf16 to do operations Change-Id: I568dfa97238fd760f5362a8e560c33402f96cff3 [ROCm/clr commit: `c23913f6e7`]	2025-02-12 07:03:05 -05:00
Jatin Chaudhary	db2a3214c4	SWDEV-504769 - Allow hipEvent_t to record on hipStreamLegacy Change-Id: Ib86412255adad172598620ea81214e5eb56020ea [ROCm/clr commit: `e560d94d2c`]	2025-02-12 07:02:35 -05:00
Ioannis Assiouras	a349b23474	SWDEV-514686 - Fixed hipEventSynchronize/hipStreamWaitEvent for IPC events Resolved an issue where hipEventSynchronize and hipStreamWaitEvent APIs did not function correctly for events created with the hipEventInterprocess flag. The bug caused the event to be incorrectly marked as "recorded," leading to these APIs failing to wait for the event as expected. Change-Id: Ic9fdfaab2393beb93d6e0b83661545e902a63499 [ROCm/clr commit: `1cdfbfd270`]	2025-02-11 18:43:06 -05:00
kjayapra-amd	1f648c7d94	SWDEV-511672 - Special case the Remote USWC memory usage for HIP, if the alloc size is large. Change-Id: I524c1402b249cedfd58b56f494caa2ac057e1623 [ROCm/clr commit: `cf6aabb823`]	2025-02-11 06:42:18 -05:00
Saleel Kudchadker	71e1a0b10d	SWDEV-504494 - Further copy improvements - Fix regression for D2H pinned copies which adds systemscope release. - Skip cpu wait for D2H unpinned copies as we can pass the signal of the barrier to rocr copy. - Fix an old bug in sdmaEngineRetainCount_ logic - Improve logging Change-Id: If074bddb05564b15949b0d5f9bf12acd3692174e [ROCm/clr commit: `4c95ee5e1e`]	2025-02-11 00:55:52 -05:00
victzhan	7cd780c1cb	SWDEV-485042 - Remove -I option passed into comgr when file type is not FILE_TYPE_ASM_TEXT Change-Id: If8e469f881651f7b3dae364e8182ef1ba6f3a0d1 [ROCm/clr commit: `ca35d93672`]	2025-02-10 11:47:04 -05:00
Ioannis Assiouras	eb77b9aba6	SWDEV-508435 - Use the stream of the src/dst image memory object in A2H and H2A commands Change-Id: I9b776a54760a4633d5f84cf7b467d2d3ba8cbdde [ROCm/clr commit: `a8edb8d467`]	2025-02-07 13:38:31 -05:00
taosang2	f84a8e62d3	SWDEV-446880 - Make ocltst MemoryInfo pass in EMU Make ocltst -m tests/ocltst/liboclruntime.so -t OCLMemoryInfo pass in emu where GPU memory is very big. Cherry pick https://gerrit-git.amd.com/c/compute/ec/clr/+/1014858 Change-Id: I0228c5e87ce7c366983fd4af71c25e7f8161c2c7 [ROCm/clr commit: `de83d7a6ae`]	2025-02-07 09:16:24 -05:00
Satyanvesh Dittakavi	8daab29f7f	SWDEV-477584 - hipExtGetLastError should return the immediate previous API error hipGetLastError should return the error by any of the previous APIs in the same host thread to match the CUDA behavior, whereas hipExtGetLastError will return the error by the immediate previous API. This Ext API was added earlier to facilitate the existing HIP apps which are following the current behavior of hipGetLastError Change-Id: I61e95b1fc136cc761e2434e02187b7ed2598b733 [ROCm/clr commit: `4b443f8133`]	2025-02-06 23:30:48 -05:00
Ioannis Assiouras	6a00aa8d61	SWDEV-508435 - Added a fix for double free of hsaImageObject Change-Id: I9397f7c9dbbad7c249b359155df312cb920eba6c [ROCm/clr commit: `d05ecea253`]	2025-02-05 22:21:24 +00:00
Ioannis Assiouras	c0b728fcad	SWDEV-513323 - Fix for BatchMemOp on devices with no image support BatchMemop should be positioned before the image support kernels because the total number of kernels is determined by BlitLinearTotal, when there is no image support on the device. Change-Id: I8e53caf744ba54259ac04bad1762eef21806f3f2 [ROCm/clr commit: `3e01da3dac`]	2025-02-05 04:45:22 -05:00
Anusha GodavarthySurya	5535f15104	SWDEV-469422 - hipGraph move to classes from structs Change-Id: I0f9c8ef1161c0c92ebe0cce6844b2feacfee83f5 [ROCm/clr commit: `32e5b00c30`]	2025-02-05 00:33:41 -05:00
taosang2	27e87ccca6	SWDEV-513458 - Add gfx950 target ID Add gfx950 target ID Cherry-picked https://gerrit-git.amd.com/c/compute/ec/clr/+/997678 https://gerrit-git.amd.com/c/compute/ec/clr/+/1063519 Change-Id: I0228c5e87ceec366983fd4afb1c25e7f8161c2c2 [ROCm/clr commit: `29cc394510`]	2025-02-04 18:30:23 -05:00
Steven Chung	5513df58eb	SWDEV-496674 - Convert non-templated typedefs to templates for consistent mangling Change-Id: I952d15f20afc85c0118403f82e75360197049ef5 [ROCm/clr commit: `782976f5c2`]	2025-02-04 16:37:00 -05:00

1 2 3 4 5 ...

12702 Cometimentos