rocm-systems

Author	SHA1	Message	Date
Jaydeep Patel	24c57cb984	SWDEV-496544 - Reset mem alloc node count for AutoFreeOnLaunch. Change-Id: Ib32b04584548a46632606ecd85b58c6ce4a5894d	2024-11-11 11:03:32 +00:00
taosang2	cc25c5d646	SWDEV-487356 - Fix AMD LOG compiling warning Change-Id: I757185f9c7c12f736e266219b67daf5836d2a125	2024-11-09 12:57:22 -05:00
Saleel Kudchadker	582dc7dd6d	SWDEV-446123 - Revert "Match hipGetLastError behavior with CUDA using env var" This reverts commit `5f477900a3`. Reason for revert: <INSERT REASONING HERE> Change-Id: I11a456655393bcf4b82d749ce7259bc1b78d1424	2024-11-08 20:35:13 -05:00
Satyanvesh Dittakavi	5f477900a3	SWDEV-446123 - Match hipGetLastError behavior with CUDA using env var Change-Id: Iaec697c1304d746376ecf2bfe2ad683b15ee189f	2024-11-07 12:02:34 -05:00
Alex Xie	b5aab7f6b4	SWDEV-491168 - expose one more API in glGetExtensionFuncionAddress Change-Id: Ia78b6b9377d99298c68968ef0eddd87534e0f750	2024-11-07 10:15:05 -05:00
Branislav Brzak	6e55f40313	Revert "SWDEV-465201 - Match negative hipModuleLoad return with Cuda" This reverts commit `68e5aeb93d`. Reason for revert: Breaking change that will be merged in at a later date Change-Id: Idd300492cc08a57c50decc22df287ddcc5463c88	2024-11-07 09:51:50 -05:00
Vladana Stojiljkovic	62e1d50c2a	Revert "SWDEV-491452 - Allow hipMemAdvise capturing only in relaxed mode" This reverts commit `02bbe11e56`. Reason for revert: It is considered a breaking change. Change-Id: Ia1f7b27a6d3caa4a9fe12bee469f5f0a6b8327fd	2024-11-07 09:50:21 -05:00
Vladana Stojiljkovic	ed458615c4	Revert "SWDEV-492768 - Match hipStreamAddCallback capture behavior with nvidia" This reverts commit `f6c8bbf4dc`. Reason for revert: It is considered a breaking change. Change-Id: I9b2dfc60f62eec7268c0374b0b33e45eca7b3bbb	2024-11-07 09:49:55 -05:00
Vladana Stojiljkovic	48df130d87	Revert "SWDEV-490474 - Allow hipMallocManaged capturing only in relaxed mode" This reverts commit `6deecf1bfe`. Reason for revert: It is considered a breaking change. Change-Id: I6bfd81dcc16d36304efa2f151969427249a4122f	2024-11-07 04:33:07 -05:00
Julia Jiang	11a4cb3c34	SWDEV-494501 - Fix CTS api negative sub-test failures Change-Id: I0dd67467d977569fca45dcdad28a96c839bae33b	2024-11-05 14:58:01 -05:00
Ajay	aa4f27c29b	SWDEV-495842 - use find_program clang, llvm-mc instead of hip_package Helps in needing minimum files to build hip runtime on windows Change-Id: I1d27b72d3b0551b9261524606f08361d00463394	2024-11-04 23:09:11 -05:00
Branislav Brzak	68e5aeb93d	SWDEV-465201 - Match negative hipModuleLoad return with Cuda Change-Id: Id6c682e7f9ec8d34e9c8d5d5933f31e003c67e27	2024-11-04 03:58:06 -05:00
Jaydeep Patel	c24bc05cac	SWDEV-491409 - Handle DynCO path to delete DeviceVars. Change-Id: I757185e9c7c12f236e266219b67daf5836d2a122	2024-11-04 01:52:26 -05:00
Saleel Kudchadker	f1e98ab6e4	SWDEV-483586 - Do not take pinned path for read/write - When GPU_FORCE_BLIT_COPY_SIZE is set do not take pinned path Change-Id: Iaa065db63cc8fda61f82e6c9701e9fdaec5c54cb	2024-11-01 12:55:15 -04:00
Satyanvesh Dittakavi	a26dc29eb9	SWDEV-491967 - Add the right VGPRs per SIMD and VGPR Granularity for gfx12 - Default values are being assigned causing occupancy calculation to go wrong without the right values defined for gfx12 ASICs - Also added these values for gfx1105 Change-Id: I611cc3a8ed8c57f2def637310ce1c3a48c16a574	2024-11-01 12:47:23 -04:00
Rahul Manocha	314d4a2c22	SWDEV-490864 - Optimize Alloc Node detection in graph Change-Id: I6ac32f9abd0b44864071a0a9396463cb13f6941f	2024-11-01 11:45:49 -04:00
Konstantin Zhuravlyov	3387f48b56	SWDEV-428601 - Don't enforce 1 isa per device in rocm backend - Device can have multiple isas as per HSA spec - First isa is most specific one, so this change is sort of a NOP Change-Id: Ib332af21745f2e6a7c25db8986bf7717501059bc	2024-11-01 11:01:02 -04:00
Tao Sang	802cacf3e9	SWDEV-487356 - Fix AMD LOG issue in Win32 Change-Id: Ia1c19cf4ea24188cdb2d374b01f975f794e02dbf	2024-11-01 08:26:25 -04:00
Satyanvesh Dittakavi	ba2ebb3b99	SWDEV-489570 - Update AQL packet in hipDrvGraphExecMemsetNodeSetParams After setting the new params in hipDrvGraphExecMemsetNodeSetParams, we need to update the AQL packet as well, otherwise during the graph launch it still dispatches the packet which has the original params and not the updated one. Change-Id: Ie49a641ba3f66c8085a29f92d88ac6ea6a1c0534	2024-11-01 07:01:10 -04:00
Jaydeep Patel	4aa52155ee	SWDEV-491149 - OCL does not need to update scratch as it can't update stack size using API unlike hipDeviceSetLimit. for HIP, Update should be only if compiler notifies use of stack size. Change-Id: Ic781bcac6fcf586da39ec4aafd4809da3652ede3	2024-11-01 01:05:07 -04:00
Vladana Stojiljkovic	f6c8bbf4dc	SWDEV-492768 - Match hipStreamAddCallback capture behavior with nvidia Change-Id: I7a084d8eeffe8b5095f7eb9969a565a40e76bb4b	2024-10-31 12:42:17 -04:00
Vladana Stojiljkovic	02bbe11e56	SWDEV-491452 - Allow hipMemAdvise capturing only in relaxed mode Change-Id: I1ca5e050ff869b486e3a0a41d7f06390a88e1110	2024-10-31 12:41:47 -04:00
Vladana Stojiljkovic	e08df57502	SWDEV-493526 - Create kernel node when hipLaunchByPtr is captured Change-Id: Id3493485dfdb468436ab33e6d7cb19b6b0066fd4	2024-10-31 12:41:31 -04:00
Vladana Stojiljkovic	ec60bb1aed	SWDEV-489571 - Fix ihipGraphAddMemsetNode to allow memset of 3d portions of an array * When hipMemset3dAsync is captured, a 3d extent can be set as a parameter (depth > 1). That worked on nvidia, but on amd wrong portion of array was filled because when creating Memset3D command, extent dimensions were used to create pitchedPtr, instead of original array width and height. * Also, when capturing hipMemset3dAsync, nvidia allows any of the extent dimensions to be 0, and in that case, no work should be done. Change-Id: I46a605bf9ae801cd3348e98d528c21263a8eefce	2024-10-31 10:29:54 -04:00
Alex Xie	f8c56f6bac	SWDEV-489468 - make resource cache bigger for APU Change-Id: I065c712acd06c273a0b194fe792ec4f876fa9c46	2024-10-31 09:55:01 -04:00
Tao Sang	82dff9a67d	SWDEV-492563 - Fix Ocl issues 1. Fix LDSSize type to be uint32_t. 2. Prevent clWaitForEvents running on complete events whose HostQueue have been destructed. Change-Id: I829e915f56b37db2ba76bb876c9656166534f154	2024-10-30 19:15:59 -04:00
Saleel Kudchadker	e23ff0520b	SWDEV-491375 - Improve MemObjMap perf - Create bins each with its own map and lock. This would help cases where the hash of a VA is different than the one which falls in a different bin, and there is no lock contention - Use STL shared mutexes, that way we can unique_lock for map updates vs simple reads which can use shared_lock Change-Id: I118818be65c6373700f5e511045babb6a398938a	2024-10-30 05:37:33 +00:00
German Andryeyev	403f624bf8	SWDEV-486602 - Add tracking of HSA handlers Add an atomic counter to track the outstanding HSA handlers. Wait on CPU for the callbacks if the number exceeds the value in DEBUG_HIP_BLOCK_SYNC env variable. Change-Id: I95dc8c4bf0258c7e59411b7504220709ed6898c5	2024-10-25 15:20:50 -04:00
Sameer Sahasrabuddhe	556390f9c7	SWDEV-490198: _sync() will be enabled by default in 6.4 Change-Id: Id029424a9c0f6b144a7aa0e96fe8acc4a138ec51	2024-10-25 09:54:40 -04:00
Julia Jiang	9f2f6a8aa7	SWDEV-488396,489257 - Fixed the regression in CTS pipes sub-test failure Change-Id: Id4004f0d6da5754b12c9a21038de50472cb1fee5	2024-10-25 05:58:46 -04:00
German Andryeyev	dceb320ba7	SWDEV-440746 - Fix a typo with GPU_PINNED_XFER_SIZE Change-Id: I8fdbfb4e1c6b1274206c28a529eee9ebeaaa26fb	2024-10-24 18:33:14 -04:00
Sourabh Betigeri	64e1b15551	SWDEV-450052 - Return if numDevices is more than device count on the platform Change-Id: I538106d1b02084df9cd06b41427629207312e76f	2024-10-24 17:07:11 -04:00
Julia Jiang	6f30ae102c	SWDEV-479940 - Updating the format of changelog Change-Id: I8aedb47b0de3ed656993bbcf9d7bc0fe3720f391	2024-10-23 11:32:40 -04:00
Anusha GodavarthySurya	f9f995c6d0	SWDEV-480209 - Handle GraphExec object release => GraphExec instance is destroyed before async launch completes, destroy after all pending graph launches => Remove GraphExec destroy during next sync point(hipStreamSync, hipDeviceSync etc..) Change-Id: I4df682aae5787fd6e5240a7be936ce50361345d0	2024-10-22 12:30:46 -04:00
David	05d6f75830	Changes needed for hipcc/hipconfig rename and cleanup - HIPCC, on Linux, will be removing high-level perl scripts (hipcc/hipconfig) in ROCm 6.3 - removes renaming hipcc.bin/hipconfig.bin logic SWDEV-467478 - HIPCC Clean up Perl Change-Id: I829e915d56b37cb2ba76bb876c6656166534f15c	2024-10-22 04:46:33 -04:00
Anusha GodavarthySurya	b498103f9b	SWDEV-485904 - propagate hsa_amd_vmem_address_free error to hip API Unit_hipMemSetAccess_GrowVMM test fails with HSA_STATUS_ERROR_RESOURCE_FREE silently Change-Id: I7a78410e432de4a2e877062782abf8761645f392	2024-10-21 10:12:32 -04:00
Jaydeep Patel	1f55a707b4	SWDEV-482751 - Use ocl-icd-devel package for SLES. Change-Id: I30e6243d697dc984a42051c20e336551d50d8e94	2024-10-20 23:55:02 -04:00
German Andryeyev	6bb7d1afdc	SWDEV-486602 - Fix Windows 32 bit build Windows aligns fields to 8 bytes even with 32bit builds. Add BUG_CLR_SYSMEM_POOL to control sysmempool. Change-Id: I8622aabc9f7391ed7dd8583b252ce9eb41d62293	2024-10-18 11:35:54 -04:00
Vladana Stojiljkovic	6deecf1bfe	SWDEV-490474 - Allow hipMallocManaged capturing only in relaxed mode Change-Id: I02dccc6c45e39082ef925509a28bbe3c2a0fb7c6	2024-10-18 04:52:01 -04:00
Saleel Kudchadker	0f2342bc13	SWDEV-491375 - Optimize multithreaded dispatches - Fix typo Change-Id: If4c68455dcfa03fee18cb4720e8b5b438642703c	2024-10-17 17:02:23 -04:00
German Andryeyev	ad18146d8f	SWDEV-486602 - Change SysmemPool implementation - Remove the list of all chunks and use embedded chunk information in each allocation. That simplifies Free() logic, avoiding expensive loop if for some reason the number of outstanding allocations significantly grew. Change-Id: I9ea84d314320ce356ed24dd3180f262e2116c59b	2024-10-17 12:39:39 -04:00
Rahul Manocha	e729f08704	SWDEV-468039,SWDEV-482579 - Enable FP8 SW Conversions on pre gfx940 archs 1) SW Conversions for ocp and fnuz are enabled on pre mi300 archs 2) for mi300 only fnuz is enabled 3) for gfx1200 only ocp is enabled Change-Id: I90373752a2d15eff20d5deec874ed396ba4e1788	2024-10-17 11:49:22 -04:00
German Andryeyev	8657a77029	SWDEV-491375 - Limit the SW batch size Applications may submit commands without waits for GPU. That causes a growth of SW unreleased commands. Make sure runtime flushes SW queue, if it grows over some threshold, controlled by DEBUG_CLR_MAX_BATCH_SIZE. Change-Id: Ia4d85c24210ef91c394f638ab6b53b14323a0396	2024-10-17 10:53:57 -04:00
Alex Xie	df9ae754a4	SWDEV-482751 - Depends on distro opencl icd loader Since we don't distribute icd loader, we need to install distro icd loader. Change-Id: I1ea86bcf7c642a034c53f71130b15de1fa27e31e	2024-10-16 16:21:58 -04:00
Ajay	ff306ce9d8	SWDEV-482751 - add distro path to find package AMD_ICD Change-Id: I0d21f6ba6ade3ed932b134da503f639fd5d0d552	2024-10-14 15:27:34 -07:00
German Andryeyev	364dfb0ed1	SWDEV-486602 - Optimize HSA callback performance - Don't generate callbacks for HIP events - Don't process profiling info in the callback for HIP events - Wait for CPU status update of the submitted commands every 50 calls. That will allow to drain the commands and destroy HSA signals. Change-Id: Ib601a350e7e7c2b6c6209a172385389baccf73a9	2024-10-11 14:50:25 -04:00
Ioannis Assiouras	5da72f9d52	SWDEV-490323 - Fix validateMemAccess in hipMemset Changed the validation to occur on the sub-object rather than the parent. Change-Id: I87bf5ef3526d0db9304099ef9ac1a5494e9a01a9	2024-10-10 18:08:28 -04:00
Todd tiantuo Li	41dc4545fc	SWDEV-472357 - support Rect copy with staging buffer for 2D & 3D memcpy in PAL Change-Id: Ie32f3e5a6fa077f6b2db20fc1ab1e2e0da8344cb	2024-10-10 18:00:19 -04:00
kjayapra-amd	e7c0e06b5e	SWDEV-486510 - Delete hip::Function object, in case compiler passes duplicate hostFunction ptr. Change-Id: Ic8714eb9022a0f2150b2ea5dc008cecd7a9fae27	2024-10-10 12:45:58 -04:00
Vladana Stojiljkovic	6f2bad3998	SWDEV-489823 - Fix hipStreamEndCapture leak when capture is invalidated Change-Id: If8f5163d70e04d34a75fd0a7ba6c0a15ea59bb8b	2024-10-10 04:38:06 -04:00

13217 Commits