GunaShekar, Ajay
bfcf0ef4e8
SWDEV-543366 - Bump PAL_CLIENT_INTERFACE_MAJOR_VERSION 916 --> 932 ( #725 )
...
Co-authored-by: Lin, Qun <Quentin.Lin@amd.com >
Co-authored-by: Lin,Qun <qlin@amd.com >
[ROCm/clr commit: ed903e8889 ]
2025-08-08 08:45:42 -07:00
Assiouras, Ioannis
b3ae0230a5
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it i… ( #802 )
...
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it is iignored
[ROCm/clr commit: 4646a756d9 ]
2025-08-08 13:09:56 +01:00
Kudchadker, Saleel
3a849c6962
SWDEV-538195 - Introduce threshold for handler submission ( #723 )
...
- When doing device/stream sync, we can submit a handler which may
introduce some host side delays. Use DEBUG_CLR_BATCH_CPU_SYNC_SIZE to
batch commands for host wait. Default for HIP is 8 commands.
- Investigation is underway in ROCr but need to address this for now in
HIP runtime.
[ROCm/clr commit: 9b045922a8 ]
2025-08-06 20:34:42 -07:00
Xie, Pengda
9cbbee4d6e
SWDEV-520384 - Improve Fat Binary loading latency ( #390 )
...
Init and fini kernel needs to be launched when we load and unload code object. Avoid looping through all kernels within a code object just to run the init and fini kernels. Compiler currently only generates 1 init and fini kernel.
[ROCm/clr commit: cd46294b31 ]
2025-08-05 14:02:05 -07:00
Xie, Pengda
213623506f
SWDEV-534394 - Kernel launch stream check ( #533 )
...
hipErrorInvalidResourceHandle is returned if stream is not on the launching device
[ROCm/clr commit: 7e777d3755 ]
2025-08-05 10:39:37 -07:00
Xie, Pengda
b7d8cb56d1
SWDEV-505833 - Remove DEBUG_CLR_SKIP_RELEASE_SCOPE flag ( #735 )
...
Cleanup debug flag DEBUG_CLR_SKIP_RELEASE_SCOPE
[ROCm/clr commit: 4121a860bf ]
2025-08-05 08:31:55 -07:00
Brzak, Branislav
d54e70063e
SWDEV-546137 - Don't rely on __cplusplus under hiprtc environment ( #806 )
...
[ROCm/clr commit: 111f12cbed ]
2025-08-04 21:33:26 +02:00
Assiouras, Ioannis
b443903ad8
SWDEV-533754 - Fixed mem leak in capturing gpu packets ( #365 )
...
[ROCm/clr commit: f094d16157 ]
2025-08-04 14:58:04 +01:00
Betigeri, Sourabh
40999496c1
SWDEV-545273 - Respect HIP_LAUNCH_PARAM_BUFFER_SIZE ( #770 )
...
[ROCm/clr commit: 2a02d2c2f3 ]
2025-08-03 17:32:52 -07:00
Kudchadker, Saleel
a1d3da6bd3
SWDEV-547614 - Demangle graph kernel names ( #809 )
...
[ROCm/clr commit: 1492328894 ]
2025-08-01 14:35:30 -07:00
Sumner, Brian
b617e8feb6
Fix undefined symbol at runtime with newer versions of LLVM ( #791 )
...
[ROCm/clr commit: 2da49283c8 ]
2025-08-01 08:15:02 -07:00
Patel, Jaydeepkumar
4458842ceb
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is … ( #683 )
...
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is nullptr to match with hipFree.
[ROCm/clr commit: c329b548eb ]
2025-08-01 14:41:40 +05:30
Chaudhary, Jatin Jaikishan
b69f832430
SWDEV-543350 - restrictions on usage of hipMemcpyWithStream ( #727 )
...
* SWDEV-543350 - Do not hipMemcpyWithStream to be called during stream capture
* update comment
---------
Co-authored-by: Jatin Chaudhary <jatchaud@amd.com >
[ROCm/clr commit: 76d2c4135e ]
2025-08-01 14:41:02 +05:30
Manocha, Rahul
4a93a614e5
SWDEV-539710 - Defer allocation of managed variable ( #652 )
...
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
[ROCm/clr commit: 3f6f9d6081 ]
2025-07-31 08:30:23 -07:00
Xie, Jiabao(Jimbo)
405962df53
SWDEV-532824 - Fixed log_printf errors for codeql ( #427 )
...
Co-authored-by: Jimbo Xie <jiabaxie@amd.com >
[ROCm/clr commit: c88f345229 ]
2025-07-31 08:17:41 -07:00
Manocha, Rahul
86039381a2
SWDEV-540733 - Add PCI CHIP ID device attribute ( #647 )
...
SWDEV-540733 - Add PCI CHIP ID Device Attribute
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
[ROCm/clr commit: 3bbc8bd170 ]
2025-07-30 19:43:07 -07:00
Hernandez, Gerardo
f47693b755
SWDEV-534207 - fix use of uninitialized variable at hipEventRecord_common() ( #771 )
...
[ROCm/clr commit: 2c6ec7df45 ]
2025-07-23 16:49:12 +05:30
Sang, Tao
1aae121e86
SWDEV-533964 - use __builtin_readcyclecounter() only in clock64 ( #368 )
...
[ROCm/clr commit: 5fc6208b19 ]
2025-07-22 10:41:23 +05:30
Sang, Tao
2296162578
SWDEV-502579 - Remove duplicated symbols ( #21 )
...
Remove duplicated compiler symbols from hip-rt. hip-rt should use symbols such as EF_AMDGPU_MACH_AMDGCN_XXX and
ELFABIVERSION_AMDGPU_HSA_VX defined in compiler header.
[ROCm/clr commit: b15c1657b8 ]
2025-07-22 10:41:01 +05:30
Dittakavi, Satyanvesh
a3e576a317
SWDEV-542905 - Update the appropriate grid size in capture hipExtModuleLaunchKernel ( #740 )
...
[ROCm/clr commit: c43ee71e96 ]
2025-07-21 13:57:15 +05:30
Jiang, Julia
005672c314
SWDEV-523483 - Update atomics add-operation in CLR ( #751 )
...
[ROCm/clr commit: 9cd8757717 ]
2025-07-19 06:48:54 +05:30
Betigeri, Sourabh
5683c5fb61
SWDEV-537486 - Change the SPIRV bundle entry id to match 4-field triple ( #513 )
...
[ROCm/clr commit: 1edac1bf3a ]
2025-07-17 09:29:51 +05:30
Betigeri, Sourabh
52363f797e
SWDEV-540629 - Update the current device ID ( #699 )
...
[ROCm/clr commit: 0313e20706 ]
2025-07-15 21:40:26 +05:30
Betigeri, Sourabh
680c7fbf64
SWDEV-483895 - Use ROCr to reserve memory for HMM ( #590 )
...
[ROCm/clr commit: a1f056bd11 ]
2025-07-15 21:40:13 +05:30
Dittakavi, Satyanvesh
b6d69d162e
SWDEV-542832 - Refactor vector types alignment ( #713 )
...
[ROCm/clr commit: 918b0bdb7a ]
2025-07-15 21:09:31 +05:30
Xie, Jiabao(Jimbo)
e5226f2b7e
SWDEV-460151 - add gfx1201 to amd-staging clr ( #183 )
...
* SWDEV-460151 - add gfx1201 to amd-staging clr
* SWDEV-460151 - removed pal macro
---------
Co-authored-by: Jimbo Xie <jiabaxie@amd.com >
[ROCm/clr commit: a5d932f160 ]
2025-07-11 08:51:47 -07:00
Sang, Tao
a777d6eca0
SWDEV-539145 - Simplify host memory pool management ( #668 )
...
* SWDEV-539145 - Simplify host memory pool management
Remove unnecessary variables and functions.
Make code simpler and clear.
* Change cpu_agent_info_ into pointer.
* Restore getPreferredNumaNode()
[ROCm/clr commit: 1351cd7fa8 ]
2025-07-11 10:38:40 -04:00
GunaShekar, Ajay
8e1f5f50cd
SWDEV-541623 - do not print hip library path on console ( #707 )
...
DeviceTest.exe Unit_Uuid_FntlTstsFor_SetEnv_HIP_VISIBLE_DEVICES
[ROCm/clr commit: b568971718 ]
2025-07-10 12:07:03 -07:00
Sang, Tao
7fd322af90
SWDEV-508776 - support VGPRs validation ( #274 )
...
Clarify some VGPRs terms description.
Fix some wrong query logics of availableVGPRs_ and
availableRegistersPerCU_ in device info.
Add hipDeviceAttributeMaxAvailableVgprsPerThread
attribute query.
Remove hardcoding of following
info_.vgprAllocGranularity_
info_.vgprsPerSimd_
[ROCm/clr commit: 397f303d97 ]
2025-07-09 10:46:52 -04:00
Salinas, David
84e9845897
SWDEV-536367 - Make DEPRECATED message in roc-obj tools STDERR ( #469 )
...
[ROCm/clr commit: 71c6535b07 ]
2025-07-09 17:01:33 +05:30
Brzak, Branislav
afa60c590c
SWDEV-540609 - hipGraph capture of hipExtModuleLaunchKernel assumed grid size was a multiple of block size ( #694 )
...
[ROCm/clr commit: 32e027e66c ]
2025-07-09 16:05:40 +05:30
Kudchadker, Saleel
433c25eab0
SWDEV-539378 - Use agent of IPC memory owner ( #570 )
...
- Currently runtime just uses the local agent as it did not check for
IPCShared()
- With this fix we query hsa_amd_pointer_info and get the right agent
for the memory to pass it to the HSA copy api
[ROCm/clr commit: 46d766e4e2 ]
2025-07-08 12:02:01 -07:00
Belton-Schure, Aidan
88c1717658
SWDEV-515426 - Remove HIP_USE_RUNTIME_UNBUNDLER ( #205 )
...
* remove HIP_USE_RUNTIME_UNBUNDLER
* clang-format
* Generic to use comgr
* Remove HIP_ALWAYS_USE_NEW_COMGR_UNBUNDLING_ACTION flag
* Removes runtime unbundling unused and debug Code
* Removes stale functions
[ROCm/clr commit: 81238db679 ]
2025-07-08 21:45:31 +05:30
Manocha, Rahul
8e8dc41cf0
SWDEV-532420 - Fix kokkos P2P copy failure with vmheap ( #426 )
...
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
[ROCm/clr commit: 22b1ca4d8c ]
2025-07-07 17:27:13 -07:00
Xie, Pengda
8e5921174f
SWDEV-540576 - Abort if user request a core dump ( #653 )
...
* SWDEV-539414 - Return error status from runtime handler when HIP_SKIP_ABORT_ON_GPU_ERROR is false
* SWDEV-539414 - default handler when GPU core file is generated
* SWDEV-540576 - Abort if user request a core dump
Change-Id: I9e2c640acf559880bd13641de9103e660ef822a3
---------
Co-authored-by: Assiouras, Ioannis <Ioannis.Assiouras@amd.com >
Co-authored-by: agunashe <ajay.gunashekar@amd.com >
[ROCm/clr commit: 9c1bff0ae7 ]
2025-07-03 00:29:14 +05:30
Xie, Pengda
19007f7015
SWDEV-536925 - Check for C++ compiler instead of C when building hip ( #490 )
...
Change cmake to use CMAKE_CXX_COMPILER_ID since CMAKE_COMPILER_IS_GNUCC is deprecated
[ROCm/clr commit: 21f6842fff ]
2025-07-01 12:59:54 -07:00
Sang, Tao
287ec83f5c
SWDEV-539145 - Support extended fine grained system memory pool ( #603 )
...
* Add hipHostMalloc() new flag hipHostMallocUncached which will force to allocate pinned
host memory on extended fine grained system memory pool.
* Add hipHostAlloc() new flag hipHostAllocUncached which will force to allocate pinned
host memory on extended fine grained system memory pool.
* Add hipHostRegister() new flag hipHostRegisterUncached which will force to map
host memory onto extended fine grained system momory pool.
[ROCm/clr commit: a7d7687b8f ]
2025-07-01 10:10:10 +05:30
Trinh, Ethan
66b125d5dc
SWDEV-539861 - add ROCclr version to logs ( #612 )
...
[ROCm/clr commit: 2ce45143d8 ]
2025-06-27 16:37:58 -04:00
Xie, AlexBin
534bd7413a
SWDEV-530767 ocltst GL test work with GL context on 2nd GPU ( #324 )
...
Change-Id: Ia3dbc661b3b9b1de71a632969707318f208131cd
[ROCm/clr commit: ef26fcdbd2 ]
2025-06-27 12:25:23 -04:00
Xie, AlexBin
a158b5c4a2
SWDEV-540176 - deb and rpm package include OpenCL config files ( #615 )
...
[ROCm/clr commit: 592c25e936 ]
2025-06-27 11:59:24 -04:00
Guan, Zichuan
c43b8ca05c
Fix memory leak in opengl interop ( #541 )
...
Signed-off-by: zichguan-amd <zichuan.guan@amd.com >
[ROCm/clr commit: 51508e0bef ]
2025-06-27 09:51:18 -04:00
Belton-Schure, Aidan
b735783ae7
SWDEV-536813 - Make alignment 16 for special cases ( #609 )
...
[ROCm/clr commit: 5e77c58bd0 ]
2025-06-27 14:20:16 +05:30
Lin, Qun
ced47b3588
SWDEV-508869 - Replace asicRevision with gfxip triple in paldevice ( #601 )
...
asicRevision is not unique in pal path now, use gfxip triple to match isa is more straight forward in pal path.
[ROCm/clr commit: b584fc8cd1 ]
2025-06-27 10:09:21 +08:00
Lin, Qun
3b44884a57
SWDEV-508869 - Fix Linux build error for HIP on PAL ( #176 )
...
[ROCm/clr commit: 9699cc3864 ]
2025-06-27 07:51:22 +08:00
Luu, Jonathan
20a9e311a9
SWDEV-531400_SYMLINK remove SYMLINK ( #525 )
...
Co-authored-by: Jonathan Luu <jonatluu@amd.com >
[ROCm/clr commit: a2dafbeacf ]
2025-06-26 15:53:39 -04:00
Luu, Jonathan
81c75a7acc
SWDEV-531400 - Remove file reorganization backward compatibility (clr hip) ( #472 )
...
* SWDEV-531400 Remove file reorganization backward compatibility (clr hip)
* SWDEV-531400 remove backwards compatibility for opencl
* SWDEV-531400 remove header_template.hpp.in
---------
Co-authored-by: Jonathan Luu <jonatluu@amd.com >
[ROCm/clr commit: c64d56195f ]
2025-06-26 15:53:11 -04:00
Sang, Tao
da7c6f57ef
SWDEV-539399 - Support ROCCLR_MEM_HSA_CONTIGUOUS ( #610 )
...
Fix missing support of ROCCLR_MEM_HSA_CONTIGUOUS in
hipExtMallocWithFlags().
[ROCm/clr commit: f4d78d427c ]
2025-06-26 14:05:23 -04:00
Jiang, Julia
cc2129b2f4
SWDEV-461599 - scrub HCC in clr documents ( #569 )
...
[ROCm/clr commit: b5cd8321d3 ]
2025-06-26 15:05:59 +05:30
Sicarov, Dragoslav
dbfd869936
SWDEV-533232 Add num_threads API in cooperative_groups ( #437 )
...
* Add num_threads API in cooperative_groups
* Reimplementsize API as its alias.
* Update changelog
[ROCm/clr commit: a4631b5700 ]
2025-06-26 15:05:17 +05:30
Arandjelovic, Marko
c530e72a7c
SWDEV-512344 - Allow unmapping of only complete subbuffers ( #608 )
...
* Do not allow partial unmapping
* Add check to prevent out of bounds unmap
* Improve code quality
* Use getOrigin instead of getOffset
[ROCm/clr commit: 349b83c1f6 ]
2025-06-26 15:01:33 +05:30