Stojiljkovic, Vladana
499e2ee044
SWDEV-537855 - Free created event if hipIpcOpenEventHandle fails ( #680 )
2025-08-13 11:36:13 +02:00
Jiang, Julia
eb438107a7
525231- Adding more info in changelog for 7.0 in staging ( #527 )
...
525231- Update changelog for 7.0 in staging
2025-08-13 14:47:17 +05:30
Assiouras, Ioannis
2aa2212d93
SWDEV-549247 - Fix stream checks for hipModuleLaunchCooperativeKernel… ( #858 )
...
SWDEV-549247 - Fix stream validation checks in hipModuleLaunchCooperativeKernelMultiDevice
In CooperativeKernelMultiDevice mode, the stream's device
should be validated against the grid's assigned device
rather than the current device.
2025-08-13 07:47:49 +01:00
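The validation rule from SWDEV-549247 above can be sketched in plain C++ without the HIP headers. This is an illustrative model only: `LaunchEntry`, its fields, and `streamsMatchGridDevices` are hypothetical names, not CLR code, and the real check presumably reports a HIP error code rather than a bool.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for one per-device launch entry; not a CLR type.
struct LaunchEntry {
  int gridDevice;    // device the grid in this entry is assigned to
  int streamDevice;  // device that owns the stream in this entry
};

// Returns true when every entry's stream lives on that entry's grid device.
// The current device is deliberately not consulted.
bool streamsMatchGridDevices(const std::vector<LaunchEntry>& entries) {
  for (const LaunchEntry& e : entries) {
    if (e.streamDevice != e.gridDevice) {
      return false;
    }
  }
  return true;
}
```

Under this model, a launch list whose second grid targets device 1 but whose stream belongs to device 0 is rejected, regardless of which device is current when the call is made.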
Andryeyev, German
2305f8ae56
SWDEV-465041 - Add support for user events with DD ( #321 )
...
* SWDEV-465041 - Add support for user events with DD
User events can be replaced with HSA signals. Add the interface
to allocate an HSA signal for user events and update the status on
CL_COMPLETE.
Force pinned path with DD to avoid blocking calls. Pinned memory
can be released only when the command is complete.
Simplify device enqueue path to use generic kernel arg buffer and
signals.
* Fix notifyCmdQueue() logic for OCL
* Avoid blocking calls in OCL with DD
* Add event destruction in case of failure.
2025-08-12 19:04:36 -04:00
Assiouras, Ioannis
dc34af61d7
SWDEV-543340 - Remove shared memory objects after IPC event cleanup ( #745 )
...
This change ensures that shared memory objects (e.g., files in /dev/shm)
are unlinked once all related IPC events have been destroyed.
2025-08-12 21:23:09 +01:00
Chaudhary, Jatin Jaikishan
42730527c6
SWDEV-545100 - rename the template variable ( #798 )
2025-08-12 21:58:55 +05:30
Chaudhary, Jatin Jaikishan
4d5fe2206d
SWDEV-539481 - Add _rn variants of fp16 APIs ( #582 )
...
* Add _rn variants of fp16 APIs
* cover bf16 as well
2025-08-12 21:58:38 +05:30
Dittakavi, Satyanvesh
c60888b0cd
SWDEV-545947 - Add Implementation for hipSetValidDevices ( #805 )
2025-08-12 16:29:09 +05:30
Dittakavi, Satyanvesh
b50d70564e
SWDEV-540272 - copy params 3D struct should be populated even for memcpy 1D graph node ( #702 )
...
SWDEV-540272 - copy params 3D struct should be populated even for 1D graph node
2025-08-12 16:21:53 +05:30
Chaudhary, Jatin Jaikishan
d7f90a3120
SWDEV-540645 - use single path for code object loading ( #773 )
...
* use single path for code object loading
* Remove printf
* Address review comments
* Fix the declaration of ihipMallocManaged
2025-08-12 15:15:18 +05:30
Betigeri, Sourabh
0d9428135c
SWDEV-528351 - Clean up hipJitOptions ( #439 )
2025-08-11 21:24:44 +05:30
Betigeri, Sourabh
da61c5d1da
SWDEV-528351 - Cleanup template specializations and overloads for enums ( #360 )
...
* SWDEV-528351 - Adds template specialization for enums
* SWDEV-528351 - Removes template specializations for enums
2025-08-11 21:24:24 +05:30
Chaudhary, Jatin Jaikishan
926eedc230
SWDEV-508540 - adjust for compile time wavefront 64 ( #747 )
2025-08-11 10:17:27 +05:30
Chaudhary, Jatin Jaikishan
ee124dda09
SWDEV-508540 - fix the max blocks per multiprocessor calculation ( #737 )
2025-08-11 10:17:18 +05:30
GunaShekar, Ajay
ed903e8889
SWDEV-543366 - Bump PAL_CLIENT_INTERFACE_MAJOR_VERSION 916 --> 932 ( #725 )
...
Co-authored-by: Lin, Qun <Quentin.Lin@amd.com >
Co-authored-by: Lin,Qun <qlin@amd.com >
2025-08-08 08:45:42 -07:00
Assiouras, Ioannis
4646a756d9
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it i… ( #802 )
...
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it is ignored
2025-08-08 13:09:56 +01:00
Kudchadker, Saleel
9b045922a8
SWDEV-538195 - Introduce threshold for handler submission ( #723 )
...
- When doing device/stream sync, we can submit a handler, which may
introduce some host-side delays. Use DEBUG_CLR_BATCH_CPU_SYNC_SIZE to
batch commands for host wait. Default for HIP is 8 commands.
- Investigation is underway in ROCr, but we need to address this for now in
the HIP runtime.
2025-08-06 20:34:42 -07:00
Xie, Pengda
cd46294b31
SWDEV-520384 - Improve Fat Binary loading latency ( #390 )
...
Init and fini kernels need to be launched when we load and unload a code object. Avoid looping through all kernels within a code object just to run the init and fini kernels. The compiler currently generates only one init and one fini kernel.
2025-08-05 14:02:05 -07:00
Xie, Pengda
7e777d3755
SWDEV-534394 - Kernel launch stream check ( #533 )
...
hipErrorInvalidResourceHandle is returned if the stream is not on the launching device
2025-08-05 10:39:37 -07:00
Xie, Pengda
4121a860bf
SWDEV-505833 - Remove DEBUG_CLR_SKIP_RELEASE_SCOPE flag ( #735 )
...
Cleanup debug flag DEBUG_CLR_SKIP_RELEASE_SCOPE
2025-08-05 08:31:55 -07:00
Brzak, Branislav
111f12cbed
SWDEV-546137 - Don't rely on __cplusplus under hiprtc environment ( #806 )
2025-08-04 21:33:26 +02:00
Assiouras, Ioannis
f094d16157
SWDEV-533754 - Fixed mem leak in capturing gpu packets ( #365 )
2025-08-04 14:58:04 +01:00
Betigeri, Sourabh
2a02d2c2f3
SWDEV-545273 - Respect HIP_LAUNCH_PARAM_BUFFER_SIZE ( #770 )
2025-08-03 17:32:52 -07:00
Kudchadker, Saleel
1492328894
SWDEV-547614 - Demangle graph kernel names ( #809 )
2025-08-01 14:35:30 -07:00
Sumner, Brian
2da49283c8
Fix undefined symbol at runtime with newer versions of LLVM ( #791 )
2025-08-01 08:15:02 -07:00
Patel, Jaydeepkumar
c329b548eb
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is … ( #683 )
...
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is nullptr to match with hipFree.
2025-08-01 14:41:40 +05:30
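The behavior described in SWDEV-541799 above (a null pointer is accepted and nothing is freed) can be modeled in a few lines of plain C++. This is a hedged sketch, not the runtime's code: `Status` and `FreeQueueModel` are hypothetical names standing in for `hipError_t` and the stream-ordered free path.

```cpp
#include <cassert>

// Hypothetical status type; Success plays the role of hipSuccess.
enum class Status { Success, InvalidValue };

// Hypothetical model of a stream-ordered free queue.
struct FreeQueueModel {
  int pending = 0;  // number of frees that have been enqueued

  Status freeAsync(void* ptr) {
    if (ptr == nullptr) {
      // Nothing to free: report success without enqueuing any work.
      return Status::Success;
    }
    ++pending;  // a real runtime would enqueue the free on the stream here
    return Status::Success;
  }
};
```

The point of the model is only that the null case is a successful no-op, so callers can free unconditionally without a null check.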
Chaudhary, Jatin Jaikishan
76d2c4135e
SWDEV-543350 - restrictions on usage of hipMemcpyWithStream ( #727 )
...
* SWDEV-543350 - Do not allow hipMemcpyWithStream to be called during stream capture
* update comment
---------
Co-authored-by: Jatin Chaudhary <jatchaud@amd.com >
2025-08-01 14:41:02 +05:30
Manocha, Rahul
3f6f9d6081
SWDEV-539710 - Defer allocation of managed variable ( #652 )
...
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
2025-07-31 08:30:23 -07:00
Xie, Jiabao(Jimbo)
c88f345229
SWDEV-532824 - Fixed log_printf errors for codeql ( #427 )
...
Co-authored-by: Jimbo Xie <jiabaxie@amd.com >
2025-07-31 08:17:41 -07:00
Manocha, Rahul
3bbc8bd170
SWDEV-540733 - Add PCI CHIP ID device attribute ( #647 )
...
SWDEV-540733 - Add PCI CHIP ID Device Attribute
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
2025-07-30 19:43:07 -07:00
Hernandez, Gerardo
2c6ec7df45
SWDEV-534207 - fix use of uninitialized variable at hipEventRecord_common() ( #771 )
2025-07-23 16:49:12 +05:30
Sang, Tao
5fc6208b19
SWDEV-533964 - use __builtin_readcyclecounter() only in clock64 ( #368 )
2025-07-22 10:41:23 +05:30
Sang, Tao
b15c1657b8
SWDEV-502579 - Remove duplicated symbols ( #21 )
...
Remove duplicated compiler symbols from hip-rt. hip-rt should use symbols such as EF_AMDGPU_MACH_AMDGCN_XXX and
ELFABIVERSION_AMDGPU_HSA_VX defined in the compiler header.
2025-07-22 10:41:01 +05:30
Dittakavi, Satyanvesh
c43ee71e96
SWDEV-542905 - Update the appropriate grid size in capture hipExtModuleLaunchKernel ( #740 )
2025-07-21 13:57:15 +05:30
Jiang, Julia
9cd8757717
SWDEV-523483 - Update atomics add-operation in CLR ( #751 )
2025-07-19 06:48:54 +05:30
Betigeri, Sourabh
1edac1bf3a
SWDEV-537486 - Change the SPIRV bundle entry id to match 4-field triple ( #513 )
2025-07-17 09:29:51 +05:30
Betigeri, Sourabh
0313e20706
SWDEV-540629 - Update the current device ID ( #699 )
2025-07-15 21:40:26 +05:30
Betigeri, Sourabh
a1f056bd11
SWDEV-483895 - Use ROCr to reserve memory for HMM ( #590 )
2025-07-15 21:40:13 +05:30
Dittakavi, Satyanvesh
918b0bdb7a
SWDEV-542832 - Refactor vector types alignment ( #713 )
2025-07-15 21:09:31 +05:30
Xie, Jiabao(Jimbo)
a5d932f160
SWDEV-460151 - add gfx1201 to amd-staging clr ( #183 )
...
* SWDEV-460151 - add gfx1201 to amd-staging clr
* SWDEV-460151 - removed pal macro
---------
Co-authored-by: Jimbo Xie <jiabaxie@amd.com >
2025-07-11 08:51:47 -07:00
Sang, Tao
1351cd7fa8
SWDEV-539145 - Simplify host memory pool management ( #668 )
...
* SWDEV-539145 - Simplify host memory pool management
Remove unnecessary variables and functions.
Make the code simpler and clearer.
* Change cpu_agent_info_ into a pointer.
* Restore getPreferredNumaNode()
2025-07-11 10:38:40 -04:00
GunaShekar, Ajay
b568971718
SWDEV-541623 - do not print hip library path on console ( #707 )
...
DeviceTest.exe Unit_Uuid_FntlTstsFor_SetEnv_HIP_VISIBLE_DEVICES
2025-07-10 12:07:03 -07:00
Sang, Tao
397f303d97
SWDEV-508776 - support VGPRs validation ( #274 )
...
Clarify the description of some VGPR terms.
Fix incorrect query logic for availableVGPRs_ and
availableRegistersPerCU_ in device info.
Add the hipDeviceAttributeMaxAvailableVgprsPerThread
attribute query.
Remove hardcoding of the following:
info_.vgprAllocGranularity_
info_.vgprsPerSimd_
2025-07-09 10:46:52 -04:00
Salinas, David
71c6535b07
SWDEV-536367 - Make DEPRECATED message in roc-obj tools STDERR ( #469 )
2025-07-09 17:01:33 +05:30
Brzak, Branislav
32e027e66c
SWDEV-540609 - hipGraph capture of hipExtModuleLaunchKernel assumed grid size was a multiple of block size ( #694 )
2025-07-09 16:05:40 +05:30
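SWDEV-540609 above concerns a global work size that is not a multiple of the block size. A common way to turn a thread count into a block count in that situation is ceiling division, sketched below. This is an assumption for illustration, not necessarily what the commit does, and `blocksFor` is a hypothetical helper name.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: number of blocks needed to cover `globalSize` threads
// when each block holds `blockSize` threads. Assumes blockSize > 0.
// Written as quotient plus remainder check so the arithmetic cannot overflow.
constexpr std::uint32_t blocksFor(std::uint32_t globalSize, std::uint32_t blockSize) {
  return globalSize / blockSize + (globalSize % blockSize != 0 ? 1u : 0u);
}
```

Plain truncating division would give 3 blocks for 1000 threads with a block size of 256 and leave the last 232 threads uncovered; the rounded-up form gives 4.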
Kudchadker, Saleel
46d766e4e2
SWDEV-539378 - Use agent of IPC memory owner ( #570 )
...
- Currently the runtime just uses the local agent because it does not check for
IPCShared()
- With this fix we query hsa_amd_pointer_info and get the right agent
for the memory to pass to the HSA copy API
2025-07-08 12:02:01 -07:00
Belton-Schure, Aidan
81238db679
SWDEV-515426 - Remove HIP_USE_RUNTIME_UNBUNDLER ( #205 )
...
* remove HIP_USE_RUNTIME_UNBUNDLER
* clang-format
* Generic to use comgr
* Remove HIP_ALWAYS_USE_NEW_COMGR_UNBUNDLING_ACTION flag
* Removes unused runtime unbundling and debug code
* Removes stale functions
2025-07-08 21:45:31 +05:30
Manocha, Rahul
22b1ca4d8c
SWDEV-532420 - Fix kokkos P2P copy failure with vmheap ( #426 )
...
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
2025-07-07 17:27:13 -07:00
Xie, Pengda
9c1bff0ae7
SWDEV-540576 - Abort if user requests a core dump ( #653 )
...
* SWDEV-539414 - Return error status from runtime handler when HIP_SKIP_ABORT_ON_GPU_ERROR is false
* SWDEV-539414 - default handler when GPU core file is generated
* SWDEV-540576 - Abort if user requests a core dump
Change-Id: I9e2c640acf559880bd13641de9103e660ef822a3
---------
Co-authored-by: Assiouras, Ioannis <Ioannis.Assiouras@amd.com >
Co-authored-by: agunashe <ajay.gunashekar@amd.com >
2025-07-03 00:29:14 +05:30
Xie, Pengda
21f6842fff
SWDEV-536925 - Check for C++ compiler instead of C when building hip ( #490 )
...
Change CMake to use CMAKE_CXX_COMPILER_ID, since CMAKE_COMPILER_IS_GNUCC is deprecated
2025-07-01 12:59:54 -07:00