Graf commitů

13691 Commity

Autor SHA1 Zpráva Datum
Stojiljkovic, Vladana 499e2ee044 SWDEV-537855 - Free created event if hipIpcOpenEventHandle fails (#680) 2025-08-13 11:36:13 +02:00
Jiang, Julia eb438107a7 525231- Adding more info in changelog for 7.0 in staging (#527)
525231- Update changelog for 7.0 in staging
2025-08-13 14:47:17 +05:30
Assiouras, Ioannis 2aa2212d93 SWDEV-549247 - Fix stream checks for hipModuleLaunchCooperativeKernel… (#858)
SWDEV-549247 - Fix stream validation checks in hipModuleLaunchCooperativeKernelMultiDevice

In CooperativeKernelMultiDevice mode, the stream's device
should be validated against the grid's assigned device
rather than the current device.
2025-08-13 07:47:49 +01:00
Andryeyev, German 2305f8ae56 SWDEV-465041 - Add support for user events with DD (#321)
* SWDEV-465041 - Add support for user events with DD

User events can be replaced with HSA signals. Add the interface
to allocate HSA signal for user events and update the status on
CL_COMPLETE.
Force pinned path with DD to avoid blocking calls. Pinned memory
can be released only when the command is complete.
Simplify device enqueue path to use generic kernel arg buffer and
signals

* Fix notifyCmdQueue() logic for OCL

* Avoid blocking calls in OCL with DD

* Add event  destruciton in a case of the failure.
2025-08-12 19:04:36 -04:00
Assiouras, Ioannis dc34af61d7 SWDEV-543340 - Remove shared memory objects after IPC event cleanup (#745)
This change ensures that shared memory objects (e.g., files in /dev/shm)
are unlinked once all related IPC events have been destroyed.
2025-08-12 21:23:09 +01:00
Chaudhary, Jatin Jaikishan 42730527c6 SWDEV-545100 - rename the template variable (#798) 2025-08-12 21:58:55 +05:30
Chaudhary, Jatin Jaikishan 4d5fe2206d SWDEV-539481 - Add _rn variants of fp16 APIs (#582)
* Add _rn variants of fp16 APIs
* cover bf16 as well
2025-08-12 21:58:38 +05:30
Dittakavi, Satyanvesh c60888b0cd SWDEV-545947 - Add Implementation for hipSetValidDevices (#805) 2025-08-12 16:29:09 +05:30
Dittakavi, Satyanvesh b50d70564e SWDEV-540272 - copy params 3D struct should be populated even for memcpy 1D graph node (#702)
SWDEV-540272 - copy params 3D struct should be populated even for 1D graph node
2025-08-12 16:21:53 +05:30
Chaudhary, Jatin Jaikishan d7f90a3120 SWDEV-540645 - use single path for code object loading (#773)
* use single path for code object loading
* Remove printf
* Address review comments
* Fix the declaration of ihipMallocManaged
2025-08-12 15:15:18 +05:30
Betigeri, Sourabh 0d9428135c SWDEV-528351 - Clean up hipJitOptions (#439) 2025-08-11 21:24:44 +05:30
Betigeri, Sourabh da61c5d1da SWDEV-528351 - Cleanup template specializations and overloads for enums (#360)
* SWDEV-528351 - Adds temlplate specialization for enums
* SWDEV-528351 - Removes template specializations for enums
2025-08-11 21:24:24 +05:30
Chaudhary, Jatin Jaikishan 926eedc230 SWDEV-508540 - adjust for compile time wavefront 64 (#747) 2025-08-11 10:17:27 +05:30
Chaudhary, Jatin Jaikishan ee124dda09 SWDEV-508540 - fix the max blocks per multiprocessor calculation (#737) 2025-08-11 10:17:18 +05:30
GunaShekar, Ajay ed903e8889 SWDEV-543366 - Bump PAL_CLIENT_INTERFACE_MAJOR_VERSION 916 --> 932 (#725)
Co-authored-by: Lin, Qun <Quentin.Lin@amd.com>
Co-authored-by: Lin,Qun <qlin@amd.com>
2025-08-08 08:45:42 -07:00
Assiouras, Ioannis 4646a756d9 SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it i… (#802)
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it is iignored
2025-08-08 13:09:56 +01:00
Kudchadker, Saleel 9b045922a8 SWDEV-538195 - Introduce threshold for handler submission (#723)
- When doing device/stream sync, we can submit a handler which may
  introduce some host side delays. Use DEBUG_CLR_BATCH_CPU_SYNC_SIZE to
  batch commands for host wait. Default for HIP is 8 commands.
- Investigation is underway in ROCr but need to address this for now in
  HIP runtime.
2025-08-06 20:34:42 -07:00
Xie, Pengda cd46294b31 SWDEV-520384 - Improve Fat Binary loading latency (#390)
Init and fini kernel needs to be launched when we load and unload code object. Avoid looping through all kernels within a code object just to run the init and fini kernels. Compiler currently only generates 1 init and fini kernel.
2025-08-05 14:02:05 -07:00
Xie, Pengda 7e777d3755 SWDEV-534394 - Kernel launch stream check (#533)
hipErrorInvalidResourceHandle is returned if stream is not on the launching device
2025-08-05 10:39:37 -07:00
Xie, Pengda 4121a860bf SWDEV-505833 - Remove DEBUG_CLR_SKIP_RELEASE_SCOPE flag (#735)
Cleanup debug flag DEBUG_CLR_SKIP_RELEASE_SCOPE
2025-08-05 08:31:55 -07:00
Brzak, Branislav 111f12cbed SWDEV-546137 - Don't rely on __cplusplus under hiprtc environment (#806) 2025-08-04 21:33:26 +02:00
Assiouras, Ioannis f094d16157 SWDEV-533754 - Fixed mem leak in capturing gpu packets (#365) 2025-08-04 14:58:04 +01:00
Betigeri, Sourabh 2a02d2c2f3 SWDEV-545273 - Respect HIP_LAUNCH_PARAM_BUFFER_SIZE (#770) 2025-08-03 17:32:52 -07:00
Kudchadker, Saleel 1492328894 SWDEV-547614 - Demangle graph kernel names (#809) 2025-08-01 14:35:30 -07:00
Sumner, Brian 2da49283c8 Fix undefined symbol at runtime with newer versions of LLVM (#791) 2025-08-01 08:15:02 -07:00
Patel, Jaydeepkumar c329b548eb SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is … (#683)
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is nullptr to match with hipFree.
2025-08-01 14:41:40 +05:30
Chaudhary, Jatin Jaikishan 76d2c4135e SWDEV-543350 - restrictions on usage of hipMemcpyWithStream (#727)
* SWDEV-543350 - Do not hipMemcpyWithStream to be called during stream capture

* update comment

---------

Co-authored-by: Jatin Chaudhary <jatchaud@amd.com>
2025-08-01 14:41:02 +05:30
Manocha, Rahul 3f6f9d6081 SWDEV-539710 - Defer allocation of managed variable (#652)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-07-31 08:30:23 -07:00
Xie, Jiabao(Jimbo) c88f345229 SWDEV-532824 - Fixed log_printf errors for codeql (#427)
Co-authored-by: Jimbo Xie <jiabaxie@amd.com>
2025-07-31 08:17:41 -07:00
Manocha, Rahul 3bbc8bd170 SWDEV-540733 - Add PCI CHIP ID device attribute (#647)
SWDEV-540733 - Add PCI CHIP ID Device Attribute

Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-07-30 19:43:07 -07:00
Hernandez, Gerardo 2c6ec7df45 SWDEV-534207 - fix use of uninitialized variable at hipEventRecord_common() (#771) 2025-07-23 16:49:12 +05:30
Sang, Tao 5fc6208b19 SWDEV-533964 - use __builtin_readcyclecounter() only in clock64 (#368) 2025-07-22 10:41:23 +05:30
Sang, Tao b15c1657b8 SWDEV-502579 - Remove duplicated symbols (#21)
Remove duplicated compiler symbols from hip-rt. hip-rt should use symbols such as EF_AMDGPU_MACH_AMDGCN_XXX and
ELFABIVERSION_AMDGPU_HSA_VX defined in compiler header.
2025-07-22 10:41:01 +05:30
Dittakavi, Satyanvesh c43ee71e96 SWDEV-542905 - Update the appropriate grid size in capture hipExtModuleLaunchKernel (#740) 2025-07-21 13:57:15 +05:30
Jiang, Julia 9cd8757717 SWDEV-523483 - Update atomics add-operation in CLR (#751) 2025-07-19 06:48:54 +05:30
Betigeri, Sourabh 1edac1bf3a SWDEV-537486 - Change the SPIRV bundle entry id to match 4-field triple (#513) 2025-07-17 09:29:51 +05:30
Betigeri, Sourabh 0313e20706 SWDEV-540629 - Update the current device ID (#699) 2025-07-15 21:40:26 +05:30
Betigeri, Sourabh a1f056bd11 SWDEV-483895 - Use ROCr to reserve memory for HMM (#590) 2025-07-15 21:40:13 +05:30
Dittakavi, Satyanvesh 918b0bdb7a SWDEV-542832 - Refactor vector types alignment (#713) 2025-07-15 21:09:31 +05:30
Xie, Jiabao(Jimbo) a5d932f160 SWDEV-460151 - add gfx1201 to amd-staging clr (#183)
* SWDEV-460151 - add gfx1201 to amd-staging clr

* SWDEV-460151 - removed pal macro

---------

Co-authored-by: Jimbo Xie <jiabaxie@amd.com>
2025-07-11 08:51:47 -07:00
Sang, Tao 1351cd7fa8 SWDEV-539145 - Simplify host memory pool management (#668)
* SWDEV-539145 - Simplify host memory pool management

Remove unnecessary variables and functions.
Make code simpler and clear.

* Change cpu_agent_info_ into pointer.

* Restore getPreferredNumaNode()
2025-07-11 10:38:40 -04:00
GunaShekar, Ajay b568971718 SWDEV-541623 - do not print hip library path on console (#707)
DeviceTest.exe Unit_Uuid_FntlTstsFor_SetEnv_HIP_VISIBLE_DEVICES
2025-07-10 12:07:03 -07:00
Sang, Tao 397f303d97 SWDEV-508776 - support VGPRs validation (#274)
Clarify some VGPRs terms description.
Fix some wrong query logics of availableVGPRs_ and
availableRegistersPerCU_ in device info.
Add hipDeviceAttributeMaxAvailableVgprsPerThread
attribute query.
Remove hardcoding of following
 info_.vgprAllocGranularity_
 info_.vgprsPerSimd_
2025-07-09 10:46:52 -04:00
Salinas, David 71c6535b07 SWDEV-536367 - Make DEPRECATED message in roc-obj tools STDERR (#469) 2025-07-09 17:01:33 +05:30
Brzak, Branislav 32e027e66c SWDEV-540609 - hipGraph capture of hipExtModuleLaunchKernel assumed grid size was a multiple of block size (#694) 2025-07-09 16:05:40 +05:30
Kudchadker, Saleel 46d766e4e2 SWDEV-539378 - Use agent of IPC memory owner (#570)
- Currently runtime just uses the local agent as it did not check for
  IPCShared()
- With this fix we query hsa_amd_pointer_info and get the right agent
  for the memory to pass it to the HSA copy api
2025-07-08 12:02:01 -07:00
Belton-Schure, Aidan 81238db679 SWDEV-515426 - Remove HIP_USE_RUNTIME_UNBUNDLER (#205)
* remove HIP_USE_RUNTIME_UNBUNDLER
* clang-format
* Generic to use comgr
* Remove HIP_ALWAYS_USE_NEW_COMGR_UNBUNDLING_ACTION flag
* Removes runtime unbundling unused and debug Code
* Removes stale functions
2025-07-08 21:45:31 +05:30
Manocha, Rahul 22b1ca4d8c SWDEV-532420 - Fix kokkos P2P copy failure with vmheap (#426)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-07-07 17:27:13 -07:00
Xie, Pengda 9c1bff0ae7 SWDEV-540576 - Abort if user request a core dump (#653)
* SWDEV-539414 - Return error status from runtime handler when HIP_SKIP_ABORT_ON_GPU_ERROR is false

* SWDEV-539414 - default handler when GPU core file is generated

* SWDEV-540576 - Abort if user request a core dump

Change-Id: I9e2c640acf559880bd13641de9103e660ef822a3

---------

Co-authored-by: Assiouras, Ioannis <Ioannis.Assiouras@amd.com>
Co-authored-by: agunashe <ajay.gunashekar@amd.com>
2025-07-03 00:29:14 +05:30
Xie, Pengda 21f6842fff SWDEV-536925 - Check for C++ compiler instead of C when building hip (#490)
Change cmake to use CMAKE_CXX_COMPILER_ID since CMAKE_COMPILER_IS_GNUCC is deprecated
2025-07-01 12:59:54 -07:00