Commit Graph

13677 Commits

Author SHA1 Message Date
GunaShekar, Ajay bfcf0ef4e8 SWDEV-543366 - Bump PAL_CLIENT_INTERFACE_MAJOR_VERSION 916 --> 932 (#725)
Co-authored-by: Lin, Qun <Quentin.Lin@amd.com>
Co-authored-by: Lin,Qun <qlin@amd.com>

[ROCm/clr commit: ed903e8889]
2025-08-08 08:45:42 -07:00
Assiouras, Ioannis b3ae0230a5 SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it i… (#802)
SWDEV-542218 - Remove attribute(aligned) from ext_vector_type as it is ignored

[ROCm/clr commit: 4646a756d9]
2025-08-08 13:09:56 +01:00
Kudchadker, Saleel 3a849c6962 SWDEV-538195 - Introduce threshold for handler submission (#723)
- When doing device/stream sync, we can submit a handler which may
  introduce some host side delays. Use DEBUG_CLR_BATCH_CPU_SYNC_SIZE to
  batch commands for host wait. Default for HIP is 8 commands.
- Investigation is underway in ROCr but need to address this for now in
  HIP runtime.

[ROCm/clr commit: 9b045922a8]
2025-08-06 20:34:42 -07:00
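The threshold idea in the SWDEV-538195 entry above can be pictured with a small model. This is only a sketch: the commit message does not show the implementation, so the class `HostSyncBatcher` and its methods are hypothetical names, and the only value taken from the log is the default batch size of 8.

```python
class HostSyncBatcher:
    """Hypothetical model: submit one host-side handler per batch of commands."""

    def __init__(self, batch_size=8):
        # 8 mirrors the HIP default stated in the commit message.
        if batch_size < 1:
            raise ValueError("batch_size must be at least 1")
        self.batch_size = batch_size
        self.pending = 0             # commands queued since the last handler
        self.handlers_submitted = 0  # how many handlers were submitted so far

    def enqueue(self):
        """Record one command; submit a handler once the threshold is reached."""
        self.pending += 1
        if self.pending >= self.batch_size:
            self._submit()

    def flush(self):
        """Submit a handler for leftover commands, for example at a sync point."""
        if self.pending:
            self._submit()

    def _submit(self):
        self.handlers_submitted += 1
        self.pending = 0


batcher = HostSyncBatcher()
for _ in range(20):
    batcher.enqueue()
batcher.flush()
```

With 20 commands and a batch size of 8, the model submits two handlers during enqueueing and one more on the final flush, instead of one per command.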
Xie, Pengda 9cbbee4d6e SWDEV-520384 - Improve Fat Binary loading latency (#390)
Init and fini kernels need to be launched when a code object is loaded and unloaded. Avoid looping through all kernels within a code object just to run the init and fini kernels. The compiler currently generates only one init and one fini kernel.

[ROCm/clr commit: cd46294b31]
2025-08-05 14:02:05 -07:00
Xie, Pengda 213623506f SWDEV-534394 - Kernel launch stream check (#533)
hipErrorInvalidResourceHandle is returned if stream is not on the launching device

[ROCm/clr commit: 7e777d3755]
2025-08-05 10:39:37 -07:00
Xie, Pengda b7d8cb56d1 SWDEV-505833 - Remove DEBUG_CLR_SKIP_RELEASE_SCOPE flag (#735)
Cleanup debug flag DEBUG_CLR_SKIP_RELEASE_SCOPE

[ROCm/clr commit: 4121a860bf]
2025-08-05 08:31:55 -07:00
Brzak, Branislav d54e70063e SWDEV-546137 - Don't rely on __cplusplus under hiprtc environment (#806)
[ROCm/clr commit: 111f12cbed]
2025-08-04 21:33:26 +02:00
Assiouras, Ioannis b443903ad8 SWDEV-533754 - Fixed mem leak in capturing gpu packets (#365)
[ROCm/clr commit: f094d16157]
2025-08-04 14:58:04 +01:00
Betigeri, Sourabh 40999496c1 SWDEV-545273 - Respect HIP_LAUNCH_PARAM_BUFFER_SIZE (#770)
[ROCm/clr commit: 2a02d2c2f3]
2025-08-03 17:32:52 -07:00
Kudchadker, Saleel a1d3da6bd3 SWDEV-547614 - Demangle graph kernel names (#809)
[ROCm/clr commit: 1492328894]
2025-08-01 14:35:30 -07:00
Sumner, Brian b617e8feb6 Fix undefined symbol at runtime with newer versions of LLVM (#791)
[ROCm/clr commit: 2da49283c8]
2025-08-01 08:15:02 -07:00
Patel, Jaydeepkumar 4458842ceb SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is … (#683)
SWDEV-541799 - Return hipSuccess from hipFreeAsync if ptr to free is nullptr to match with hipFree.

[ROCm/clr commit: c329b548eb]
2025-08-01 14:41:40 +05:30
Chaudhary, Jatin Jaikishan b69f832430 SWDEV-543350 - restrictions on usage of hipMemcpyWithStream (#727)
* SWDEV-543350 - Do not allow hipMemcpyWithStream to be called during stream capture

* update comment

---------

Co-authored-by: Jatin Chaudhary <jatchaud@amd.com>

[ROCm/clr commit: 76d2c4135e]
2025-08-01 14:41:02 +05:30
Manocha, Rahul 4a93a614e5 SWDEV-539710 - Defer allocation of managed variable (#652)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: 3f6f9d6081]
2025-07-31 08:30:23 -07:00
Xie, Jiabao(Jimbo) 405962df53 SWDEV-532824 - Fixed log_printf errors for codeql (#427)
Co-authored-by: Jimbo Xie <jiabaxie@amd.com>

[ROCm/clr commit: c88f345229]
2025-07-31 08:17:41 -07:00
Manocha, Rahul 86039381a2 SWDEV-540733 - Add PCI CHIP ID device attribute (#647)
SWDEV-540733 - Add PCI CHIP ID Device Attribute

Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: 3bbc8bd170]
2025-07-30 19:43:07 -07:00
Hernandez, Gerardo f47693b755 SWDEV-534207 - fix use of uninitialized variable at hipEventRecord_common() (#771)
[ROCm/clr commit: 2c6ec7df45]
2025-07-23 16:49:12 +05:30
Sang, Tao 1aae121e86 SWDEV-533964 - use __builtin_readcyclecounter() only in clock64 (#368)
[ROCm/clr commit: 5fc6208b19]
2025-07-22 10:41:23 +05:30
Sang, Tao 2296162578 SWDEV-502579 - Remove duplicated symbols (#21)
Remove duplicated compiler symbols from hip-rt. hip-rt should use symbols such as EF_AMDGPU_MACH_AMDGCN_XXX and
ELFABIVERSION_AMDGPU_HSA_VX defined in compiler header.

[ROCm/clr commit: b15c1657b8]
2025-07-22 10:41:01 +05:30
Dittakavi, Satyanvesh a3e576a317 SWDEV-542905 - Update the appropriate grid size in capture hipExtModuleLaunchKernel (#740)
[ROCm/clr commit: c43ee71e96]
2025-07-21 13:57:15 +05:30
Jiang, Julia 005672c314 SWDEV-523483 - Update atomics add-operation in CLR (#751)
[ROCm/clr commit: 9cd8757717]
2025-07-19 06:48:54 +05:30
Betigeri, Sourabh 5683c5fb61 SWDEV-537486 - Change the SPIRV bundle entry id to match 4-field triple (#513)
[ROCm/clr commit: 1edac1bf3a]
2025-07-17 09:29:51 +05:30
Betigeri, Sourabh 52363f797e SWDEV-540629 - Update the current device ID (#699)
[ROCm/clr commit: 0313e20706]
2025-07-15 21:40:26 +05:30
Betigeri, Sourabh 680c7fbf64 SWDEV-483895 - Use ROCr to reserve memory for HMM (#590)
[ROCm/clr commit: a1f056bd11]
2025-07-15 21:40:13 +05:30
Dittakavi, Satyanvesh b6d69d162e SWDEV-542832 - Refactor vector types alignment (#713)
[ROCm/clr commit: 918b0bdb7a]
2025-07-15 21:09:31 +05:30
Xie, Jiabao(Jimbo) e5226f2b7e SWDEV-460151 - add gfx1201 to amd-staging clr (#183)
* SWDEV-460151 - add gfx1201 to amd-staging clr

* SWDEV-460151 - removed pal macro

---------

Co-authored-by: Jimbo Xie <jiabaxie@amd.com>

[ROCm/clr commit: a5d932f160]
2025-07-11 08:51:47 -07:00
Sang, Tao a777d6eca0 SWDEV-539145 - Simplify host memory pool management (#668)
* SWDEV-539145 - Simplify host memory pool management

Remove unnecessary variables and functions.
Make the code simpler and clearer.

* Change cpu_agent_info_ into pointer.

* Restore getPreferredNumaNode()

[ROCm/clr commit: 1351cd7fa8]
2025-07-11 10:38:40 -04:00
GunaShekar, Ajay 8e1f5f50cd SWDEV-541623 - do not print hip library path on console (#707)
DeviceTest.exe Unit_Uuid_FntlTstsFor_SetEnv_HIP_VISIBLE_DEVICES

[ROCm/clr commit: b568971718]
2025-07-10 12:07:03 -07:00
Sang, Tao 7fd322af90 SWDEV-508776 - support VGPRs validation (#274)
Clarify the descriptions of some VGPR terms.
Fix incorrect query logic for availableVGPRs_ and
availableRegistersPerCU_ in device info.
Add hipDeviceAttributeMaxAvailableVgprsPerThread
attribute query.
Remove hardcoding of the following:
 info_.vgprAllocGranularity_
 info_.vgprsPerSimd_

[ROCm/clr commit: 397f303d97]
2025-07-09 10:46:52 -04:00
Salinas, David 84e9845897 SWDEV-536367 - Make DEPRECATED message in roc-obj tools STDERR (#469)
[ROCm/clr commit: 71c6535b07]
2025-07-09 17:01:33 +05:30
Brzak, Branislav afa60c590c SWDEV-540609 - hipGraph capture of hipExtModuleLaunchKernel assumed grid size was a multiple of block size (#694)
[ROCm/clr commit: 32e027e66c]
2025-07-09 16:05:40 +05:30
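The SWDEV-540609 entry above concerns a global work size that is not a multiple of the block size. The log does not show the fix itself; the sketch below only illustrates the arithmetic, assuming the global size is given in threads and must be converted to a block count. The helper name `blocks_for` is hypothetical.

```python
def blocks_for(global_size, block_size):
    """Number of blocks needed to cover global_size threads (ceiling division)."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    if global_size < 0:
        raise ValueError("global_size must be non-negative")
    return (global_size + block_size - 1) // block_size


# Plain integer division silently drops the partial last block.
truncated = 1000 // 256            # 3 blocks cover only 768 threads
covered = blocks_for(1000, 256)    # 4 blocks cover all 1000 threads
```

When the global size is an exact multiple of the block size, both computations agree, which is why the assumption can go unnoticed.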
Kudchadker, Saleel 433c25eab0 SWDEV-539378 - Use agent of IPC memory owner (#570)
- Currently runtime just uses the local agent as it did not check for
  IPCShared()
- With this fix we query hsa_amd_pointer_info and get the right agent
  for the memory to pass it to the HSA copy api

[ROCm/clr commit: 46d766e4e2]
2025-07-08 12:02:01 -07:00
Belton-Schure, Aidan 88c1717658 SWDEV-515426 - Remove HIP_USE_RUNTIME_UNBUNDLER (#205)
* remove HIP_USE_RUNTIME_UNBUNDLER
* clang-format
* Generic to use comgr
* Remove HIP_ALWAYS_USE_NEW_COMGR_UNBUNDLING_ACTION flag
* Removes unused runtime unbundling and debug code
* Removes stale functions

[ROCm/clr commit: 81238db679]
2025-07-08 21:45:31 +05:30
Manocha, Rahul 8e8dc41cf0 SWDEV-532420 - Fix kokkos P2P copy failure with vmheap (#426)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: 22b1ca4d8c]
2025-07-07 17:27:13 -07:00
Xie, Pengda 8e5921174f SWDEV-540576 - Abort if user request a core dump (#653)
* SWDEV-539414 - Return error status from runtime handler when HIP_SKIP_ABORT_ON_GPU_ERROR is false

* SWDEV-539414 - default handler when GPU core file is generated

* SWDEV-540576 - Abort if user request a core dump

Change-Id: I9e2c640acf559880bd13641de9103e660ef822a3

---------

Co-authored-by: Assiouras, Ioannis <Ioannis.Assiouras@amd.com>
Co-authored-by: agunashe <ajay.gunashekar@amd.com>

[ROCm/clr commit: 9c1bff0ae7]
2025-07-03 00:29:14 +05:30
Xie, Pengda 19007f7015 SWDEV-536925 - Check for C++ compiler instead of C when building hip (#490)
Change cmake to use CMAKE_CXX_COMPILER_ID since CMAKE_COMPILER_IS_GNUCC is deprecated

[ROCm/clr commit: 21f6842fff]
2025-07-01 12:59:54 -07:00
Sang, Tao 287ec83f5c SWDEV-539145 - Support extended fine grained system memory pool (#603)
* Add new hipHostMalloc() flag hipHostMallocUncached, which forces pinned
host memory to be allocated on the extended fine-grained system memory pool.
* Add new hipHostAlloc() flag hipHostAllocUncached, which forces pinned
host memory to be allocated on the extended fine-grained system memory pool.
* Add new hipHostRegister() flag hipHostRegisterUncached, which forces
host memory to be mapped onto the extended fine-grained system memory pool.

[ROCm/clr commit: a7d7687b8f]
2025-07-01 10:10:10 +05:30
Trinh, Ethan 66b125d5dc SWDEV-539861 - add ROCclr version to logs (#612)
[ROCm/clr commit: 2ce45143d8]
2025-06-27 16:37:58 -04:00
Xie, AlexBin 534bd7413a SWDEV-530767 ocltst GL test work with GL context on 2nd GPU (#324)
Change-Id: Ia3dbc661b3b9b1de71a632969707318f208131cd

[ROCm/clr commit: ef26fcdbd2]
2025-06-27 12:25:23 -04:00
Xie, AlexBin a158b5c4a2 SWDEV-540176 - deb and rpm package include OpenCL config files (#615)
[ROCm/clr commit: 592c25e936]
2025-06-27 11:59:24 -04:00
Guan, Zichuan c43b8ca05c Fix memory leak in opengl interop (#541)
Signed-off-by: zichguan-amd <zichuan.guan@amd.com>

[ROCm/clr commit: 51508e0bef]
2025-06-27 09:51:18 -04:00
Belton-Schure, Aidan b735783ae7 SWDEV-536813 - Make alignment 16 for special cases (#609)
[ROCm/clr commit: 5e77c58bd0]
2025-06-27 14:20:16 +05:30
Lin, Qun ced47b3588 SWDEV-508869 - Replace asicRevision with gfxip triple in paldevice (#601)
asicRevision is no longer unique in the PAL path; using the gfxip triple to match the ISA is more straightforward there.

[ROCm/clr commit: b584fc8cd1]
2025-06-27 10:09:21 +08:00
Lin, Qun 3b44884a57 SWDEV-508869 - Fix Linux build error for HIP on PAL (#176)
[ROCm/clr commit: 9699cc3864]
2025-06-27 07:51:22 +08:00
Luu, Jonathan 20a9e311a9 SWDEV-531400_SYMLINK remove SYMLINK (#525)
Co-authored-by: Jonathan Luu <jonatluu@amd.com>

[ROCm/clr commit: a2dafbeacf]
2025-06-26 15:53:39 -04:00
Luu, Jonathan 81c75a7acc SWDEV-531400 - Remove file reorganization backward compatibility (clr hip) (#472)
* SWDEV-531400 Remove file reorganization backward compatibility (clr hip)

* SWDEV-531400 remove backwards compatibility for opencl

* SWDEV-531400 remove header_template.hpp.in

---------

Co-authored-by: Jonathan Luu <jonatluu@amd.com>

[ROCm/clr commit: c64d56195f]
2025-06-26 15:53:11 -04:00
Sang, Tao da7c6f57ef SWDEV-539399 - Support ROCCLR_MEM_HSA_CONTIGUOUS (#610)
Fix missing support of ROCCLR_MEM_HSA_CONTIGUOUS in
hipExtMallocWithFlags().

[ROCm/clr commit: f4d78d427c]
2025-06-26 14:05:23 -04:00
Jiang, Julia cc2129b2f4 SWDEV-461599 - scrub HCC in clr documents (#569)
[ROCm/clr commit: b5cd8321d3]
2025-06-26 15:05:59 +05:30
Sicarov, Dragoslav dbfd869936 SWDEV-533232 Add num_threads API in cooperative_groups (#437)
* Add num_threads API in cooperative_groups
* Reimplement size API as its alias.
* Update changelog

[ROCm/clr commit: a4631b5700]
2025-06-26 15:05:17 +05:30
Arandjelovic, Marko c530e72a7c SWDEV-512344 - Allow unmapping of only complete subbuffers (#608)
* Do not allow partial unmapping
* Add check to prevent out of bounds unmap
* Improve code quality
* Use getOrigin instead of getOffset

[ROCm/clr commit: 349b83c1f6]
2025-06-26 15:01:33 +05:30
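The two checks listed in the SWDEV-512344 entry above (no partial unmapping, no out-of-bounds unmap) can be sketched as a simple validation routine. This is an illustrative model under stated assumptions, not the CLR code: the function `can_unmap` and the representation of mapped subbuffers as `(origin, size)` tuples are hypothetical.

```python
def can_unmap(mapped, origin, size, buffer_size):
    """Return True only if the request names one complete mapped subbuffer.

    mapped: set of (origin, size) tuples for currently mapped subbuffers.
    """
    # Reject requests that fall outside the parent buffer.
    if origin < 0 or size <= 0 or origin + size > buffer_size:
        return False
    # Reject partial unmaps: the request must match a mapped subbuffer exactly.
    return (origin, size) in mapped


mapped = {(0, 4096), (4096, 4096)}
whole = can_unmap(mapped, 0, 4096, 8192)         # complete subbuffer
partial = can_unmap(mapped, 0, 2048, 8192)       # only part of a subbuffer
outside = can_unmap(mapped, 4096, 8192, 8192)    # runs past the buffer end
```

Only the first request is accepted; the partial and out-of-bounds requests are rejected before any unmapping would happen.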