Commit Graph

1677 Commits

Author SHA1 Message Date
Maneesh Gupta 4a9833e70e Revert "Add HasExpertSchedMode device prop (#2241)" (#2371)
This reverts commit c0b4aef5ad.
2025-12-17 21:26:44 -08:00
Filip Jankovic c0b4aef5ad Add HasExpertSchedMode device prop (#2241)
* Add HasExpertSchedMode device prop

* Add unit tests for HasExpertSchedMode

* Add gfx12 check for HasExpertSchedMode prop

* Update gfx major version check and test for ExpertSchedMode

* Minor fix and ROCr version bump

* Update projects/rocr-runtime/runtime/hsa-runtime/inc/hsa_ext_amd.h

* Update projects/rocr-runtime/runtime/hsa-runtime/inc/hsa_ext_amd.h

* Apply suggestion from @dayatsin-amd

* Apply suggestion from @dayatsin-amd

---------

Co-authored-by: Stefan Sokolovic <stefan.sokolovic2@amd.com>
Co-authored-by: David Yat Sin <77975354+dayatsin-amd@users.noreply.github.com>
2025-12-17 17:06:08 +01:00
systems-assistant[bot] 0b61026495 SWDEV-543350 - Add graph capture limitation for hipMemcpyWithStream (#462) 2025-12-05 13:15:52 +00:00
Satyanvesh Dittakavi 07dd4c85e7 SWDEV-546308 - Implement hipKernelGetParamInfo API (#1783) 2025-11-12 14:09:26 +05:30
Todd tiantuo Li cf536a8c1a SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1771) 2025-11-10 23:29:40 -08:00
Scott Todd fdbafd7757 Revert "SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1057)" (#1690)
Reverts ROCm/rocm-systems#1057

Suspected of breaking the build, see https://github.com/ROCm/rocm-systems/pull/1057#issuecomment-3487715129

Logs: https://github.com/ROCm/rocm-systems/actions/runs/19062134668/job/54444052479#step:12:315
```
[rocprofiler-sdk] FAILED: source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o 
[rocprofiler-sdk] ccache /opt/rh/gcc-toolset-12/root/usr/bin/c++ -DAMD_INTERNAL_BUILD=1 -DGLOG_USE_GLOG_EXPORT -DROCPROFILER_DL=1 -DROCPROFILER_HAS_GHC_LIB_FILESYSTEM=1 -DROCPROFILER_SDK_USE_SYSTEM_RCCL=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCDECODE=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCJPEG=0 -DUSE_PROF_API=1 -DYAML_CPP_STATIC_DEFINE -D__HIP_PLATFORM_AMD__=1 -Drocprofiler_EXPORTS=1 -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/yaml-cpp/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/ptl/source -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/ptl/source -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/clr/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/ROCR-Runtime/dist/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/filesystem/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/glog -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/glog/src -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/fmt/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/elfio -isystem /__w/rocm-systems/rocm-systems/TheRock/build/compiler/amd-comgr-stub/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include/libdrm -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/elfutils/build/dist/lib/rocm_sysdeps/include -O3 -DNDEBUG 
-std=c++17 -fPIC -fvisibility=hidden -fvisibility-inlines-hidden -W -Wall -Wno-unknown-pragmas -faligned-new -rdynamic -fstack-protector-strong -Wstack-protector -MD -MT source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -MF source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o.d -o source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -c /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp
[rocprofiler-sdk] In file included from /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp:26:
[rocprofiler-sdk] /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/common/abi.hpp:62:27: error: static assertion failed: size of the API table struct has changed. Update the STEP_VERSION number (or in rare cases, the MAJOR_VERSION number)
[rocprofiler-sdk]    62 |             sizeof(TABLE) == ::rocprofiler::common::abi::compute_table_offset(NUM),                \
```
2025-11-04 14:29:58 -08:00
Todd tiantuo Li 7573fa168d SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1057) 2025-11-04 00:16:12 -08:00
dsicarov-amd 4915496bf9 SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock API (#899)
* SWDEV-533237 Add initial support for hipOccupancyAvailableDynamicSMemPerBlock API

* SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock wrapper for nvidia

* SWDEV-533237 Add implementation of hipOccupancyAvailableDynamicSMemPerBlock API

* SWDEV-533237 Add LDSAlignment field in Isa table

---------

Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-29 10:58:42 +01:00
Rahul Manocha f5d901f016 SWDEV-546311 - implement hipKernelGetLibrary & hipLibraryEnumerateKer… (#1143)
* SWDEV-546311 - implement hipKernelGetLibrary & hipLibraryEnumerateKernels API

* Fix for LibraryEnumerateKernel and KernelGetName

* Update Enumerate Kernels to handle 0 numKernels

* Minor fixes to function names

* fix error checking in internal function

* Update changelog for new apis

---------

Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-27 14:13:17 -07:00
MachineTom 5f76cb916d SWDEV-555888 - Refactor Numa code (#1191)
1. Create a set of mini NUMA interfaces.
On Linux, the interface is based on system calls rather than libnuma.
On Windows, the interface also works, but the policy class is a dummy.
Unlike Linux, Windows doesn't provide a numactl tool or a NUMA library to set up a NUMA policy, so
the default policy is followed on Windows: the closest host NUMA node is used to allocate
pinned host memory in hipHostMalloc().
To get the closest host NUMA node of a GPU device, query the new attribute
hipDeviceAttributeHostNumaId. You can then create a thread with CPU affinity on that NUMA node.
For an example, see the test in hip-tests/catch/perftests/memory/hipPerfHostNumaAllocWin.cc.

2. Remove pfnSetThreadGroupAffinity and pfnGetNumaNodeProcessorMaskEx, as these functions have been available since Windows 7 and Windows Server 2008.

3. Other minor fixes.
2025-10-23 21:56:15 -04:00
Jimbo 37f2be9140 SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister (#962)
* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

* SWDEV-554174 Added hipHostRegisterIoMemory flag in test cases

* SWDEV-554174 : Did formatting corrections

* SWDEV-554608 - set HSA_AMD_MEMORY_POOL_UNCACHED_FLAG if IoMemory is set

* SWDEV-554608 - set HSA_AMD_MEMORY_POOL_UNCACHED_FLAG if IoMemory is set

* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

---------

Co-authored-by: Anavena Venkatesh <Anavena.Venkatesh@amd.com>
Co-authored-by: Rambabu Swargam <rambabu.swargam@amd.com>
2025-10-22 20:25:59 -04:00
Satyanvesh Dittakavi 46e683d41a SWDEV-545950 - Add hipStreamCopyAttributes API Implementation (#914)
* SWDEV-545950 - Add hipStreamCopyAttributes API Implementation

* Add unit test for hipStreamCopyAttributes API

* Add ChangeLog and nvidia mapping for the API

* Update rocprofiler-sdk with new HIP API details

* [rocprofiler-sdk] handle hipStreamCopyAttributes in stream tracing service

- this new HIP function has multiple stream arguments and needs to be skipped because it does not have an explicit create/destroy/set functionality

* Update HIP_RUNTIME_API_TABLE_STEP_VERSION in clr and rocprofiler-sdk

* Resolve merge conflicts

---------

Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>
2025-10-12 19:57:05 +05:30
Rahul Manocha 27ec19116d SWDEV-557828 - fix hip-tests on cuda (#1152)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-07 08:28:56 -07:00
Istvan Kiss 83fb0c8c47 SWDEV-541514 - Docs update 2025-09-15 (#993)
Co-authored-by: Julia Jiang <56359287+jujiang-del@users.noreply.github.com>
2025-09-24 09:57:00 -07:00
MachineTom 25922d08c3 SWDEV-539145 - Return error when ext_fine_grain_pool unavailable (#877)
Return error when ext_fine_grain_pool is unavailable for
hipHostMallocUncached, hipHostAllocUncached and
hipExtHostRegisterUncached.
Disable related tests on Navi4x where
ext_fine_grain_pool is unavailable
2025-09-21 19:25:28 -04:00
Jatin Chaudhary e79eaaa8a5 SWDEV-546287 - Implement hipLibrary load/unload (#975) 2025-09-19 22:23:49 +01:00
Julia Jiang 7ab2e49c57 SWDEV-554072 - Update description for hipModuleLoadData (#929) 2025-09-16 17:10:06 -04:00
Julia Jiang ae9a7f34c0 SWDEV-504376 - Update definition for hipHostAllocWriteCombined (#887) 2025-09-10 09:22:24 -07:00
Julia Jiang 2889114b4a SWDEV-553031 - Adding Copyright in files for HIP (#827) 2025-09-05 13:48:43 -07:00
systems-assistant[bot] 9f11d73561 SWDEV-541096 - add hipEventWaitDefault and hipEventWaitExternal (#460)
Co-authored-by: Li, Todd tiantuo <Toddtiantuo.Li@amd.com>
2025-09-03 20:19:06 -07:00
systems-assistant[bot] 3e62d0d2e6 SWDEV-545953 - Add hipStreamGetId API header (#428)
Authored-by: Satyanvesh Dittakavi <Satyanvesh.Dittakavi@amd.com>
2025-08-26 21:33:26 +05:30
Jimbo c03048d68e Implement hipMemAllocationTypeUncached in hipMemCreate (#747)
* Revert "SWDEV-547589 - Add hipDeviceMallocUncached to hipMemCreate (#815)"

This reverts commit 5ce7103555.

* Revert "SWDEV-547589 - comment for flag hipDeviceMallocUncached in hipMemcreate (#339)"

This reverts commit 04dac5eae3.

* SWDEV-551942 - implement hipMemAllocationTypeUncached in hipMemCreate
2025-08-26 11:34:49 -04:00
Julia Jiang e9e98daf24 SWDEV-524348 - Update description for multiProcessorCount (#648) 2025-08-22 21:40:02 +05:30
Danylo Lytovchenko 2ff2316227 Adjust clang format to the new versions, revert broken macro layout (#714) 2025-08-22 17:23:22 +02:00
Danylo Lytovchenko f7338717ae SWDEV-470698 - fix formatting, add format check workflow (#657) 2025-08-20 19:58:06 +05:30
Manocha, Rahul 4f905a199e SWDEV-546295 - Fix typo in hipMemcpy3DOperand (#341)
Fix typo in hipMemcpy3DOperand

Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: 55b54866fb]
2025-08-15 16:00:04 -07:00
Li, Todd tiantuo dd818e76e5 SWDEV-546285 - add hipGetDriverEntryPoint (#323)
[ROCm/hip commit: 564ed55902]
2025-08-15 15:57:08 -07:00
Betigeri, Sourabh d04623c0af SWDEV-546294 - Declares hipMemPrefetchAsync_v2 (#306)
SWDEV-546294 - hipMemPrefetchAsync hipMemAdvise_v2

[ROCm/hip commit: 28ffff78b1]
2025-08-15 14:14:12 -07:00
Xie, Jiabao(Jimbo) 04dac5eae3 SWDEV-547589 - comment for flag hipDeviceMallocUncached in hipMemcreate (#339)
SWDEV-547589 - edited comment to support hipDeviceMallocUncached in hipMemCreate

Co-authored-by: Jimbo Xie <jiabaxie@amd.com>

[ROCm/hip commit: d2ec89d03a]
2025-08-15 14:06:44 -07:00
Manocha, Rahul 335ef77e69 SWDEV-546295 - Header changes for new APIs hipMemcpyBatch3DAsync and … (#307)
SWDEV-546295 - Header changes for new APIs hipMemcpyBatch3DAsync and hipMemcpy3DPeer

Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: 49ae3bfa72]
2025-08-15 13:20:12 -07:00
GunaShekar, Ajay c14988b6ee SWDEV-546178 - hipModuleLoadFatBinary hip header (#292)
* SWDEV-546178 - hipModuleLoadFatBinary hip header

* SWDEV-546178 - rebase and fix minor comments

* SWDEV-546178 - clean up header function declaration

[ROCm/hip commit: 4a4959b49e]
2025-08-15 13:13:16 -07:00
Arandjelovic, Marko a2409bebd0 SWDEV-542815 - Fix unused-parameter compiler warning (#263)
* SWDEV-542815 - Fix unused-parameter compiler warning

* SWDEV-542815 - Run clang-format

[ROCm/hip commit: f94cc3711e]
2025-08-15 12:41:35 -07:00
Manocha, Rahul 59536c1331 SWDEV-545952 - Header changes for hipStreamSet/GetAttribute (#311)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: c9b9717f57]
2025-08-15 12:27:37 -07:00
Manocha, Rahul 31cf3d8487 SWDEV-546328 - hipMemsetD2D API declarations (#294)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: fb19412da3]
2025-08-14 14:47:21 -07:00
GunaShekar, Ajay 5d53edf5b7 SWDEV-546179 - hipModuleGetFunctionCount header (#289)
[ROCm/hip commit: 3b5a55b0f5]
2025-08-13 20:28:28 -07:00
Chaudhary, Jatin Jaikishan 76f4ab5a79 SWDEV-540645 - use utility header for std::forward (#324)
[ROCm/hip commit: ea2d79c3e2]
2025-08-12 15:15:45 +05:30
Alves, Pedro d4b6a4086d SWDEV-545245 - Fix hipDeviceGetTexture1DLinearMaxWidth declaration (#280)
The hipDeviceGetTexture1DLinearMaxWidth declaration is incorrect.  It
has the wrong arguments.  This commit fixes it.

[ROCm/hip commit: e74b05a7bd]
2025-08-04 15:35:03 +05:30
Jiang, Julia 72d3a25bd9 SWDEV-545870 - Remove beta warning in HIP Graph APIs (#286)
[ROCm/hip commit: a7f453c5b1]
2025-07-28 21:24:07 +05:30
Manocha, Rahul 9d6260a810 SWDEV-540733 - Add PCI CHIP ID device attribute (#224)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: c5f4cdd304]
2025-07-16 16:37:43 -07:00
Sang, Tao fb408b70d1 SWDEV-508776 - support VGPRs validation (#77)
Add hipDeviceAttributeMaxAvailableVgprsPerThread attribute

[ROCm/hip commit: 3b6b7642a7]
2025-07-08 10:03:57 -04:00
Sang, Tao ae856901aa SWDEV-539145 - Support extended fine grained system memory pool (#195)
* Add a new hipHostMalloc() flag, hipHostMallocUncached, which forces pinned host memory
to be allocated from the extended fine-grained system memory pool.
* Add a new hipHostAlloc() flag, hipHostAllocUncached, which forces pinned host memory
to be allocated from the extended fine-grained system memory pool.
* Add a new hipHostRegister() flag, hipHostRegisterUncached, which forces host memory
to be mapped onto the extended fine-grained system memory pool.

[ROCm/hip commit: f310c7f642]
2025-07-01 10:09:47 +05:30
Jiang, Julia f890d0d4f7 SWDEV-461599 - scrub HCC in hip documents (#178)
[ROCm/hip commit: 8eca6453dc]
2025-06-26 15:06:31 +05:30
Betigeri, Sourabh a22275441e SWDEV-454484 - Removes const qualifier for hipMemcpyAtoH (#123)
[ROCm/hip commit: da1b73ee47]
2025-06-06 12:21:27 +05:30
Gill, Harkirat be1d99f411 SWDEV-461599 - Remove references to HCC in hip_runtime_api.h (#102)
[ROCm/hip commit: b301ef2282]
2025-05-30 11:44:01 +05:30
Betigeri, Sourabh 8ad6c23d58 SWDEV-508963 - Change hipMemcpy subset of methods to use const qualifier for src (#55) (#98)
Change-Id: I03bb31f48c3552774eb7354a50f5eb77e0c3d44b

[ROCm/hip commit: 754b45b91d]
2025-05-29 13:07:25 +05:30
Dittakavi, Satyanvesh 89e363cab1 SWDEV-530921 - Remove the usage of __AMDGCN_WAVEFRONT_SIZE as compile time constant (#82)
[ROCm/hip commit: b32b642fef]
2025-05-29 12:53:42 +05:30
Roy, Rakesh 1af8cd7acc SWDEV-412457 - remove unused standard headers (#97)
Change-Id: I0801701f0743a2b7da56696052291f60d693c24b

Co-authored-by: Chaudhary, Jatin Jaikishan <JatinJaikishan.Chaudhary@amd.com>

[ROCm/hip commit: e3cb5399cc]
2025-05-28 17:42:55 +05:30
Brzak, Branislav 159f1d3d93 SWDEV-518962 - Match hipCtxGetApiVersion signature with Cuda (#81)
[ROCm/hip commit: c80b7f7cb1]
2025-05-26 20:28:04 +05:30
Dittakavi, Satyanvesh a9d8bf303a SWDEV-433031 - Match the signature of HIPRTC APIs with NVRTC (#93)
Change-Id: If91f5a240ae119d0bc5a21c2d162f0c37df32606

[ROCm/hip commit: 301ebd5936]
2025-05-22 13:30:24 +05:30
Assiouras, Ioannis 0b64115b2a SWDEV-508965 - [6.4 Preview] Remove HIP_MEMSET_NODE_PARAMS struct (#92)
[ROCm/hip commit: 13556b72a7]
2025-05-21 21:16:14 +05:30