Wykres commitów

6399 Commity

Autor SHA1 Wiadomość Data
randyh62 92b3629b25 Update environment.yml (#1884)
Update path to requirements.txt
2025-11-17 12:10:56 -08:00
Istvan Kiss a0f53a5fdb Sync HIP documentation leftover (#1597)
* Sync HIP documentation leftover

* Update HIP docs environment.yaml and doxyfile
2025-11-13 09:19:33 -08:00
Satyanvesh Dittakavi 07dd4c85e7 SWDEV-546308 - Implement hipKernelGetParamInfo API (#1783) 2025-11-12 14:09:26 +05:30
Todd tiantuo Li cf536a8c1a SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1771) 2025-11-10 23:29:40 -08:00
Julia Jiang 68c2a2b86b SWDEV-565694 - Fix config errors while building HIP documentation (#1767) 2025-11-10 10:30:36 -05:00
Rakesh Roy 9cac2e46e4 SWDEV-565668 - Bump minor version for ROCm 7.2 (#1762)
Additionally remove cmake option HIP_OFFICIAL_BUILD
2025-11-10 18:55:52 +05:30
Scott Todd fdbafd7757 Revert "SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1057)" (#1690)
Reverts ROCm/rocm-systems#1057

Suspected of breaking the build, see https://github.com/ROCm/rocm-systems/pull/1057#issuecomment-3487715129

Logs: https://github.com/ROCm/rocm-systems/actions/runs/19062134668/job/54444052479#step:12:315
```
[rocprofiler-sdk] FAILED: source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o 
[rocprofiler-sdk] ccache /opt/rh/gcc-toolset-12/root/usr/bin/c++ -DAMD_INTERNAL_BUILD=1 -DGLOG_USE_GLOG_EXPORT -DROCPROFILER_DL=1 -DROCPROFILER_HAS_GHC_LIB_FILESYSTEM=1 -DROCPROFILER_SDK_USE_SYSTEM_RCCL=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCDECODE=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCJPEG=0 -DUSE_PROF_API=1 -DYAML_CPP_STATIC_DEFINE -D__HIP_PLATFORM_AMD__=1 -Drocprofiler_EXPORTS=1 -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/yaml-cpp/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/ptl/source -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/ptl/source -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/clr/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/ROCR-Runtime/dist/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/filesystem/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/glog -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/glog/src -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/fmt/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/elfio -isystem /__w/rocm-systems/rocm-systems/TheRock/build/compiler/amd-comgr-stub/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include/libdrm -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/elfutils/build/dist/lib/rocm_sysdeps/include -O3 -DNDEBUG -std=c++17 -fPIC -fvisibility=hidden -fvisibility-inlines-hidden -W -Wall -Wno-unknown-pragmas -faligned-new -rdynamic -fstack-protector-strong -Wstack-protector -MD -MT source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -MF source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o.d -o source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -c /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp
[rocprofiler-sdk] In file included from /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp:26:
[rocprofiler-sdk] /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/common/abi.hpp:62:27: error: static assertion failed: size of the API table struct has changed. Update the STEP_VERSION number (or in rare cases, the MAJOR_VERSION number)
[rocprofiler-sdk]    62 |             sizeof(TABLE) == ::rocprofiler::common::abi::compute_table_offset(NUM),                \
```
2025-11-04 14:29:58 -08:00
Todd tiantuo Li 7573fa168d SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags (#1057) 2025-11-04 00:16:12 -08:00
dsicarov-amd 4915496bf9 SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock API (#899)
* SWDEV-533237 Add initial support for hipOccupancyAvailableDynamicSMemPerBlock API

* SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock wrapper for nvidia

* SWDEV-533237 Add implementation of hipOccupancyAvailableDynamicSMemPerBlock API

* SWDEV-533237 Add LDSAlignment field in Isa table

---------

Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-29 10:58:42 +01:00
Istvan Kiss 197f73dac9 Sync HIP documentation 2025-10-20 (#1258)
* Add examples to tools folder
* Correct P2P memory access section
* Sync poriting guide
* Add HIP Graph tutorial
* Add hint about using amdgpu-dkms for IPC API
* Add a few more env variables
2025-10-29 07:42:06 +01:00
Rahul Manocha f5d901f016 SWDEV-546311 - implement hipKernelGetLibrary & hipLibraryEnumerateKer… (#1143)
* SWDEV-546311 - implement hipKernelGetLibrary & hipLibraryEnumerateKernels API

* Fix for LibraryEnumerateKernel and KernelGetName

* Update Enumerate Kernels to handle 0 numKernels

* Minor fixes to function names

* fix error checking in internal function

* Update changelog for new apis

---------

Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-27 14:13:17 -07:00
MachineTom 5f76cb916d SWDEV-555888 - Refactor Numa code (#1191)
1. Create a set of mini numa interface.
In Linux, the interface is based on system call rather than libnuma.
In Windows, the interface can also work, but the policy class is dummy.
Different from Linux, Windows doesn't provide numactl tool or numa lib to setup numa policy, thus
the default policy is followed in Windows, that is, using the closest host numa node to allocate
pinned host memory in hipHostMalloc().
To get the closest host numa node of a GPU device, you need query the new attribute
hipDeviceAttributeHostNumaId. Then you can create a thread with CPU affinity on the numa node.
For example, reference the test in hip-tests/catch/perftests/memory/hipPerfHostNumaAllocWin.cc.

2. Remove pfnSetThreadGroupAffinity and pfnGetNumaNodeProcessorMaskEx as the functions have been exposed since Win7 and Win server 2008.

3. Other minor fixes.
2025-10-23 21:56:15 -04:00
Jimbo 37f2be9140 SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister (#962)
* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

* SWDEV-554174 Added hipHostRegisterIoMemory flag in test cases

* SWDEV-554174 : Did formatting corrections

* SWDEV-554608 - set HSA_AMD_MEMORY_POOL_UNCACHED_FLAG if IoMemory is set

* SWDEV-554608 - set HSA_AMD_MEMORY_POOL_UNCACHED_FLAG if IoMemory is set

* SWDEV-554608 - Add hipHostRegisterIoMemory for hipHostRegister

---------

Co-authored-by: Anavena Venkatesh <Anavena.Venkatesh@amd.com>
Co-authored-by: Rambabu Swargam <rambabu.swargam@amd.com>
2025-10-22 20:25:59 -04:00
Julia Jiang 5919671102 SWDEV-560097 - Fix math API doc for __frcp_rn and __drcp_rn (#1393) 2025-10-17 11:20:39 -04:00
Gerardo Hernandez fc5551a724 SWDEV-536360 - fix another bullet point in reduce sync operations section not being displayed on its own line (#1374) 2025-10-15 14:51:43 +01:00
Gerardo Hernandez bfbc48bb0e SWDEV-536360 - fix bullet points in reduce sync operations section not being displayed on different lines in the browser (#1346) 2025-10-14 22:02:34 +01:00
Satyanvesh Dittakavi 46e683d41a SWDEV-545950 - Add hipStreamCopyAttributes API Implementation (#914)
* SWDEV-545950 - Add hipStreamCopyAttributes API Implementation

* Add unit test for hipStreamCopyAttributes API

* Add ChangeLog and nvidia mapping for the API

* Update rocprofiler-sdk with new HIP API details

* [rocprofiler-sdk] handle hipStreamCopyAttributes in stream tracing service

- this new HIP function has multiple stream arguments and needs to be skipped because it does not have an explicit create/destroy/set functionality

* Update HIP_RUNTIME_API_TABLE_STEP_VERSION in clr and rocprofiler-sdk

* Resolve merge conflicts

---------

Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>
2025-10-12 19:57:05 +05:30
Istvan Kiss 51452b643f Revise the section on CU & WGP modes (#1255) 2025-10-10 09:14:03 -07:00
Rahul Manocha 27ec19116d SWDEV-557828 - fix hip-tests on cuda (#1152)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-10-07 08:28:56 -07:00
Istvan Kiss 83fb0c8c47 SWDEV-541514 - Docs update 2025-09-15 (#993)
Co-authored-by: Julia Jiang <56359287+jujiang-del@users.noreply.github.com>
2025-09-24 09:57:00 -07:00
MachineTom 25922d08c3 SWDEV-539145 - Return error when ext_fine_grain_pool unavailable (#877)
Return error when ext_fine_grain_pool is unavailable for
hipHostMallocUncached, hipHostAllocUncached and
hipExtHostRegisterUncached.
Disable related tests on Navi4x where
ext_fine_grain_pool is unavailable
2025-09-21 19:25:28 -04:00
Jatin Chaudhary e79eaaa8a5 SWDEV-546287 - Implement hipLibrary load/unload (#975) 2025-09-19 22:23:49 +01:00
Julia Jiang 7ab2e49c57 SWDEV-554072 - Update description for hipModuleLoadData (#929) 2025-09-16 17:10:06 -04:00
Joseph Macaranas dd1a2dbf8a Fix LICENSE path for opencl build (#939) 2025-09-10 17:54:22 -04:00
Julia Jiang ae9a7f34c0 SWDEV-504376 - Update definition for hipHostAllocWriteCombined (#887) 2025-09-10 09:22:24 -07:00
Julia Jiang 4981fa2139 SWDEV-552580 - Update instructions to build HIP from source in monorepo (#806) 2025-09-10 09:21:27 -07:00
Joseph Macaranas 696881ae82 LICENSE clean up (#919)
- Clean up and standardization of MIT licenses after discussion with legal team.
- Update README.md with blurb for top-level files.
- MIT License explicitly mentioned for relevant projects.
- Removal of years.
- Copyright attribution should be to `Advanced Micro Devices, Inc.` and not `AMD ROCm(TM) Software`
- Removal of `All rights reserved.`
- Reduce line width of the text for readability.
- Add clear visual separators for additional licenses.
- Convert text files to markdown format for aforementioned separators.
- Update build scripts to point to renamed files.
- Fixed SMI doc references

Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>
2025-09-10 12:06:14 -04:00
Julia Jiang 2889114b4a SWDEV-553031 - Adding Copyright in files for HIP (#827) 2025-09-05 13:48:43 -07:00
systems-assistant[bot] 9f11d73561 SWDEV-541096 - add hipEventWaitDefault and hipEventWaitExternal (#460)
Co-authored-by: Li, Todd tiantuo <Toddtiantuo.Li@amd.com>
2025-09-03 20:19:06 -07:00
systems-assistant[bot] 0468340d03 SWDEV-524815 - Specify path to hipconfig.exe instead of hipconfig on windows (#421)
Co-authored-by: Ioannis Assiouras <Ioannis.Assiouras@amd.com>
2025-09-02 15:15:26 +01:00
Julia Jiang 17ffa13035 SWDEV-538999 - Make correction in porting guid for launch_bounds (#646) 2025-08-26 16:55:52 -04:00
systems-assistant[bot] 3e62d0d2e6 SWDEV-545953 - Add hipStreamGetId API header (#428)
Authored-by: Satyanvesh Dittakavi <Satyanvesh.Dittakavi@amd.com>
2025-08-26 21:33:26 +05:30
Jimbo c03048d68e Implement hipMemAllocationTypeUncached in hipMemCreate (#747)
* Revert "SWDEV-547589 - Add hipDeviceMallocUncached to hipMemCreate (#815)"

This reverts commit 5ce7103555.

* Revert "SWDEV-547589 - comment for flag hipDeviceMallocUncached in hipMemcreate (#339)"

This reverts commit 04dac5eae3.

* SWDEV-551942 - implement hipMemAllocationTypeUncached in hipMemCreate
2025-08-26 11:34:49 -04:00
Julia Jiang e9e98daf24 SWDEV-524348 - Update description for multiProcessorCount (#648) 2025-08-22 21:40:02 +05:30
Danylo Lytovchenko 2ff2316227 Adjust clang format to the new versions, revert broken macro layout (#714) 2025-08-22 17:23:22 +02:00
Joseph Macaranas f5ac5efd79 Language Runtime README Changes (#650)
- Renaming old `README.md` files to keep their information intact.
- Default `README.md` files will have the deprecation notice to be mirrored back into the individual repos.
- Change ROCR-Runtime mirroring to `develop` branch.
2025-08-20 15:48:55 -04:00
Danylo Lytovchenko f7338717ae SWDEV-470698 - fix formatting, add format check workflow (#657) 2025-08-20 19:58:06 +05:30
Manocha, Rahul 4f905a199e SWDEV-546295 - Fix typo in hipMemcpy3DOperand (#341)
Fix typo in hipMemcpy3DOperand

Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: 55b54866fb]
2025-08-15 16:00:04 -07:00
Li, Todd tiantuo dd818e76e5 SWDEV-546285 - add hipGetDriverEntryPoint (#323)
[ROCm/hip commit: 564ed55902]
2025-08-15 15:57:08 -07:00
Betigeri, Sourabh d04623c0af SWDEV-546294 - Declares hipMemPrefetchAsync_v2 (#306)
SWDEV-546294 - hipMemPrefetchAsync hipMemAdvise_v2

[ROCm/hip commit: 28ffff78b1]
2025-08-15 14:14:12 -07:00
Xie, Jiabao(Jimbo) 04dac5eae3 SWDEV-547589 - comment for flag hipDeviceMallocUncached in hipMemcreate (#339)
SWDEV-547589 - editted comment to support hipDeviceMallocUncached in hipMemCreate

Co-authored-by: Jimbo Xie <jiabaxie@amd.com>

[ROCm/hip commit: d2ec89d03a]
2025-08-15 14:06:44 -07:00
Manocha, Rahul 335ef77e69 SWDEV-546295 - Header changes for new APIs hipMemcpyBatch3DAsync and … (#307)
SWDEV-546295 - Header changes for new APIs hipMemcpyBatch3DAsync and hipMemcpy3DPeer

Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: 49ae3bfa72]
2025-08-15 13:20:12 -07:00
GunaShekar, Ajay c14988b6ee SWDEV-546178 - hipModuleLoadFatBinary hip header (#292)
* SWDEV-546178 - hipModuleLoadFatBinary hip header

* SWDEV-546178 - rebase and fix minor comments

* SWDEV-546178 - clean up header function declaration

[ROCm/hip commit: 4a4959b49e]
2025-08-15 13:13:16 -07:00
Arandjelovic, Marko a2409bebd0 SWDEV-542815 - Fix unused-parameter compiler warning (#263)
* SWDEV-542815 - Fix unused-parameter compiler warning

* SWDEV-542815 - Run clang-format

[ROCm/hip commit: f94cc3711e]
2025-08-15 12:41:35 -07:00
Hernandez, Gerardo f9dba251b7 SWDEV-420237 - add documentation for reduce sync operations (#21)
* SWDEV-420237 - add documentation for reduce sync operations

* SWDEV-515087 - resolve some comments

* SWDEV-420237 - Rewrite sentence about the usage of HIP_ENABLE_WARP_SYNC_TYPES

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>

* SWDEV-420237 - Update docs as HIP_ENABLE_WARP_SYNC_BUILTINS is not used anymore

* SWDEV-420237 - change wording 'would be' for 'are'

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - fix redundant wording

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - use 'and'/'or' consistently when talking about supported types

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - Do not use future tense in reduce sync operation documentation

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - make performance remark an 'informational note', regarding reduce sync operations performance

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - do not use future tense

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-543182 - changing more future tense that should be present tense

---------

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>
Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

[ROCm/hip commit: 07ddf90631]
2025-08-15 12:39:33 -07:00
Manocha, Rahul 59536c1331 SWDEV-545952 - Header changes for hipStreamSet/GetAttribute (#311)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: c9b9717f57]
2025-08-15 12:27:37 -07:00
Manocha, Rahul 31cf3d8487 SWDEV-546328 - hipMemsetD2D API declarations (#294)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: fb19412da3]
2025-08-14 14:47:21 -07:00
GunaShekar, Ajay 5d53edf5b7 SWDEV-546179 - hipModuleGetFunctionCount header (#289)
[ROCm/hip commit: 3b5a55b0f5]
2025-08-13 20:28:28 -07:00
Chaudhary, Jatin Jaikishan 76f4ab5a79 SWDEV-540645 - use utility header for std::forward (#324)
[ROCm/hip commit: ea2d79c3e2]
2025-08-12 15:15:45 +05:30
Alves, Pedro d4b6a4086d SWDEV-545245 - Fix hipDeviceGetTexture1DLinearMaxWidth declaration (#280)
The hipDeviceGetTexture1DLinearMaxWidth declaration is incorrect.  It
has the wrong arguments.  This commit fixes it.

[ROCm/hip commit: e74b05a7bd]
2025-08-04 15:35:03 +05:30