Longlong Yao
f402bf55eb
wsl/hsakmt: add VendorPacket to VersionCapInfo
...
Provide the query for hsa-runtime to decide whether dispatch
vendor packet to wsl-thunk
Signed-off-by: Longlong Yao <Longlong.Yao@amd.com >
Reviewed-by: Flora Cui <flora.cui@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/30 >
2025-11-05 18:53:35 +08:00
Flora Cui
7bf51b889f
wsl/hsakmt: enable sdma
...
Signed-off-by: Flora Cui <flora.cui@amd.com >
Reviewed-by: Shane Xiao <shane.xiao@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/17 >
2025-11-05 18:53:35 +08:00
Flora Cui
ab8771ccae
wsl/hsakmt: add sdma queue implementation
...
Signed-off-by: Flora Cui <flora.cui@amd.com >
Reviewed-by: Shane Xiao <shane.xiao@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/17 >
2025-11-05 18:53:35 +08:00
Flora Cui
e06e9b1d57
wsl/hsakmt: add more flag to control sdma
...
Signed-off-by: Flora Cui <flora.cui@amd.com >
Reviewed-by: Shane Xiao <shane.xiao@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/17 >
2025-11-05 18:53:35 +08:00
tiancyin
4ecb939111
wsl/hsakmt: remove Vmem and dmabuf from VersionCapability
...
Signed-off-by: tiancyin <tianci.yin@amd.com >
2025-11-05 18:53:35 +08:00
Longlong Yao
48395b2295
wsl/hsakmt: rename rocr_proxy to thunk_proxy
...
Signed-off-by: Longlong Yao <Longlong.Yao@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/26 >
2025-11-05 18:53:35 +08:00
Longlong Yao
c6fe6807e8
wsl/hsakmt: Set uncached for kernel arg region
...
Signed-off-by: Longlong Yao <Longlong.Yao@amd.com >
Reviewed-by: Flora Cui <flora.cui@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/5 >
2025-11-05 18:53:35 +08:00
lyndonli
bc9b11d754
wsl/hsakmt: Remove pre-allocation free memory check
...
This change removes the check for sufficient free memory before allocation.
The previous check could cause performance degradation. Reserving a portion
of system memory helps prevent system hangs due to insufficient memory.
However, if free memory is still insufficient, memory allocation may still
lead to system hangs.
Signed-off-by: lyndonli <Lyndon.Li@amd.com >
Reviewed-by: Flora Cui <flora.cui@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/10 >
2025-11-05 18:53:35 +08:00
tiancyin
90759bc89a
wsl/hsakmt: implement vmm
...
Reviewed-by: Shi.Leslie <Yuliang.Shi@amd.com >
Reviewed-by: Aaron Liu <aaron.liu@amd.com >
Signed-off-by: tiancyin <tianci.yin@amd.com >
2025-11-05 18:53:35 +08:00
tiancyin
d00c4446e9
wsl/hsakmt: just reserve SVM virtual memory for scratch pool
...
the allocation from InitScratchPool() does not really need VRAM
Reviewed-by: Flora Cui <flora.cui@amd.com >
Signed-off-by: tiancyin <tianci.yin@amd.com >
2025-11-05 18:53:35 +08:00
lyndonli
b4b0b54c79
wsl/hsakmt: Add clear_allocation_map
...
If fork() is called, clear allocation map
that is invalid in the child process.
Signed-off-by: lyndonli <Lyndon.Li@amd.com >
Reviewed-by: Shi.Leslie Yuliang.Shi@amd.com
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/3 >
2025-11-05 18:53:35 +08:00
Longlong Yao
ccb3950068
wsl/hsakmt: clean up namespace
...
Signed-off-by: Longlong Yao <Longlong.Yao@amd.com >
Reviewed-by: lyndonli <Lyndon.Li@amd.com >
Reviewed-by: Flora Cui <flora.cui@amd.com >
Part-of: <http://10.67.69.192/wsl/libhsakmt/-/merge_requests/7 >
2025-11-05 18:53:35 +08:00
Flora Cui
79a63cf292
wsl/hsakmt: initial commit
...
Signed-off-by: lyndonli <Lyndon.Li@amd.com >
Signed-off-by: Horatio Zhang <Hongkun.Zhang@amd.com >
Signed-off-by: Shi.Leslie <Yuliang.Shi@amd.com >
Signed-off-by: LonglongYao <Longlong.Yao@amd.com >
Signed-off-by: tiancyin <tianci.yin@amd.com >
Signed-off-by: Frank Min <Frank.Min@amd.com >
Signed-off-by: Aaron Liu <aaron.liu@amd.com >
Signed-off-by: Shane Xiao <shane.xiao@amd.com >
Signed-off-by: Lang Yu <lang.yu@amd.com >
Signed-off-by: Feifei Xu <Feifei.Xu@amd.com >
Signed-off-by: Ruili Ji <ruiliji2@amd.com >
Signed-off-by: Qiang Yu <qiang.yu@amd.com >
Signed-off-by: Flora Cui <flora.cui@amd.com >
2025-11-05 18:53:15 +08:00
systems-assistant[bot]
16334bc15d
SWDEV-525244 - Additional graph validation test ( #584 )
...
* SWDEV-525244 - Additional graph validation test
* SWDEV-525244 - Change capture mode to relaxed
* SWDEV-525244 - Additional comments
---------
Co-authored-by: Vladana Stojiljkovic <Vladana.Stojiljkovic@amd.com >
2025-11-05 09:32:47 +01:00
vstojilj
18d4fc460b
SWDEV-541362 - Fix error code mismatch in test ( #1377 )
2025-11-05 09:31:30 +01:00
vstojilj
7d078b9875
SWDEV-553920 - Fix Unit_hipStreamCreateWithPriority_ValidateWithEvents ( #1622 )
2025-11-05 09:31:01 +01:00
pcritchl-amd
8d31383dfe
SWDEV-562996 - Build fix: Ubertrace callback calling convention mismatch on x86 ( #1587 )
...
Co-authored-by: Rakesh Roy <137397847+rakesroy@users.noreply.github.com >
2025-11-05 10:37:45 +05:30
Jeff Jiang
8b86f61514
* rocDecode: Added several fixes to samples. ( #668 )
...
- Fixed the build error with videodecodepicfiles sample.
- Added error handling of sample app command option combination of memory type OUT_SURFACE_MEM_NOT_MAPPED and MD5 generation.
[ROCm/rocdecode commit: c388518965 ]
2025-11-04 21:12:56 -05:00
Jeff Jiang
c388518965
* rocDecode: Added several fixes to samples. ( #668 )
...
- Fixed the build error with videodecodepicfiles sample.
- Added error handling of sample app command option combination of memory type OUT_SURFACE_MEM_NOT_MAPPED and MD5 generation.
2025-11-04 21:12:56 -05:00
Sourabh U Betigeri
78e9e47852
SWDEV-551244 - Fixes CUDA 13 compilation issues ( #1237 )
2025-11-04 16:56:49 -08:00
David Galiffi
1e501dd89a
Free runner disk space ( #1693 )
...
Motivation:
Basic runners are frequently running out of space
Technical Details:
Running autoclean after package installations.
Use the jlumbroso/free-disk-space action.
2025-11-04 17:31:23 -05:00
Scott Todd
fdbafd7757
Revert "SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags ( #1057 )" ( #1690 )
...
Reverts ROCm/rocm-systems#1057
Suspected of breaking the build, see https://github.com/ROCm/rocm-systems/pull/1057#issuecomment-3487715129
Logs: https://github.com/ROCm/rocm-systems/actions/runs/19062134668/job/54444052479#step:12:315
```
[rocprofiler-sdk] FAILED: source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o
[rocprofiler-sdk] ccache /opt/rh/gcc-toolset-12/root/usr/bin/c++ -DAMD_INTERNAL_BUILD=1 -DGLOG_USE_GLOG_EXPORT -DROCPROFILER_DL=1 -DROCPROFILER_HAS_GHC_LIB_FILESYSTEM=1 -DROCPROFILER_SDK_USE_SYSTEM_RCCL=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCDECODE=0 -DROCPROFILER_SDK_USE_SYSTEM_ROCJPEG=0 -DUSE_PROF_API=1 -DYAML_CPP_STATIC_DEFINE -D__HIP_PLATFORM_AMD__=1 -Drocprofiler_EXPORTS=1 -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/yaml-cpp/include -I/__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/ptl/source -I/__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/ptl/source -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/clr/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/core/ROCR-Runtime/dist/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/filesystem/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/profiler/rocprofiler-sdk/build/external/glog -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/glog/src -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/fmt/include -isystem /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/external/elfio -isystem /__w/rocm-systems/rocm-systems/TheRock/build/compiler/amd-comgr-stub/dist/include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/libdrm/build/stage/lib/rocm_sysdeps/lib/pkgconfig/../../include/libdrm -isystem /__w/rocm-systems/rocm-systems/TheRock/build/third-party/sysdeps/linux/elfutils/build/dist/lib/rocm_sysdeps/include -O3 -DNDEBUG -std=c++17 -fPIC -fvisibility=hidden -fvisibility-inlines-hidden -W -Wall -Wno-unknown-pragmas -faligned-new -rdynamic -fstack-protector-strong -Wstack-protector -MD -MT source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -MF source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o.d -o source/lib/rocprofiler-sdk/CMakeFiles/rocprofiler-sdk-object-library.dir/hip/abi.cpp.o -c /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp
[rocprofiler-sdk] In file included from /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/hip/abi.cpp:26:
[rocprofiler-sdk] /__w/rocm-systems/rocm-systems/projects/rocprofiler-sdk/source/lib/common/abi.hpp:62:27: error: static assertion failed: size of the API table struct has changed. Update the STEP_VERSION number (or in rare cases, the MAJOR_VERSION number)
[rocprofiler-sdk] 62 | sizeof(TABLE) == ::rocprofiler::common::abi::compute_table_offset(NUM), \
```
2025-11-04 14:29:58 -08:00
marandje
811609c0e1
SWDEV-523137 - Fix and enable a few negative tests on NV ( #1157 )
2025-11-04 23:28:11 +01:00
David Galiffi
4b0fb2cdf5
Rename "corr_id" to "stack_id" in Perfetto annotations to match new n… ( #1618 )
...
* Rename "corr_id" to "stack_id" in Perfetto annotations to match new naming in schema.
Signed-off-by: David Galiffi <David.Galiffi@amd.com >
* correlation_id.ancestor was not added until ROCPROFILER_VERSION 1.0
---------
Signed-off-by: David Galiffi <David.Galiffi@amd.com >
2025-11-04 14:20:11 -05:00
Joseph Macaranas
b19cf0aadf
Revert "Disable therock summary check, make it always positive ( #1675 )" ( #1686 )
...
This reverts commit 0c32b90130 .
2025-11-04 14:17:37 -05:00
Pratik Basyal
6356c179ff
[ROCm Systems Profiler] 7.1.0 Formatting updated for understanding rocpd output docs" ( #1663 )
...
* Formatting udpated for ROCm Systems rocpd docs"
* Minor change
* Formatting in profiler script fixed"
* Sphinx warnings and formatting fixes
* Formatting fixed
* Formatting fixed
* Collapsible code block added
* Dixygne change reverted
2025-11-04 12:48:02 -05:00
erieaton-amd
4aaa1a27f5
Fix rocshmem_ptr definition signature ( #306 )
...
Makes the signature of the definition match the declaration in rocshmem.hpp.
Signed-off-by: Eric Eaton <erieaton@amd.com >
[ROCm/rocshmem commit: 7b5765ec0e ]
2025-11-04 12:42:47 -05:00
erieaton-amd
7b5765ec0e
Fix rocshmem_ptr definition signature ( #306 )
...
Makes the signature of the definition match the declaration in rocshmem.hpp.
Signed-off-by: Eric Eaton <erieaton@amd.com >
2025-11-04 12:42:47 -05:00
Pratik Basyal
9d84958527
JSON profiler option removed ( #1649 )
2025-11-04 17:49:22 +01:00
Arm Patinyasakdikul
25005c1cce
proxy: handle progressOps return code properly. ( #2029 )
...
[ROCm/rccl commit: d6a53d2022 ]
2025-11-04 09:09:50 -06:00
Arm Patinyasakdikul
d6a53d2022
proxy: handle progressOps return code properly. ( #2029 )
2025-11-04 09:09:50 -06:00
Sam Ruscica
757de39caa
Updated amdFileRead/Write in rocdevice to support windows build ( #1435 )
...
* Updated amdFileRead in rocdevice to support windows build
* Updated amdFileRead in rocdevice to support windows build
2025-11-04 10:03:03 -05:00
Danylo Lytovchenko
0c32b90130
Disable therock summary check, make it always positive ( #1675 )
2025-11-04 12:58:53 +01:00
marantic-amd
f4a65875d5
Added SQLite3 as a external submodule to rocprofiler-systems ( #1348 )
2025-11-04 11:33:25 +01:00
Todd tiantuo Li
7573fa168d
SWDEV-554372 - Add 3 HIP_GET_PROC_ADDRESS_xxx flags ( #1057 )
2025-11-04 00:16:12 -08:00
Aravind Ravikumar
4babb01f4d
Add S3 upload support for Perf and test reports by run ID and architecture ( #2020 )
...
* Commits to enable scp report copy
* Added Post report upload step
* Added extra arg for fetch artifacts
* Moved to a specific commit
* Add write permissions to s3
* Added comment for TheRock sha commit date
---------
Co-authored-by: arravikum <arravikum@amd.com >
[ROCm/rccl commit: 07f8f6d6c6 ]
2025-11-03 19:09:34 -05:00
Aravind Ravikumar
07f8f6d6c6
Add S3 upload support for Perf and test reports by run ID and architecture ( #2020 )
...
* Commits to enable scp report copy
* Added Post report upload step
* Added extra arg for fetch artifacts
* Moved to a specific commit
* Add write permissions to s3
* Added comment for TheRock sha commit date
---------
Co-authored-by: arravikum <arravikum@amd.com >
2025-11-03 19:09:34 -05:00
nawrinsu
6d22ce9b1a
Fix protocol and channel override when tuner is used ( #1985 )
...
* Fix protocol and channel override when tuner is used
* Added comment
* Fix README for basic tuner implementation
[ROCm/rccl commit: 166268d715 ]
2025-11-03 13:56:34 -08:00
nawrinsu
166268d715
Fix protocol and channel override when tuner is used ( #1985 )
...
* Fix protocol and channel override when tuner is used
* Added comment
* Fix README for basic tuner implementation
2025-11-03 13:56:34 -08:00
MachineTom
fb006546d0
SWDEV-1 - Fix a typo ( #1615 )
...
* SWDEV-1 - Fix a typo
Fix a typo.
Remove unnecessary log.
* Removing patch
---------
Co-authored-by: geomin12 <geomin12@amd.com >
Co-authored-by: Scott Todd <scott.todd0@gmail.com >
2025-11-03 12:59:00 -08:00
marandje
d411d101fb
SWDEV-557363 - Fix and enable Unit_hipWallClock64_Positive_Basic ( #1223 )
2025-11-03 21:08:33 +01:00
marandje
1ccc0ce2e4
SWDEV-555299 - Fix and enable Unit_Device_modf_modff_Negative_RTC ( #1012 )
2025-11-03 21:08:09 +01:00
Ammar ELWazir
da297d46e8
[ROCProfiler-sdk] [Docs CI] Refactor Git setup and CMake commands in workflow ( #1662 )
2025-11-03 12:12:35 -06:00
German Andryeyev
ee1158b7b8
rocr: Fix Windows build and Ctz implementation ( #1634 )
2025-11-03 12:07:11 -05:00
Ajay GunaShekar
d998a5280a
Revert "clr: SWDEV-547890 - Maintain an MQD for the emulated AQL queue ( #1316 )" ( #1654 )
...
This reverts commit f5bbb09c0d .
windows build failure and requires PAL update
2025-11-03 08:17:26 -08:00
Ammar ELWazir
9fa1d1b97e
[ROCProfiler-SDK] Remove 'gfx900' and 'gfx940' from GPU targets ( #1661 )
...
* [ROCProfiler-SDK] Remove 'gfx900' and 'gfx940' from GPU targets
* Remove unsupported GPU targets from workflow
* Remove gfx900 and gfx940 from GPU targets
2025-11-03 11:09:29 -05:00
Kian Cossettini
2a080641a1
[rocprofiler-systems] Consolidate CTests to tests/ folder ( #1461 )
...
* Consolidate CTests to tests/ folder
* Remove comment
* Consolidate CTests to tests/ folder
* Remove comment
* Separate source code and test code for thread-limit into appropriate folders
* Remove sleeper.cpp and instead use linux sleep cmd
* Merge python-console tests into python-tests
2025-11-03 11:03:35 -05:00
sluzynsk-amd
9f940c7265
Add missing API calls to rocprofiler ( #1599 )
...
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com >
2025-11-03 09:40:16 -06:00
Ammar ELWazir
fee5bd9a4e
Fixing ROCProfiler Register CI & ROCProfiler-SDK Docs CI ( #1570 )
...
---------
Co-authored-by: bgopesh <gopesh.bhardwaj@amd.com >
2025-11-03 09:24:32 -06:00
systems-assistant[bot]
740b27528f
kfdtest: Enable GPU selection via CLI for multi-GPU tests ( #245 )
...
* kfdtest: Enable GPU selection via CLI for multi-GPU tests
Replaced environment variable-based GPU selection with
GPU selection via command-line parameter --concurrentnodes (-c)
Modified g_TestGPUsNum to be passed in via command-line
parameter --testnodenum (t)
Signed-off-by: Alysa Liu <Alysa.Liu@amd.com >
* kfdtest: Enable GPU selection via CLI for multi-GPU tests
Replaced environment variable-based GPU selection with
GPU selection via command-line parameter --concurrentnodes (-c)
Modified g_TestGPUsNum to be passed in via command-line
parameter --testnodenum (t)
---------
Signed-off-by: Alysa Liu <Alysa.Liu@amd.com >
Co-authored-by: Alysa Liu <Alysa.Liu@amd.com >
2025-11-03 09:27:38 -05:00