Sourav Chakraborty
a3a5631f53
Fix incorrect benchmark name in JitterBench script ( #1983 )
...
[ROCm/rccl commit: 57286d5df3 ]
2025-10-21 12:52:20 -05:00
Sourav Chakraborty
57286d5df3
Fix incorrect benchmark name in JitterBench script ( #1983 )
2025-10-21 12:52:20 -05:00
Sourav Chakraborty
046af13751
Fix build failure in rccl_prim_test ( #1984 )
...
Added missing header in rccl_prim_test
[ROCm/rccl commit: 5b345d105c ]
2025-10-21 12:51:14 -05:00
Sourav Chakraborty
5b345d105c
Fix build failure in rccl_prim_test ( #1984 )
...
Added missing header in rccl_prim_test
2025-10-21 12:51:14 -05:00
systems-assistant[bot]
bebe65f104
rocr: fix nullptr dereference ( #262 )
...
* rocr: fix nullptr dereference
Return early in the case that malloc fails to avoid dereferencing of a
null pointer on eventDescrp.
Signed-off-by: Sunday Clement <Sunday.Clement@amd.com >
* rocr: Fix potential nullptr dereference
returns early if sym->section() fails to properly acquire the object.
Signed-off-by: Sunday Clement <Sunday.Clement@amd.com >
---------
Signed-off-by: Sunday Clement <Sunday.Clement@amd.com >
Co-authored-by: Sunday Clement <Sunday.Clement@amd.com >
2025-10-21 13:49:01 -04:00
Jeremy Newton
7f6938c7d1
Drop libva-amdgpu use for RHEL8 ( #660 )
...
After further inspection, it looks like libva is new enough for decode
support, so we've dropped the requirement on the mesa end. Some encode
features are missing on < 2.16, but as per our current RHEL8 support
policy, these features are no longer supported.
Ubuntu 22.04 is still supported to have a full encode feature set, so we
can't drop the libva requirement quite yet.
SWDEV-548213
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com >
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com >
[ROCm/rocdecode commit: a721963b3c ]
2025-10-21 10:19:51 -07:00
Jeremy Newton
a721963b3c
Drop libva-amdgpu use for RHEL8 ( #660 )
...
After further inspection, it looks like libva is new enough for decode
support, so we've dropped the requirement on the mesa end. Some encode
features are missing on < 2.16, but as per our current RHEL8 support
policy, these features are no longer supported.
Ubuntu 22.04 is still supported to have a full encode feature set, so we
can't drop the libva requirement quite yet.
SWDEV-548213
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com >
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com >
2025-10-21 10:19:51 -07:00
Jonathan R. Madsen
55ca813ded
roctx annotation in mandelbrot benchmark ( #1084 )
2025-10-21 11:21:28 -05:00
Jonathan R. Madsen
6b0f87c3a6
[rocprofiler-sdk][rocpd] Optional message field in region/sample extdata ( #1170 )
2025-10-21 11:20:46 -05:00
David Yat Sin
e2f3bd2429
Changes for RDMA with VMM ( #801 )
...
* rocr: Add support for VMM and RDMA
Add extra CPU mapping so that kernel-mode drivers can look up the memory
mapping by virtual address.
* Update projects/rocr-runtime/runtime/hsa-runtime/core/runtime/runtime.cpp
Co-authored-by: Yiannis Papadopoulos <102817138+ypapadop-amd@users.noreply.github.com >
* Update projects/rocr-runtime/runtime/hsa-runtime/core/inc/runtime.h
Co-authored-by: Yiannis Papadopoulos <102817138+ypapadop-amd@users.noreply.github.com >
* rocr: Honor uncache flag in memory_lock_to_pool()
Also, combined several flag options used in apis into a
single integer.
Signed-off-by: Chris Freehill <cfreehil@amd.com >
* rocr: Fix hsa_amd_pointer_info on CPU agents
Fix hsa_amd_pointer_info query returning allowd on VMM pointers for CPU
agents when CPU mapping was mapped with PROT_NONE.
---------
Signed-off-by: Chris Freehill <cfreehil@amd.com >
Co-authored-by: Yiannis Papadopoulos <102817138+ypapadop-amd@users.noreply.github.com >
Co-authored-by: Chris Freehill <cfreehil@amd.com >
Co-authored-by: cfreeamd <166262151+cfreeamd@users.noreply.github.com >
2025-10-21 12:19:02 -04:00
Aryan Salmanpour
24211717d5
Updated libdrm path configuration and libva version requirements for ROCm and TheRock platforms ( #666 )
...
[ROCm/rocdecode commit: c743ca38f6 ]
2025-10-21 08:57:56 -07:00
Aryan Salmanpour
c743ca38f6
Updated libdrm path configuration and libva version requirements for ROCm and TheRock platforms ( #666 )
2025-10-21 08:57:56 -07:00
Aryan Salmanpour
ce2ffe3307
Updated libdrm path configuration and libva version requirements for ROCm and TheRock ( #195 )
...
[ROCm/rocjpeg commit: 261c0fc171 ]
2025-10-21 08:57:22 -07:00
Aryan Salmanpour
261c0fc171
Updated libdrm path configuration and libva version requirements for ROCm and TheRock ( #195 )
2025-10-21 08:57:22 -07:00
Mythreya Kuricheti
65d4ff9d04
[CI][rocprofiler-compute] Fix rhel python deps ( #1370 )
...
Install `python39-devel` dependency for pandas. Fixes build on RHEL 8.10.
2025-10-21 08:28:57 -07:00
Sourabh U Betigeri
7926bd593b
SWDEV-545245 - Adds new test for hipDeviceGetTExture1DLinearMaxWidth ( #763 )
2025-10-21 08:22:17 -07:00
Scott Todd
473025ab7d
Revert "Update rocm-core package to include rdhc script ( #1375 )" ( #1464 )
...
This reverts commit d348797017 .
2025-10-21 07:29:27 -07:00
Giovanni Lenzi Baraldi
dbb48c3e33
Fix for dynamic code object loading in the thread trace sample ( #1386 )
...
* Fix for dynamic code object loading in the thread trace sample
* Review comments
2025-10-21 16:22:26 +02:00
cfallows-amd
c215ace6c3
Update Roofline binaries with improved flops benchmarking ( #1402 )
...
* Update roof bins- rebuild from rocm-amdgpu-bench as of oct15/25
Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com >
* Update CHANGELOG.md
---------
Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-10-21 10:14:43 -04:00
jamessiddeley-amd
a14f0c30ab
fix output ( #1462 )
2025-10-21 09:55:18 -04:00
Fei Zheng
2c59a82fe1
Fix rocprof-compute TUI build err with python 39 ( #303 )
...
* Upgrade min python version from 3.8 to 3.9
* Set min version for textual-fspicker for TUI support
* Update workflows to use python 3.9 instead of 3.8
* fix formatting
* fix bug
---------
Co-authored-by: Vignesh Edithal <Vignesh.Edithal@amd.com >
2025-10-21 00:27:35 -04:00
Flora Cui
a433e6f48d
rocminfo: early detect absence of unsupported GPU in wsl ( #1317 )
...
Add an early check in WSL to detect when no supported GPU is available.
otherwise rocm_agent_enumerator will keep retrying rocminfo for one
minute, causing unnecessary delay.
Signed-off-by: Flora Cui <flora.cui@amd.com >
Co-authored-by: harkgill-amd <harkgill@amd.com >
2025-10-21 11:15:44 +08:00
Jonathan R. Madsen
4cca398b56
[rocprofiler-sdk] Update rocprofiler-sdk CONTRIBUTING.md ( #1371 )
2025-10-20 21:46:24 -05:00
David Galiffi
32f9fa6ca5
Enable some simple ROCpd testing ( #834 )
...
* Add for rocpd testing and output validation
Add for transpose, video-decode, jpeg-decode, roctx, and openmp-target
Add JSON check to pre-commit-config
Co-authored-by: Marjan Antic <Marjan.Antic@amd.com >
* Remove redundant environment variable
* Fix spelling typo
* Fix typo in error message
* Fix memory_allocation query
* Incorperate feedback from review. Handle case where there are multiple matching "name_prefix" tables.
* Fix environment settings in `rocprof-sys-testing.cmake`
Accidently removed in previous refactoring.
* Formatting python file
---------
Co-authored-by: Marjan Antic <Marjan.Antic@amd.com >
2025-10-20 17:40:10 -04:00
Ben Richard
35b07e041f
[rocprof-compute] Run roofline test on GPU 0 by default ( #1390 )
...
* rocprof-compute: Default roofline to GPU 0
Previously was running the roofline test on ALL GPUs but only
selecting the first entry in the roofline.csv. So even in default
ALL case, GPU 0 was selected.
* Update CHANGELOG.MD
* Use better wording in changelog entry
2025-10-20 16:36:55 -04:00
mberenjk
96c62b091d
Add support for additional paths in RCCL DMABUF kernel configuration loading ( #1825 )
...
* Adding more path to the kernel load and an environment variable to force enable DMABUF
---------
Co-authored-by: Marzieh Berenjkoub <mberenjk@amd.com >
[ROCm/rccl commit: b58f234539 ]
2025-10-20 13:35:22 -07:00
mberenjk
b58f234539
Add support for additional paths in RCCL DMABUF kernel configuration loading ( #1825 )
...
* Adding more path to the kernel load and an environment variable to force enable DMABUF
---------
Co-authored-by: Marzieh Berenjkoub <mberenjk@amd.com >
2025-10-20 13:35:22 -07:00
Milan Radosavljevic
48fdcebf62
Add caching of category region for rocpd ( #1420 )
...
* Add caching of category region
Fix vaapi traces
Remove region_with_name
* Applied suggestions from code review
2025-10-20 16:05:14 -04:00
Poag, Charis
01b4fe6614
[SWDEV-535159] Add support for GPU partition metrics ( #490 )
...
[SWDEV-535159] Add support for GPU partition metrics
Changes include:
- Internal logic to smart-switch between gpu_metrics/xcp_metrics files
- [WIP] Initial plumbing for new partition metric API
Change-Id: I4340fb1b48bac0117d80d5d486b9e871430d5cd8
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add amdsmi_get_gpu_partition_metrics_info() + minor cleanup
Change-Id: I5d60604f18baddbd03852dc90e88aa0b8107d50e
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Fix partition metric logic + update logging/tests
Change-Id: I9e89b19ead17694c54e224f8e13ff8ee3eb2e22a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Adjust amd-smi metric/monitor/default to show (some) partition information
Change-Id: I2e8d2745876a19bdaec3c039daa97345c9f701b5
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add C++ tests
Change-Id: Ib9eb0b57a6d7a280992e05a4c6eba632826952ef
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Remove modification of energy counter, not needed
Change-Id: I5c48eaaae248ee6dc79abba609d837ec35d78022
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[CLI] amd-smi metric: cleaned up N/A'd multi-valued to show just N/A
Changes:
1. amd-smi metric: cleaned up N/A'd multi-valued to show just N/A
ex.
JPEG_ACTIVITY: [N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A]
Now just shows: N/A
2. [Python Unit Test] Changed testname TestAmdSmiPythonBDF(unittest.TestCase) ->
AmdSmiPythonUnitTest
Test name was confusing.
Change-Id: Ieb3b036f30002fd22362508eb9fc5d443df395ae
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Log cleanup
Change-Id: I1b1a95f1844d35bec7a7bd8cb996f87e4914c069
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add amd-smi partition-metrics CLI + general cleanup
Change-Id: Ia91488e6cb3a4d62b4087afbddfe0b3bb9378fdc
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[1.3 metrics] Remove forwards compatibility for partition metrics
Change-Id: Iab928983e6f6f1587bc9307f6f3fa2b2696ca6f7
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Fixed violation output not showing % + general cleanup
Change-Id: Icac1b0a55b18c7628b07109ae0c377d17e0825f1
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Clean up amdsmi_get_gpu_partition_metrics_info & amd-smi partition-metric outputs
Change-Id: I6427028b980874641e9ffb3b5d88ad493dbf9cf4
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Fix metrics not found + extra logging/formatting
Change-Id: I841a27bb2c305e97ec7579a13ac915e5be497c3a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Update license to current default
Change-Id: I0de9b8a2d5dbbeab4491097f0354ba17b0d30866
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Cleanup for review
Change-Id: I96ed25c3f2b8968eea1af24c5e5860c2b4e74e6e
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Moderize updated/new interal APIs.
Change-Id: I3c48a250eeb703709b14cb5ffa68268d8321626c
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Remove extra logging in dynamic metrics
Change-Id: Idb97547bcbe143d6fa1cb5cb278ffe4da615ce14
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Remove amd-smi partition-metric command
Change-Id: Ib83c17e5cd7e0da3798198943bddd46c296b411c
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Move new CLI updates to another PR + minor fixes
Change-Id: I3b1163eec12f9b5f7d95ee33de08e168cec1b1fe
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Allow dynamic metrics to work for gpu/xcp metrics 1.9+/1.1+
Updated some logging as well.
Change-Id: I2ed9f5a5ef8afb1520508820ca6153525f0644b4
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Allow dyn gpu/xcp metric v1.9+/v1.1+
Added tests for quick check
Change-Id: I576d6f6582a55afb08e5ac57791ce95e2fa184a2
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Update tests for larger subset of version checks
Change-Id: I3cdf4f8bb4fc6161f4c76566939f90545d0f362a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Fix XCP metrics in gpu/partition metric pre-v1.9/v1.1 (dynamic)
Change-Id: I4dabc1ed6bef6b86c8e7f92bf9cb5992f3966fe2
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
---------
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
2025-10-20 14:43:40 -05:00
Poag, Charis
ce19b921b0
[SWDEV-535159] Add support for GPU partition metrics ( #490 )
...
[SWDEV-535159] Add support for GPU partition metrics
Changes include:
- Internal logic to smart-switch between gpu_metrics/xcp_metrics files
- [WIP] Initial plumbing for new partition metric API
Change-Id: I4340fb1b48bac0117d80d5d486b9e871430d5cd8
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add amdsmi_get_gpu_partition_metrics_info() + minor cleanup
Change-Id: I5d60604f18baddbd03852dc90e88aa0b8107d50e
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Fix partition metric logic + update logging/tests
Change-Id: I9e89b19ead17694c54e224f8e13ff8ee3eb2e22a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Adjust amd-smi metric/monitor/default to show (some) partition information
Change-Id: I2e8d2745876a19bdaec3c039daa97345c9f701b5
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add C++ tests
Change-Id: Ib9eb0b57a6d7a280992e05a4c6eba632826952ef
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Remove modification of energy counter, not needed
Change-Id: I5c48eaaae248ee6dc79abba609d837ec35d78022
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[CLI] amd-smi metric: cleaned up N/A'd multi-valued to show just N/A
Changes:
1. amd-smi metric: cleaned up N/A'd multi-valued to show just N/A
ex.
JPEG_ACTIVITY: [N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A, N/A]
Now just shows: N/A
2. [Python Unit Test] Changed testname TestAmdSmiPythonBDF(unittest.TestCase) ->
AmdSmiPythonUnitTest
Test name was confusing.
Change-Id: Ieb3b036f30002fd22362508eb9fc5d443df395ae
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Log cleanup
Change-Id: I1b1a95f1844d35bec7a7bd8cb996f87e4914c069
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Add amd-smi partition-metrics CLI + general cleanup
Change-Id: Ia91488e6cb3a4d62b4087afbddfe0b3bb9378fdc
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[1.3 metrics] Remove forwards compatibility for partition metrics
Change-Id: Iab928983e6f6f1587bc9307f6f3fa2b2696ca6f7
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Fixed violation output not showing % + general cleanup
Change-Id: Icac1b0a55b18c7628b07109ae0c377d17e0825f1
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Clean up amdsmi_get_gpu_partition_metrics_info & amd-smi partition-metric outputs
Change-Id: I6427028b980874641e9ffb3b5d88ad493dbf9cf4
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Fix metrics not found + extra logging/formatting
Change-Id: I841a27bb2c305e97ec7579a13ac915e5be497c3a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Update license to current default
Change-Id: I0de9b8a2d5dbbeab4491097f0354ba17b0d30866
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Cleanup for review
Change-Id: I96ed25c3f2b8968eea1af24c5e5860c2b4e74e6e
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Moderize updated/new interal APIs.
Change-Id: I3c48a250eeb703709b14cb5ffa68268d8321626c
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Remove extra logging in dynamic metrics
Change-Id: Idb97547bcbe143d6fa1cb5cb278ffe4da615ce14
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Remove amd-smi partition-metric command
Change-Id: Ib83c17e5cd7e0da3798198943bddd46c296b411c
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Move new CLI updates to another PR + minor fixes
Change-Id: I3b1163eec12f9b5f7d95ee33de08e168cec1b1fe
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Allow dynamic metrics to work for gpu/xcp metrics 1.9+/1.1+
Updated some logging as well.
Change-Id: I2ed9f5a5ef8afb1520508820ca6153525f0644b4
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Allow dyn gpu/xcp metric v1.9+/v1.1+
Added tests for quick check
Change-Id: I576d6f6582a55afb08e5ac57791ce95e2fa184a2
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Update tests for larger subset of version checks
Change-Id: I3cdf4f8bb4fc6161f4c76566939f90545d0f362a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
* Fix XCP metrics in gpu/partition metric pre-v1.9/v1.1 (dynamic)
Change-Id: I4dabc1ed6bef6b86c8e7f92bf9cb5992f3966fe2
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
---------
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[ROCm/amdsmi commit: 01b4fe6614 ]
2025-10-20 14:43:40 -05:00
Ajay GunaShekar
4a1a4aa472
SWDEV-560725 - remove final from amd_warp_functions ( #1395 )
...
final word used as a literal over a keyword.
2025-10-20 12:27:59 -07:00
Yiltan
dd92cb2af8
Provide an error when there are no NICs on a system ( #286 )
...
[ROCm/rocshmem commit: b534423de7 ]
2025-10-20 13:07:56 -04:00
Yiltan
b534423de7
Provide an error when there are no NICs on a system ( #286 )
2025-10-20 13:07:56 -04:00
Aurelien Bouteiller
349d7f6ad3
Print an error and quit cleanly if GDA required but could not init ( #284 )
...
[ROCm/rocshmem commit: c44f4ece1f ]
2025-10-20 13:04:13 -04:00
Aurelien Bouteiller
c44f4ece1f
Print an error and quit cleanly if GDA required but could not init ( #284 )
2025-10-20 13:04:13 -04:00
Pengda Xie
29c2ca8cbc
SWDEV-559867 - Fix CU mask printing ( #1328 )
2025-10-20 09:47:17 -07:00
Mythreya Kuricheti
ef1ed44e93
[rocprofiler-sdk] Update codeowner for api-trace.h ( #1974 )
...
Feedback from #1933
[ROCm/rccl commit: 9ae5956ca5 ]
2025-10-20 10:43:42 -06:00
Mythreya Kuricheti
9ae5956ca5
[rocprofiler-sdk] Update codeowner for api-trace.h ( #1974 )
...
Feedback from #1933
2025-10-20 10:43:42 -06:00
Ioannis Assiouras
30a14a8a05
SWDEV-559166 - Fix potential data race in ReferenceCountedObject::release() ( #1388 )
...
Use fetch_sub(std::memory_order_acq_rel) on release
so the destroying thread acquires prior writes.
2025-10-20 17:15:56 +01:00
systems-assistant[bot]
ad3cb435ee
SWDEV-507874 - Psdb perf: reduce test iterations ( #618 )
...
Change-Id: Ia9af40ecfb2ea121388a3c7a932805b49cef7669
Co-authored-by: agunashe <ajay.gunashekar@amd.com >
2025-10-20 09:00:05 -07:00
Nilesh M Negi
ab1bd9d87f
[FORMAT] Add .clang-format for C++ code ( #1404 )
...
[ROCm/rccl commit: 34d469864b ]
2025-10-20 10:54:03 -05:00
Nilesh M Negi
34d469864b
[FORMAT] Add .clang-format for C++ code ( #1404 )
2025-10-20 10:54:03 -05:00
JC
08d93e763e
[CI] Enable ccache w/ namespace for external use ( #1966 )
...
* Enable ccache w/ namespace for external use
* Remove TheRock from setup_tools.py command line
* Bump TheRock commit to use health_status.py
Resolves https://github.com/ROCm/rccl/pull/1966/files/f9d6d76440b88ecf67d08765ee0e9bac00b55b40#r2420734710
* Bump TheRock to older commit with health_status.py
* Add git safe directory for working directory
* Move install python deps
* Remove pip freeze
[ROCm/rccl commit: b1589a5786 ]
2025-10-20 08:44:42 -07:00
JC
b1589a5786
[CI] Enable ccache w/ namespace for external use ( #1966 )
...
* Enable ccache w/ namespace for external use
* Remove TheRock from setup_tools.py command line
* Bump TheRock commit to use health_status.py
Resolves https://github.com/ROCm/rccl/pull/1966/files/c6d2e8ce5c14a2c94bfb47e21d3e2d466f25c9b4#r2420734710
* Bump TheRock to older commit with health_status.py
* Add git safe directory for working directory
* Move install python deps
* Remove pip freeze
2025-10-20 08:44:42 -07:00
Yiltan
92a7904656
Implement rocshmem_pe_quiet() ( #282 )
...
Co-authored-by: Aurelien Bouteiller <aurelien.bouteiller@amd.com >
[ROCm/rocshmem commit: c3eeae473b ]
2025-10-20 11:42:39 -04:00
Yiltan
c3eeae473b
Implement rocshmem_pe_quiet() ( #282 )
...
Co-authored-by: Aurelien Bouteiller <aurelien.bouteiller@amd.com >
2025-10-20 11:42:39 -04:00
Julia Jiang
ee4021d6c5
SWDEV-556212 - Update changelog for HIP 7.1 in develop ( #1326 )
...
* SWDEV-556212 - Update changelog for HIP 7.1 in develop
* Update CHANGELOG.md
* Update CHANGELOG.md
2025-10-20 11:41:58 -04:00
Edgar Gabriel
6bc1cc63ae
update tester for RO ( #281 )
...
update the tester script to only tests the amo functions on RO that are
expected to pass. We can revisit the non-passing tests later, but this
prevents us from having passing CIs at the moment, while RO is simply
lower priority than other asks.
[ROCm/rocshmem commit: 6f74cdfd75 ]
2025-10-20 09:03:17 -05:00
Edgar Gabriel
6f74cdfd75
update tester for RO ( #281 )
...
update the tester script to only tests the amo functions on RO that are
expected to pass. We can revisit the non-passing tests later, but this
prevents us from having passing CIs at the moment, while RO is simply
lower priority than other asks.
2025-10-20 09:03:17 -05:00
Danylo Lytovchenko
61fc256db9
Fix memsetD2XX tests ( #1405 )
...
* Fix memsetD2XX tests
* Remove redundant interpret_cast
2025-10-20 11:55:17 +02:00