Pryor, Adam
ec661d5d17
[SWDEV-243250] RDC Process Start/Stop integration ( #189 )
...
Change-Id: I3d2be33b5d23cd259b3d06fb572f81d19e6c3798
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 0e9c3b2c4f ]
2025-06-02 14:42:21 -05:00
dependabot[bot]
30397e77f3
Bump rocm-docs-core[api-reference] from 1.18.1 to 1.20.0 in /docs/sphinx
...
Bumps [rocm-docs-core[api-reference]](https://github.com/ROCm/rocm-docs-core ) from 1.18.1 to 1.20.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases )
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md )
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.18.1...v1.20.0 )
---
updated-dependencies:
- dependency-name: rocm-docs-core[api-reference]
dependency-version: 1.20.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
[ROCm/rdc commit: ae6b1aa6e6 ]
2025-06-02 13:33:21 -05:00
alexxu-amd
a0ead071a3
Fix typo in rdc.h
...
There's a typo in rdc.h causing documentation build failure.
Change-Id: I3a7ced030e66b980645f719b41c77f79810de09d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: efa66d688e ]
2025-05-27 16:15:58 -05:00
Galantsev, Dmitrii
ff8704cf76
RDCI - Fix misaligned fields
...
Change-Id: I7914c01b82e7e2fb5c63521d6d4803570447790c
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 7b06b778b9 ]
2025-05-21 19:11:17 -05:00
Galantsev, Dmitrii
0d352c515e
Profiler - Align SMI and Profiler indices
...
Change-Id: If2bb850ffd1c1b8b16a8f5963a0f6971f82d4863
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: eff955fdf7 ]
2025-05-21 19:11:17 -05:00
srawat
71c654b0ee
Update install.rst
...
[ROCm/rdc commit: 3357346df7 ]
2025-05-14 11:44:05 -05:00
Galantsev, Dmitrii
f4e611193b
CI - Fix builds
...
Change-Id: I0d268ed2aee5c595f2a23e779000122e57165f9d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: e15fdf1fbc ]
2025-05-13 17:59:18 -05:00
Hila, Nino
13c6ea75a7
Add palamida.yml
...
[ROCm/rdc commit: 8c536c9c8d ]
2025-05-12 21:43:56 -07:00
adapryor
0702a6a5a2
Profiler - Fix SIMD Utilization
...
Change-Id: I6775cce9901a714d20e80c8c17e7a563edeb48a4
[ROCm/rdc commit: 33924ea79e ]
2025-05-07 00:56:52 -05:00
Galantsev, Dmitrii
1e8bc4dc96
CMAKE - Format with cmake-format
...
Change-Id: I08e71fc5060b1f6e0168225cc5fe66886c2044bd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: fa8b89f4ae ]
2025-05-06 17:28:14 -05:00
Galantsev, Dmitrii
a4e9002fc1
CMAKE - Add cmake-format
...
Change-Id: I4036859491934ed26303530d0dc1afb4f1b0d0cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: f89beb90f5 ]
2025-05-06 17:28:14 -05:00
Galantsev, Dmitrii
b6488d150d
Profiler - Add SIMD_UTILIZATION ( #171 )
...
Change-Id: I19d5acd80dbed8c4fc4e1c85eec71ca89398d299
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 02c0786a2c ]
2025-05-06 13:20:03 -07:00
Rawat, Swati
0519d1bee7
RDC Doc formatting ( #166 )
...
* doc formatting
* Update job_stats_sample.rst
* Doc formatting
---------
Co-authored-by: srawat <120587655+SwRaw@users.noreply.github.com >
[ROCm/rdc commit: 3e653b7ab3 ]
2025-05-05 13:08:33 -05:00
Rawat, Swati
f9ceb0e6b9
fix broken link ( #169 )
...
Update job_stats_sample.rst
Co-authored-by: srawat <120587655+SwRaw@users.noreply.github.com >
[ROCm/rdc commit: 4a230f0180 ]
2025-05-01 10:49:31 -05:00
Pryor, Adam
2cb7903b06
[SWDEV-523349/SWDEV-527257] Fix Rdci Config ( #161 )
...
Change-Id: Iae21ea8061205f186086a3ed59c6259ddeb1dbe7
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 2db6ddea69 ]
2025-04-28 11:57:51 -05:00
Peter Park
7bbdffc323
remove comments in Doxyfile referring to wikipedia.org for IAS check
...
[ROCm/rdc commit: 9edacdeac4 ]
2025-04-23 17:25:21 -05:00
Peter Park
138ef967e3
Update doxyfile
...
[ROCm/rdc commit: 0484dbed94 ]
2025-04-23 17:25:21 -05:00
Peter Park
9e3ec5e48a
bump rocm-docs-core to 1.18.1
...
[ROCm/rdc commit: 712657b24e ]
2025-04-23 17:25:21 -05:00
Peter Park
6c31e1bb4a
update sphinx/conf.py
...
[ROCm/rdc commit: c3aafd846d ]
2025-04-23 17:25:21 -05:00
Peter Park
e3740bfc8e
bump rocm-docs-core to 1.17.1
...
[ROCm/rdc commit: aac1de1e76 ]
2025-04-23 17:25:21 -05:00
Hila, Nino
8095ce6cee
Add palamida.yml
...
[ROCm/rdc commit: 6a1c7d8e43 ]
2025-04-22 11:06:25 -05:00
Bill(Shuzhou) Liu
2268451188
Add license file
...
Add license files which are missing.
[ROCm/rdc commit: 855d185532 ]
2025-04-16 11:06:31 -04:00
Galantsev, Dmitrii
d8db0889d0
CI - Add cherrypick labels automatically
...
Change-Id: Icbd0c70c9cbee2b119e7e74d6cdfe83e93a83df9
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 5efdcc23fc ]
2025-04-15 18:44:37 -05:00
Galantsev, Dmitrii
375ab5eace
Add RDC_FI_GPU_BUSY_PERCENT
...
AMDSMI needs to merge first and bump the version to at least 24.4.2
Change-Id: I30149bb78c79ebc3de0dabdc8e63fcef12b2f406
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: a5cb334f8b ]
2025-04-15 17:00:56 -05:00
Galantsev, Dmitrii
e15c5a15fa
CMAKE - Bump version to 1.1.0
...
Change-Id: I0fbc0f6d842c034ad858f30fa6418afd01e11a4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: ac50573e67 ]
2025-04-11 17:27:27 -05:00
Galantsev, Dmitrii
0a05e0db08
Profiler - Remove buffer to fix memory leaks
...
Change-Id: Ia3717ccfc147221557f5469965c2abb76b3f451c
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: dfae9cd37f ]
2025-04-11 17:27:27 -05:00
Pryor, Adam
9d25978a3f
[SWDEV-515192] Fix rdc topo ( #146 )
...
Change-Id: I64a8077a56e2eaf99735fafb1010d869a1fdb0c3
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 58811fecbb ]
2025-04-10 17:46:08 -05:00
Galantsev, Dmitrii
d87fe5bada
Profiler - Fix eval fields
...
The 'value' pointer was being written to a lot and then used for reading
within the same function. This likely caused issues all over RDC when
reading the metrics.
This commit changes it so *value is written to only once.
Change-Id: I83c158c1e46c6ce46ff87d8a2e769f26ffa8c0da
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 91be467cad ]
2025-04-09 20:06:21 -05:00
Galantsev, Dmitrii
5276903800
Revert "Implement CPU discovery support"
...
This reverts commit f967f8a17d15e148464393fcd145af01dc0e1525.
[ROCm/rdc commit: 24024f0e4f ]
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii
8afcedfc96
Revert "Fix breaking changes introduced with CPU support"
...
This reverts commit e9ac9e4626e3e45ebdfafb39e251d073091429f1.
[ROCm/rdc commit: c96f5db52c ]
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii
3e8f56c430
Fix breaking changes introduced with CPU support
...
Changes introduced in f0f44d977f
broke RDC if it was compiled without ESMI support, or if esmi driver is
not loaded when RDC is being used.
Change-Id: Id54e1e9002d2e3cf09240081149eed84178700af
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 0aeceefcb3 ]
2025-04-07 14:41:46 -05:00
Yuan, Perry
f0f44d977f
Implement CPU discovery support ( #77 )
...
* Implement CPU discovery support
SWDEV-482949:
enable the CPU model name info support to the RDC, rdci command
can detect GPU and CPU modules at the same time.
It will query the CPU info through the amdsmi interface like below:
1 GPUs found.
-----------------------------------------------------------------
GPU Index Device Information
0 AMD Radeon PRO W7800
=================================================================
1 CPUs found.
-----------------------------------------------------------------
CPU Index Device Information
0 AMD Ryzen Threadripper PRO 7995WX 96-Cores
-----------------------------------------------------------------
Change-Id: Ibc6533c9a61000cd86c45b1bae14c3eb6788c119
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
* CMAKE - Add required version for amdsmi
Change-Id: I341a89351d196ec66cce215a5d1d3953302fcc66
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
---------
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 3bdca8b8b6 ]
2025-03-31 10:58:36 +08:00
Galantsev, Dmitrii
874a7b438f
CMAKE - Fix build types
...
Addresses issue https://github.com/ROCm/rdc/issues/43
Change-Id: I456184358524a6feef4bf83eecb655678c3bc42d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 80ee980cdb ]
2025-03-30 18:54:54 -05:00
Mallya, Ameya Keshava
05d4974836
Added KWS check for amd-mainline ( #140 )
...
[ROCm/rdc commit: 4067831731 ]
2025-03-28 08:23:38 -07:00
Galantsev, Dmitrii
e80760c890
RVS - Add long-running tests
...
Change-Id: Iddeb7f2d4fdcd69d7ac1ae94b2fa128ee3011b1a
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: bdb2367010 ]
2025-03-27 23:42:56 -05:00
Galantsev, Dmitrii
3273e2993b
Profiler - Remove bootstrap link
...
Change-Id: Ieea57515d77c2d521d95568c3bc2660cc829d829
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 58350a8bb8 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
5c1757c48c
Fix diagnostic example and allow building
...
Change-Id: Icc85e8018a11b66d1190fa910151acb79cd17b83
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: ea7ccd0660 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
7ce869c8d6
CMAKE - Add BUILD_INTERFACE include dirs for rdc_bootstrap
...
Change-Id: I93df878b21e245277c7a8d9589102a15c2517f4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 059d015ea4 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
bfee4ae9ee
Profiler - Add CPC and CPF metrics
...
Change-Id: I27fd725e9e1868c9afe7624d6e4aafad2a42d47e
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 51de344be7 ]
2025-03-27 19:01:23 -05:00
Pryor, Adam
fe868f6763
[SWDEV-498711] RDC Partition Implementation ( #119 )
...
* [SWDEV-498711] RDC Partition Implementation
Change-Id: Ibfc3709793770537e4c9d36458f34c6b4f461724
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 47692d3ed5 ]
2025-03-27 14:10:11 -05:00
Galantsev, Dmitrii
791fa376e9
Fix amdsmi_get_power_info API
...
This change creates a workaround for a broken C api in amdsmi.
amdsmi_get_power_info API is broken in rocm 6.4.0 (amdsmi 25.2) and is fixed
in rocm 6.4.1 (amdsmi 25.3).
Breaking AMDSMI change:
https://github.com/ROCm/amdsmi/commit/dc4a16da6fb45d581a6e23c78d340172989418a0
Change-Id: Ib45a2702aa722c7735f3ccd1081d8f62e4d34216
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 929041b556 ]
2025-03-19 23:12:45 -05:00
Galantsev, Dmitrii
68c02bda78
RVS - Use config files and make GPU aware
...
Change-Id: I7a5c80ed4e6122d102e494d1ae38b4b7d40c42cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: f5a4402ce5 ]
2025-03-11 15:39:16 -05:00
Galantsev, Dmitrii
122ab5c053
RVS - Disable IET test
...
Change-Id: I015d68735316d2dc6af18d16f972d9f379b76bcf
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 247c8c7d5e ]
2025-03-11 09:51:08 -05:00
Galantsev, Dmitrii
9915ad2a60
CHANGELOG - Add 6.4.0 updates
...
Change-Id: Ia788b1b51d6ef93c5d065c70a31a029d76fdab98
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com >
[ROCm/rdc commit: 6769f64ba0 ]
2025-03-07 20:02:07 -06:00
srawat
e56a809946
Refactor RDC documentation
...
Change-Id: Ieaba84992a8cbd185f4c2d1dc36a175c0429b754
[ROCm/rdc commit: a865793b70 ]
2025-03-07 19:50:08 -06:00
Galantsev, Dmitrii
259b7ac57b
Fix workflow until grpc updates on github
...
Change-Id: Idf3faa9f7991e4a7ecf78dfb13aafe5c6533fa01
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 3ec9d6c2d2 ]
2025-03-07 19:25:53 -06:00
Galantsev, Dmitrii
b48d03515e
Update gRPC to 1.67.1
...
Change-Id: I911878a3aeec8c9234b0e1ac4447364f2ed845cc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 8b249046c0 ]
2025-03-07 18:36:34 -06:00
AL Musaffar, Yazen
b4ef4331db
RDC REST API (Sample code)
...
Please follow the README file
Update README_rdc_rest_api.txt
Update RDC_REST_API.py
Error handling updates
Updates for error handling
Updates
Updates for rdc_field_watch/rdc_field_unwatch and delete query
Updates for rdc_field_watch/rdc_field_unwatch and delete query
SWDEV-479738 [RDC] - Rest API
Delete python_binding/RDC_REST_API.py
new rdc_rest_api.py file for SWDEV-479738 [RDC] - Rest API
[ROCm/rdc commit: cf566ebd31 ]
2025-03-07 20:48:15 +00:00
adapryor
7113c62704
Fix Prometheus counters
...
default to gauage
Change-Id: Ia0428e61f023f10b02b3ebe103870d40c057abe3
Change values in question to gauges
Change-Id: I81c91c880246342a0ad0586f6dbe50b247a01117
fixes
Change-Id: I949438d3d3b511c22649640e082b59a3fb7696e0
Fix info handling
Change-Id: I8091fbfa55ba5a9c21c4569dd40e37fb432924f3
fix default
Change-Id: Ia449fed18730a06a858107e9218dc7b443a681fb
[ROCm/rdc commit: e847f74f78 ]
2025-03-07 20:48:11 +00:00
adapryor
fbeacaff0c
[SWDEV-517396] Align rdc_field with rdc_bootstrap
...
Signed-off-by: adapryor <Adam.pryor@amd.com >
Change-Id: I5e05e25c5980a3141665ae2d13a6ae09207ccb41
[ROCm/rdc commit: 9571dad23d ]
2025-03-04 08:49:28 -06:00