Galantsev, Dmitrii
8afcedfc96
Revert "Fix breaking changes introduced with CPU support"
...
This reverts commit e9ac9e4626e3e45ebdfafb39e251d073091429f1.
[ROCm/rdc commit: c96f5db52c ]
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii
3e8f56c430
Fix breaking changes introduced with CPU support
...
Changes introduced in f0f44d977f
broke RDC if it was compiled without ESMI support, or if esmi driver is
not loaded when RDC is being used.
Change-Id: Id54e1e9002d2e3cf09240081149eed84178700af
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 0aeceefcb3 ]
2025-04-07 14:41:46 -05:00
Yuan, Perry
f0f44d977f
Implement CPU discovery support ( #77 )
...
* Implement CPU discovery support
SWDEV-482949:
enable the CPU model name info support to the RDC, rdci command
can detect GPU and CPU modules at the same time.
It will query the CPU info through the amdsmi interface like below:
1 GPUs found.
-----------------------------------------------------------------
GPU Index Device Information
0 AMD Radeon PRO W7800
=================================================================
1 CPUs found.
-----------------------------------------------------------------
CPU Index Device Information
0 AMD Ryzen Threadripper PRO 7995WX 96-Cores
-----------------------------------------------------------------
Change-Id: Ibc6533c9a61000cd86c45b1bae14c3eb6788c119
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
* CMAKE - Add required version for amdsmi
Change-Id: I341a89351d196ec66cce215a5d1d3953302fcc66
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
---------
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 3bdca8b8b6 ]
2025-03-31 10:58:36 +08:00
Galantsev, Dmitrii
874a7b438f
CMAKE - Fix build types
...
Addresses issue https://github.com/ROCm/rdc/issues/43
Change-Id: I456184358524a6feef4bf83eecb655678c3bc42d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 80ee980cdb ]
2025-03-30 18:54:54 -05:00
Mallya, Ameya Keshava
05d4974836
Added KWS check for amd-mainline ( #140 )
...
[ROCm/rdc commit: 4067831731 ]
2025-03-28 08:23:38 -07:00
Galantsev, Dmitrii
e80760c890
RVS - Add long-running tests
...
Change-Id: Iddeb7f2d4fdcd69d7ac1ae94b2fa128ee3011b1a
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: bdb2367010 ]
2025-03-27 23:42:56 -05:00
Galantsev, Dmitrii
3273e2993b
Profiler - Remove bootstrap link
...
Change-Id: Ieea57515d77c2d521d95568c3bc2660cc829d829
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 58350a8bb8 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
5c1757c48c
Fix diagnostic example and allow building
...
Change-Id: Icc85e8018a11b66d1190fa910151acb79cd17b83
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: ea7ccd0660 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
7ce869c8d6
CMAKE - Add BUILD_INTERFACE include dirs for rdc_bootstrap
...
Change-Id: I93df878b21e245277c7a8d9589102a15c2517f4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 059d015ea4 ]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
bfee4ae9ee
Profiler - Add CPC and CPF metrics
...
Change-Id: I27fd725e9e1868c9afe7624d6e4aafad2a42d47e
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 51de344be7 ]
2025-03-27 19:01:23 -05:00
Pryor, Adam
fe868f6763
[SWDEV-498711] RDC Partition Implementation ( #119 )
...
* [SWDEV-498711] RDC Partition Implementation
Change-Id: Ibfc3709793770537e4c9d36458f34c6b4f461724
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 47692d3ed5 ]
2025-03-27 14:10:11 -05:00
Galantsev, Dmitrii
791fa376e9
Fix amdsmi_get_power_info API
...
This change creates a workaround for a broken C api in amdsmi.
amdsmi_get_power_info API is broken in rocm 6.4.0 (amdsmi 25.2) and is fixed
in rocm 6.4.1 (amdsmi 25.3).
Breaking AMDSMI change:
https://github.com/ROCm/amdsmi/commit/dc4a16da6fb45d581a6e23c78d340172989418a0
Change-Id: Ib45a2702aa722c7735f3ccd1081d8f62e4d34216
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 929041b556 ]
2025-03-19 23:12:45 -05:00
Galantsev, Dmitrii
68c02bda78
RVS - Use config files and make GPU aware
...
Change-Id: I7a5c80ed4e6122d102e494d1ae38b4b7d40c42cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: f5a4402ce5 ]
2025-03-11 15:39:16 -05:00
Galantsev, Dmitrii
122ab5c053
RVS - Disable IET test
...
Change-Id: I015d68735316d2dc6af18d16f972d9f379b76bcf
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 247c8c7d5e ]
2025-03-11 09:51:08 -05:00
Galantsev, Dmitrii
9915ad2a60
CHANGELOG - Add 6.4.0 updates
...
Change-Id: Ia788b1b51d6ef93c5d065c70a31a029d76fdab98
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com >
[ROCm/rdc commit: 6769f64ba0 ]
2025-03-07 20:02:07 -06:00
srawat
e56a809946
Refactor RDC documentation
...
Change-Id: Ieaba84992a8cbd185f4c2d1dc36a175c0429b754
[ROCm/rdc commit: a865793b70 ]
2025-03-07 19:50:08 -06:00
Galantsev, Dmitrii
259b7ac57b
Fix workflow until grpc updates on github
...
Change-Id: Idf3faa9f7991e4a7ecf78dfb13aafe5c6533fa01
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 3ec9d6c2d2 ]
2025-03-07 19:25:53 -06:00
Galantsev, Dmitrii
b48d03515e
Update gRPC to 1.67.1
...
Change-Id: I911878a3aeec8c9234b0e1ac4447364f2ed845cc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 8b249046c0 ]
2025-03-07 18:36:34 -06:00
AL Musaffar, Yazen
b4ef4331db
RDC REST API (Sample code)
...
Please follow the README file
Update README_rdc_rest_api.txt
Update RDC_REST_API.py
Error handling updates
Updates for error handling
Updates
Updates for rdc_field_watch/rdc_field_unwatch and delete query
Updates for rdc_field_watch/rdc_field_unwatch and delete query
SWDEV-479738 [RDC] - Rest API
Delete python_binding/RDC_REST_API.py
new rdc_rest_api.py file for SWDEV-479738 [RDC] - Rest API
[ROCm/rdc commit: cf566ebd31 ]
2025-03-07 20:48:15 +00:00
adapryor
7113c62704
Fix Prometheus counters
...
default to gauage
Change-Id: Ia0428e61f023f10b02b3ebe103870d40c057abe3
Change values in question to gauges
Change-Id: I81c91c880246342a0ad0586f6dbe50b247a01117
fixes
Change-Id: I949438d3d3b511c22649640e082b59a3fb7696e0
Fix info handling
Change-Id: I8091fbfa55ba5a9c21c4569dd40e37fb432924f3
fix default
Change-Id: Ia449fed18730a06a858107e9218dc7b443a681fb
[ROCm/rdc commit: e847f74f78 ]
2025-03-07 20:48:11 +00:00
adapryor
fbeacaff0c
[SWDEV-517396] Align rdc_field with rdc_bootstrap
...
Signed-off-by: adapryor <Adam.pryor@amd.com >
Change-Id: I5e05e25c5980a3141665ae2d13a6ae09207ccb41
[ROCm/rdc commit: 9571dad23d ]
2025-03-04 08:49:28 -06:00
Galantsev, Dmitrii
705d42f0f5
CMAKE - Set fallback version to 0.3.0
...
Change-Id: I2322bdb7d3a8e4f83346ca4f5d24351ad2a4eccc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: d5f8ff0ab0 ]
2025-03-04 08:43:32 -06:00
Li Ma
709b621c48
Modify the error log for MM_ENC_UTIL
...
Signed-off-by: Li Ma <li.ma@amd.com >
Change-Id: I83805fc8ad7003ecd5189c8f940b44edbf0ebd1f
[ROCm/rdc commit: 26ea06bb69 ]
2025-03-04 08:42:22 -06:00
Arif, Maisam
c26abbbe9a
Fixed RDC to work with updated amdsmi_get_power_info() ( #115 )
...
Change-Id: Ic9e7a68ae58f61dbe73fc7d1b17af34152933e71
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
[ROCm/rdc commit: 552f15a1fb ]
2025-02-11 00:51:29 -06:00
Pryor, Adam
c00a9a709d
SWDEV-512736 Fix RDC Policy callback printout ( #114 )
...
Change-Id: I6e018dcb0a6b272812c959649d913e3ba33def40
[ROCm/rdc commit: 93a8ab8915 ]
2025-02-10 08:40:03 -06:00
Williams, Justin
e2a8d40bb1
[SWDEV-479339/SWDEV-498804] Added RDC Dockerfile ( #50 )
...
* [SWDEV-479339/SWDEV-498804] Added RDC Dockerfile
* Updated Dockerfile
[ROCm/rdc commit: e1d3b6b5b8 ]
2025-02-04 12:58:40 -06:00
Justin Williams
1a0e1ff280
Make README.md pretty
...
Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d
[ROCm/rdc commit: f106364fc7 ]
2025-01-31 12:22:45 -06:00
Galantsev, Dmitrii
0bb38058e7
Revert "Dgalants/add auth script location ( #108 )"
...
This reverts commit 2f68fe1efe .
[ROCm/rdc commit: bee9991c4a ]
2025-01-31 12:22:45 -06:00
Pryor, Adam
2f68fe1efe
Dgalants/add auth script location ( #108 )
...
* DOCS: Add authentication scripts location
Change-Id: Ie285d80ea6d9bb8f710998208d0aa7c6db661d02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
* Make README.md pretty (#44 )
Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d
---------
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Williams, Justin <Justin.Williams@amd.com >
[ROCm/rdc commit: a70aa81cfd ]
2025-01-30 12:08:11 -06:00
Galantsev, Dmitrii
283c5570ec
DOCS: Add authentication scripts location ( #96 )
...
Change-Id: Ie285d80ea6d9bb8f710998208d0aa7c6db661d02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 4da277a64e ]
2025-01-30 12:06:18 -06:00
Galantsev, Dmitrii
b4dd8b40ab
CMAKE - Fix ABSL in clang18+ ( #106 )
...
Please see:
- https://github.com/abseil/abseil-cpp/issues/1747
- https://github.com/llvm/llvm-project/issues/102443
When GRPC is compiled with different compiler from RDC - ABI broke.
Possibly because some templates were not instantiated.
Setting '-fclang-abi-compat=17' fixes the issue.
Change-Id: Ic6409cf413c87b135f334e5b03145cb1c63356d4
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: a8d479c147 ]
2025-01-30 10:33:58 -06:00
Pryor, Adam
c5560793e8
SWDEV-500382 fix energy consumed ( #105 )
...
Change-Id: I3f180f34abed763db1287bf01581753534f32828
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: af56e460c4 ]
2025-01-30 09:38:00 -06:00
Galantsev, Dmitrii
d5ce61d95e
CMAKE - Move rdc_options into share/rdc/conf/
...
Change-Id: Ib2e792aef180f0f267d86d68c57b852b2cdc8ea6
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 99d4d77e20 ]
2025-01-24 12:06:05 -06:00
Pryor, Adam
0186fc2481
SWDEV-508477 Eval Flops Percent ( #85 )
...
SWDEV-508477 - Profiler add FP*_PERCENT
Change-Id: Idb6250fe6b7ba3df6fe7d30861e0fbbda7e9bdce
Signed-off-by: adapryor <Adam.pryor@amd.com >
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 6f358ddc9e ]
2025-01-24 10:07:32 -06:00
Galantsev, Dmitrii
3218c2af5c
CMAKE - Rename SMI_*_DIR into AMD_SMI_*_DIR
...
Change-Id: I3b8b852e6b68f1448c8ed5d5e6ea4579c470ff53
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: e033fd4c55 ]
2025-01-23 20:56:00 -06:00
Ma, Li
25853f01dc
Fix Memory Current Bandwidth ( #98 )
...
Adjust the calculation order to ensure accuracy.
Change-Id: Ica10769fa3dba10c67428d09ffd454fc09ed0da8
Signed-off-by: Li Ma <li.ma@amd.com >
[ROCm/rdc commit: 9dce427c69 ]
2025-01-24 10:22:08 +08:00
stali
01990d5121
fix topology issue
...
[ROCm/rdc commit: e36d3fae22 ]
2025-01-24 09:22:42 +08:00
Galantsev, Dmitrii
61ff214b64
Fix workflow for rocprof by specifying GPU_TARGETS
...
Change-Id: I153f9e73471599fbcf68c73ad0ed9f4db7a742ef
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: ef77c0ed92 ]
2025-01-23 18:38:34 -06:00
Galantsev, Dmitrii
710692b8e7
Update workflow to artifacts@v4
...
Change-Id: Ib08a0afc0954ea2eb581425cbf9cf1d7715cebc5
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 9dd58b6907 ]
2025-01-22 14:31:31 -06:00
adapryor
c57e200bdc
SWDEV-500382 fix energy consumed
...
Change-Id: I3f180f34abed763db1287bf01581753534f32828
[ROCm/rdc commit: e8057b1042 ]
2025-01-21 21:49:33 -06:00
adapryor
8286a92fc1
Implementation for RDC_FI_PROF_OCCUPANCY_PER_ACTIVE_CU SWDEV-50895
...
Signed-off-by: adapryor <Adam.pryor@amd.com >
Change-Id: I8da7d9846edabe5629c75f50cd2bb4b23e019a17
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 290b90dc89 ]
2025-01-21 21:49:19 -06:00
stali
7f4e5c85cb
fixed rdc link state print issue
...
[ROCm/rdc commit: b427c07ffe ]
2025-01-22 09:05:49 +08:00
Pryor, Adam
9f1f502d93
SWDEV-510089 Fix rocprof segfaulting on ctrl+c ( #94 )
...
Change-Id: Iaa0f3856bb8fed174cbc935b85739414ecd44758
Signed-off-by: adapryor <Adam.pryor@amd.com >
[ROCm/rdc commit: 0ae4404a09 ]
2025-01-21 10:30:31 -06:00
Mallya, Ameya Keshava
7267692f1b
Fixed Workflow for updated KWS structure
...
[ROCm/rdc commit: 0490b1c925 ]
2025-01-17 08:21:11 -08:00
Mallya, Ameya Keshava
4206250e61
Added KWS check ( #88 )
...
[ROCm/rdc commit: cadbf69b45 ]
2025-01-15 11:11:01 -08:00
limeng12
4f3b114740
[SWDEV-230863] Improve the functionality of RdcSmiHealth module.
...
Memory check:get the threshold of retired page number
EEPROM check:read and verify the checksum
Power/Thermal check: power/thermal throttle status counter
Signed-off-by: Meng Li <li.meng@amd.com >
Change-Id: Id2c751416eb5bf007e6e1da8dc05966a6ba1324e
[ROCm/rdc commit: 016a1d9d39 ]
2025-01-14 08:14:36 +08:00
Galantsev, Dmitrii
78f37c1784
Include assert.h during C compilation ( #4 )
...
Fix for https://github.com/ROCm/ROCm/issues/3997 . When compiling a C program that includes rdc/rdc.h, multiple assertion errors are thrown without this header included.
Change-Id: Ie5b5c1a1a17c8207cf9b1be23b31193e260d5c1a
Co-authored-by: harkgill-amd <harkgill@amd.com >
[ROCm/rdc commit: 83f36f1673 ]
2025-01-10 11:29:15 -05:00
srawat
8a698a7e23
Update LICENSE
...
[ROCm/rdc commit: 0e53160bee ]
2025-01-09 13:12:24 -06:00
Galantsev, Dmitrii
b78295c8f8
RVS - Add IET and PEBB tests
...
Change-Id: Ia032901d74c882e5cbfa5a3164199cd4d571341f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: 5861ec7663 ]
2025-01-08 18:23:13 -06:00
Galantsev, Dmitrii
9d32387925
RVS - Add memory bandwidth test
...
Change-Id: I4c8990170861f6a0f3853615db68634fdaa7a622
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/rdc commit: b058cbecf1 ]
2025-01-08 18:23:13 -06:00