Commit graph

311 commits

Author SHA1 Message Date
Galantsev, Dmitrii 8afcedfc96 Revert "Fix breaking changes introduced with CPU support"
This reverts commit e9ac9e4626e3e45ebdfafb39e251d073091429f1.


[ROCm/rdc commit: c96f5db52c]
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii 3e8f56c430 Fix breaking changes introduced with CPU support
Changes introduced in f0f44d977f
broke RDC when it was compiled without ESMI support, or when the ESMI driver
was not loaded while RDC was in use.

Change-Id: Id54e1e9002d2e3cf09240081149eed84178700af
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 0aeceefcb3]
2025-04-07 14:41:46 -05:00
Yuan, Perry f0f44d977f Implement CPU discovery support (#77)
* Implement CPU discovery support

SWDEV-482949:

Enable CPU model name support in RDC so that the rdci command
can detect GPU and CPU modules at the same time.
It queries the CPU info through the amdsmi interface, as shown below:

1 GPUs found.
-----------------------------------------------------------------
GPU Index        Device Information
0               AMD Radeon PRO W7800
=================================================================
1 CPUs found.
-----------------------------------------------------------------
CPU Index        Device Information
0               AMD Ryzen Threadripper PRO 7995WX 96-Cores
-----------------------------------------------------------------

Change-Id: Ibc6533c9a61000cd86c45b1bae14c3eb6788c119
Signed-off-by: Perry Yuan <perry.yuan@amd.com>

* CMAKE - Add required version for amdsmi

Change-Id: I341a89351d196ec66cce215a5d1d3953302fcc66
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

---------

Signed-off-by: Perry Yuan <perry.yuan@amd.com>
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

[ROCm/rdc commit: 3bdca8b8b6]
2025-03-31 10:58:36 +08:00
Galantsev, Dmitrii 874a7b438f CMAKE - Fix build types
Addresses issue https://github.com/ROCm/rdc/issues/43

Change-Id: I456184358524a6feef4bf83eecb655678c3bc42d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 80ee980cdb]
2025-03-30 18:54:54 -05:00
Mallya, Ameya Keshava 05d4974836 Added KWS check for amd-mainline (#140)
[ROCm/rdc commit: 4067831731]
2025-03-28 08:23:38 -07:00
Galantsev, Dmitrii e80760c890 RVS - Add long-running tests
Change-Id: Iddeb7f2d4fdcd69d7ac1ae94b2fa128ee3011b1a
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: bdb2367010]
2025-03-27 23:42:56 -05:00
Galantsev, Dmitrii 3273e2993b Profiler - Remove bootstrap link
Change-Id: Ieea57515d77c2d521d95568c3bc2660cc829d829
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 58350a8bb8]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii 5c1757c48c Fix diagnostic example and allow building
Change-Id: Icc85e8018a11b66d1190fa910151acb79cd17b83
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: ea7ccd0660]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii 7ce869c8d6 CMAKE - Add BUILD_INTERFACE include dirs for rdc_bootstrap
Change-Id: I93df878b21e245277c7a8d9589102a15c2517f4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 059d015ea4]
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii bfee4ae9ee Profiler - Add CPC and CPF metrics
Change-Id: I27fd725e9e1868c9afe7624d6e4aafad2a42d47e
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 51de344be7]
2025-03-27 19:01:23 -05:00
Pryor, Adam fe868f6763 [SWDEV-498711] RDC Partition Implementation (#119)
* [SWDEV-498711] RDC Partition Implementation

Change-Id: Ibfc3709793770537e4c9d36458f34c6b4f461724
Signed-off-by: adapryor <Adam.pryor@amd.com>

[ROCm/rdc commit: 47692d3ed5]
2025-03-27 14:10:11 -05:00
Galantsev, Dmitrii 791fa376e9 Fix amdsmi_get_power_info API
This change creates a workaround for a broken C API in amdsmi.

The amdsmi_get_power_info API is broken in ROCm 6.4.0 (amdsmi 25.2) and is fixed
in ROCm 6.4.1 (amdsmi 25.3).

Breaking AMDSMI change:
https://github.com/ROCm/amdsmi/commit/dc4a16da6fb45d581a6e23c78d340172989418a0

Change-Id: Ib45a2702aa722c7735f3ccd1081d8f62e4d34216
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 929041b556]
2025-03-19 23:12:45 -05:00
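
Note on 791fa376e9: the version boundary stated in that message (broken in amdsmi 25.2, fixed in 25.3) can be sketched as a small check. This is a minimal Python sketch and not RDC code; `parse_version` and `needs_power_info_workaround` are hypothetical names, and treating every 25.2.x release as affected is an assumption drawn only from the commit message.

```python
def parse_version(text):
    """Turn a dotted version string such as '25.2' or '25.3.0' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def needs_power_info_workaround(amdsmi_version):
    """Return True for the amdsmi release line the commit message names as broken.

    Assumption: only the 25.2 line is affected; 25.3 and later carry the fix,
    and earlier releases predate the breaking change.
    """
    return parse_version(amdsmi_version)[:2] == (25, 2)


if __name__ == "__main__":
    for version in ("25.1", "25.2", "25.3"):
        print(version, needs_power_info_workaround(version))
```

How RDC itself detects the affected release (for example at build time rather than at run time) is not stated in the log, so the check above only illustrates the boundary.
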
Galantsev, Dmitrii 68c02bda78 RVS - Use config files and make GPU aware
Change-Id: I7a5c80ed4e6122d102e494d1ae38b4b7d40c42cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: f5a4402ce5]
2025-03-11 15:39:16 -05:00
Galantsev, Dmitrii 122ab5c053 RVS - Disable IET test
Change-Id: I015d68735316d2dc6af18d16f972d9f379b76bcf
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 247c8c7d5e]
2025-03-11 09:51:08 -05:00
Galantsev, Dmitrii 9915ad2a60 CHANGELOG - Add 6.4.0 updates
Change-Id: Ia788b1b51d6ef93c5d065c70a31a029d76fdab98
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>


[ROCm/rdc commit: 6769f64ba0]
2025-03-07 20:02:07 -06:00
srawat e56a809946 Refactor RDC documentation
Change-Id: Ieaba84992a8cbd185f4c2d1dc36a175c0429b754


[ROCm/rdc commit: a865793b70]
2025-03-07 19:50:08 -06:00
Galantsev, Dmitrii 259b7ac57b Fix workflow until grpc updates on github
Change-Id: Idf3faa9f7991e4a7ecf78dfb13aafe5c6533fa01
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 3ec9d6c2d2]
2025-03-07 19:25:53 -06:00
Galantsev, Dmitrii b48d03515e Update gRPC to 1.67.1
Change-Id: I911878a3aeec8c9234b0e1ac4447364f2ed845cc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 8b249046c0]
2025-03-07 18:36:34 -06:00
AL Musaffar, Yazen b4ef4331db RDC REST API (Sample code)
Please follow the README file

Update README_rdc_rest_api.txt

Update RDC_REST_API.py

Error handling updates

Updates for error handling

Updates

Updates for rdc_field_watch/rdc_field_unwatch and delete query

Updates for rdc_field_watch/rdc_field_unwatch and delete query

SWDEV-479738 [RDC] - Rest API

Delete python_binding/RDC_REST_API.py

new rdc_rest_api.py file for SWDEV-479738 [RDC] - Rest API


[ROCm/rdc commit: cf566ebd31]
2025-03-07 20:48:15 +00:00
adapryor 7113c62704 Fix Prometheus counters
default to gauge

Change-Id: Ia0428e61f023f10b02b3ebe103870d40c057abe3

Change values in question to gauges

Change-Id: I81c91c880246342a0ad0586f6dbe50b247a01117

fixes

Change-Id: I949438d3d3b511c22649640e082b59a3fb7696e0

Fix info handling

Change-Id: I8091fbfa55ba5a9c21c4569dd40e37fb432924f3

fix default

Change-Id: Ia449fed18730a06a858107e9218dc7b443a681fb


[ROCm/rdc commit: e847f74f78]
2025-03-07 20:48:11 +00:00
adapryor fbeacaff0c [SWDEV-517396] Align rdc_field with rdc_bootstrap
Signed-off-by: adapryor <Adam.pryor@amd.com>
Change-Id: I5e05e25c5980a3141665ae2d13a6ae09207ccb41


[ROCm/rdc commit: 9571dad23d]
2025-03-04 08:49:28 -06:00
Galantsev, Dmitrii 705d42f0f5 CMAKE - Set fallback version to 0.3.0
Change-Id: I2322bdb7d3a8e4f83346ca4f5d24351ad2a4eccc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: d5f8ff0ab0]
2025-03-04 08:43:32 -06:00
Li Ma 709b621c48 Modify the error log for MM_ENC_UTIL
Signed-off-by: Li Ma <li.ma@amd.com>
Change-Id: I83805fc8ad7003ecd5189c8f940b44edbf0ebd1f


[ROCm/rdc commit: 26ea06bb69]
2025-03-04 08:42:22 -06:00
Arif, Maisam c26abbbe9a Fixed RDC to work with updated amdsmi_get_power_info() (#115)
Change-Id: Ic9e7a68ae58f61dbe73fc7d1b17af34152933e71

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/rdc commit: 552f15a1fb]
2025-02-11 00:51:29 -06:00
Pryor, Adam c00a9a709d SWDEV-512736 Fix RDC Policy callback printout (#114)
Change-Id: I6e018dcb0a6b272812c959649d913e3ba33def40

[ROCm/rdc commit: 93a8ab8915]
2025-02-10 08:40:03 -06:00
Williams, Justin e2a8d40bb1 [SWDEV-479339/SWDEV-498804] Added RDC Dockerfile (#50)
* [SWDEV-479339/SWDEV-498804] Added RDC Dockerfile

* Updated Dockerfile

[ROCm/rdc commit: e1d3b6b5b8]
2025-02-04 12:58:40 -06:00
Justin Williams 1a0e1ff280 Make README.md pretty
Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d


[ROCm/rdc commit: f106364fc7]
2025-01-31 12:22:45 -06:00
Galantsev, Dmitrii 0bb38058e7 Revert "Dgalants/add auth script location (#108)"
This reverts commit 2f68fe1efe.


[ROCm/rdc commit: bee9991c4a]
2025-01-31 12:22:45 -06:00
Pryor, Adam 2f68fe1efe Dgalants/add auth script location (#108)
* DOCS: Add authentication scripts location

Change-Id: Ie285d80ea6d9bb8f710998208d0aa7c6db661d02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

* Make README.md pretty (#44)

Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d

---------

Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
Co-authored-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/rdc commit: a70aa81cfd]
2025-01-30 12:08:11 -06:00
Galantsev, Dmitrii 283c5570ec DOCS: Add authentication scripts location (#96)
Change-Id: Ie285d80ea6d9bb8f710998208d0aa7c6db661d02

Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

[ROCm/rdc commit: 4da277a64e]
2025-01-30 12:06:18 -06:00
Galantsev, Dmitrii b4dd8b40ab CMAKE - Fix ABSL in clang18+ (#106)
Please see:
- https://github.com/abseil/abseil-cpp/issues/1747
- https://github.com/llvm/llvm-project/issues/102443

When gRPC is compiled with a different compiler from RDC, the ABI breaks,
possibly because some templates are not instantiated.
Setting '-fclang-abi-compat=17' fixes the issue.

Change-Id: Ic6409cf413c87b135f334e5b03145cb1c63356d4

Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

[ROCm/rdc commit: a8d479c147]
2025-01-30 10:33:58 -06:00
Pryor, Adam c5560793e8 SWDEV-500382 fix energy consumed (#105)
Change-Id: I3f180f34abed763db1287bf01581753534f32828

Signed-off-by: adapryor <Adam.pryor@amd.com>

[ROCm/rdc commit: af56e460c4]
2025-01-30 09:38:00 -06:00
Galantsev, Dmitrii d5ce61d95e CMAKE - Move rdc_options into share/rdc/conf/
Change-Id: Ib2e792aef180f0f267d86d68c57b852b2cdc8ea6
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 99d4d77e20]
2025-01-24 12:06:05 -06:00
Pryor, Adam 0186fc2481 SWDEV-508477 Eval Flops Percent (#85)
SWDEV-508477 - Profiler add FP*_PERCENT

Change-Id: Idb6250fe6b7ba3df6fe7d30861e0fbbda7e9bdce

Signed-off-by: adapryor <Adam.pryor@amd.com>
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>

[ROCm/rdc commit: 6f358ddc9e]
2025-01-24 10:07:32 -06:00
Galantsev, Dmitrii 3218c2af5c CMAKE - Rename SMI_*_DIR into AMD_SMI_*_DIR
Change-Id: I3b8b852e6b68f1448c8ed5d5e6ea4579c470ff53
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: e033fd4c55]
2025-01-23 20:56:00 -06:00
Ma, Li 25853f01dc Fix Memory Current Bandwidth (#98)
Adjust the calculation order to ensure accuracy.


Change-Id: Ica10769fa3dba10c67428d09ffd454fc09ed0da8

Signed-off-by: Li Ma <li.ma@amd.com>

[ROCm/rdc commit: 9dce427c69]
2025-01-24 10:22:08 +08:00
stali 01990d5121 fix topology issue
[ROCm/rdc commit: e36d3fae22]
2025-01-24 09:22:42 +08:00
Galantsev, Dmitrii 61ff214b64 Fix workflow for rocprof by specifying GPU_TARGETS
Change-Id: I153f9e73471599fbcf68c73ad0ed9f4db7a742ef
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: ef77c0ed92]
2025-01-23 18:38:34 -06:00
Galantsev, Dmitrii 710692b8e7 Update workflow to artifacts@v4
Change-Id: Ib08a0afc0954ea2eb581425cbf9cf1d7715cebc5
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 9dd58b6907]
2025-01-22 14:31:31 -06:00
adapryor c57e200bdc SWDEV-500382 fix energy consumed
Change-Id: I3f180f34abed763db1287bf01581753534f32828


[ROCm/rdc commit: e8057b1042]
2025-01-21 21:49:33 -06:00
adapryor 8286a92fc1 Implementation for RDC_FI_PROF_OCCUPANCY_PER_ACTIVE_CU SWDEV-50895
Signed-off-by: adapryor <Adam.pryor@amd.com>
Change-Id: I8da7d9846edabe5629c75f50cd2bb4b23e019a17
Signed-off-by: adapryor <Adam.pryor@amd.com>


[ROCm/rdc commit: 290b90dc89]
2025-01-21 21:49:19 -06:00
stali 7f4e5c85cb fixed rdc link state print issue
[ROCm/rdc commit: b427c07ffe]
2025-01-22 09:05:49 +08:00
Pryor, Adam 9f1f502d93 SWDEV-510089 Fix rocprof segfaulting on ctrl+c (#94)
Change-Id: Iaa0f3856bb8fed174cbc935b85739414ecd44758

Signed-off-by: adapryor <Adam.pryor@amd.com>

[ROCm/rdc commit: 0ae4404a09]
2025-01-21 10:30:31 -06:00
Mallya, Ameya Keshava 7267692f1b Fixed Workflow for updated KWS structure
[ROCm/rdc commit: 0490b1c925]
2025-01-17 08:21:11 -08:00
Mallya, Ameya Keshava 4206250e61 Added KWS check (#88)
[ROCm/rdc commit: cadbf69b45]
2025-01-15 11:11:01 -08:00
limeng12 4f3b114740 [SWDEV-230863] Improve the functionality of RdcSmiHealth module.
Memory check: get the threshold of retired page number
EEPROM check: read and verify the checksum
Power/Thermal check: power/thermal throttle status counter

Signed-off-by: Meng Li <li.meng@amd.com>
Change-Id: Id2c751416eb5bf007e6e1da8dc05966a6ba1324e


[ROCm/rdc commit: 016a1d9d39]
2025-01-14 08:14:36 +08:00
Galantsev, Dmitrii 78f37c1784 Include assert.h during C compilation (#4)
Fix for https://github.com/ROCm/ROCm/issues/3997. When compiling a C program that includes rdc/rdc.h, multiple assertion errors are thrown without this header included.

Change-Id: Ie5b5c1a1a17c8207cf9b1be23b31193e260d5c1a

Co-authored-by: harkgill-amd <harkgill@amd.com>

[ROCm/rdc commit: 83f36f1673]
2025-01-10 11:29:15 -05:00
srawat 8a698a7e23 Update LICENSE
[ROCm/rdc commit: 0e53160bee]
2025-01-09 13:12:24 -06:00
Galantsev, Dmitrii b78295c8f8 RVS - Add IET and PEBB tests
Change-Id: Ia032901d74c882e5cbfa5a3164199cd4d571341f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: 5861ec7663]
2025-01-08 18:23:13 -06:00
Galantsev, Dmitrii 9d32387925 RVS - Add memory bandwidth test
Change-Id: I4c8990170861f6a0f3853615db68634fdaa7a622
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/rdc commit: b058cbecf1]
2025-01-08 18:23:13 -06:00