Quick start

If RDC is not installed on the system, specify the RDC library path:

$ export LD_LIBRARY_PATH=<rdc_libs_path>

Then run RdcReader from the python_binding folder:

$ python RdcReader.py
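
If RdcReader fails to load the RDC library, it can help to confirm that LD_LIBRARY_PATH actually points at a directory containing it. The following is a minimal sketch; the library file name librdc_bootstrap.so is an assumption, so adjust it to match the library your RDC build provides.

```python
import os
from pathlib import Path

# Assumption: the binding loads a shared library with this file name.
# Change it to whatever your RDC build actually ships.
ASSUMED_LIB_NAME = "librdc_bootstrap.so"


def find_on_library_path(lib_name, env=None):
    """Return the first LD_LIBRARY_PATH directory containing lib_name, or None."""
    env = os.environ if env is None else env
    for entry in env.get("LD_LIBRARY_PATH", "").split(os.pathsep):
        # Skip empty entries produced by leading, trailing or doubled separators.
        if entry and (Path(entry) / lib_name).is_file():
            return Path(entry)
    return None


if __name__ == "__main__":
    location = find_on_library_path(ASSUMED_LIB_NAME)
    if location is None:
        print(f"{ASSUMED_LIB_NAME} not found on LD_LIBRARY_PATH")
    else:
        print(f"{ASSUMED_LIB_NAME} found in {location}")
```

This only inspects LD_LIBRARY_PATH; a library installed in a standard system location would still be found by the dynamic loader even if this check reports nothing.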

Prometheus plugin

Install the prometheus_client package:

$ pip install prometheus_client

Start rdcd with authentication enabled, then run the plugin to connect to it:

$ python rdc_prometheus.py

Check the options of the plugin:

$ python rdc_prometheus.py --help

Verify the plugin is running:

$ curl localhost:5000
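
The same check can be scripted. The sketch below assumes the plugin serves the standard Prometheus text exposition format on port 5000 (the port used in the curl command above); the helper names are illustrative and not part of RDC.

```python
import urllib.request


def metric_names(exposition_text):
    """Collect the distinct metric names from Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and "# HELP" / "# TYPE" comments
        # A sample line is either `name{labels} value` or `name value`.
        name = line.split("{", 1)[0].split(None, 1)[0]
        names.add(name)
    return sorted(names)


def fetch_metric_names(url="http://localhost:5000", timeout=5):
    """Fetch the plugin endpoint and return the metric names it exposes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return metric_names(resp.read().decode("utf-8"))
```

Calling fetch_metric_names() while the plugin is running should return a non-empty list; a connection error suggests the plugin is not listening on that port.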

On the management computer, install Prometheus from https://github.com/prometheus/prometheus

Edit prometheus_targets.json to add the compute nodes running the plugin, then start Prometheus:

$ prometheus --config.file=<full path of the rdc_prometheus_example.yml>
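
For clusters with many compute nodes, the targets file can be generated rather than edited by hand. The sketch below assumes prometheus_targets.json follows Prometheus's file-based service discovery JSON format (a list of objects with a "targets" array and optional "labels") and that each node runs the plugin on port 5000; the host names are hypothetical placeholders.

```python
import json


def build_targets(hosts, port=5000, labels=None):
    """Build a file-based service discovery document for the given hosts."""
    entry = {"targets": [f"{host}:{port}" for host in hosts]}
    if labels:
        entry["labels"] = dict(labels)
    return [entry]


def write_targets(path, hosts, port=5000, labels=None):
    """Write the targets document to path as indented JSON."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(build_targets(hosts, port, labels), fh, indent=2)
        fh.write("\n")


if __name__ == "__main__":
    # Hypothetical compute nodes; replace with your own host names.
    nodes = ["node1.example.com", "node2.example.com"]
    print(json.dumps(build_targets(nodes), indent=2))
```

Compare the generated structure with the prometheus_targets.json shipped in the repository before overwriting it, in case that file uses additional fields.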

On the management computer, browse to localhost:9090 to view the metrics from RDC.