rocm-systems/projects/rdc/python_binding

Quick start

If RDC is not installed on the system, specify the RDC library path:

$ export LD_LIBRARY_PATH=<rdc_libs_path>
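
Before running the reader, it can help to confirm that the exported path actually contains the RDC libraries. The following is a minimal sketch using only the Python standard library. The `librdc` file-name prefix and the helper name `find_shared_library` are assumptions made for illustration; adjust the prefix to match the files in your `<rdc_libs_path>`.

```python
import os
from pathlib import Path


def find_shared_library(name_prefix, search_path):
    """Return files whose names start with name_prefix in the given search path.

    search_path is a string of directories joined by os.pathsep (':' on Linux),
    the same layout as LD_LIBRARY_PATH. Empty or missing entries are skipped.
    """
    matches = []
    for directory in search_path.split(os.pathsep):
        if not directory:
            continue
        candidate = Path(directory)
        if candidate.is_dir():
            matches.extend(
                sorted(p for p in candidate.iterdir() if p.name.startswith(name_prefix))
            )
    return matches


# "librdc" is an assumed prefix, not a name confirmed by this README.
found = find_shared_library("librdc", os.environ.get("LD_LIBRARY_PATH", ""))
print(found if found else "no librdc* files found on LD_LIBRARY_PATH")
```

An empty result suggests the export did not take effect in the current shell or points at the wrong directory.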

Then run RdcReader from the python_binding folder:

$ python RdcReader.py

Prometheus plugin

Install the prometheus_client package:

$ pip install prometheus_client

Start rdcd with authentication enabled, then run the plugin to connect to it:

$ python rdc_prometheus.py

Check the options of the plugin:

$ python rdc_prometheus.py --help

Verify the plugin is running:

$ curl localhost:5000
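
The same check can be scripted. The sketch below fetches the plugin's output and parses it as Prometheus text exposition format, which is what prometheus_client serves. It assumes the plugin listens on port 5000 (taken from the curl command above) and that sample lines carry no trailing timestamp. The helper names and the `demo_metric` name in the usage note are made up for illustration and are not actual RDC metric names.

```python
from urllib.request import urlopen


def fetch_metrics_text(url="http://localhost:5000", timeout=5.0):
    """Download the raw exposition text from the plugin endpoint."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def parse_samples(text):
    """Return (series, value) pairs from Prometheus text exposition format.

    Comment lines (# HELP, # TYPE) and blank lines are skipped. Each sample
    line is split at its last space, so this assumes no trailing timestamp.
    """
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        series, _, value = line.rpartition(" ")
        samples.append((series, float(value)))
    return samples
```

With the plugin running, `parse_samples(fetch_metrics_text())` returns one pair per exported series, for example a line such as `demo_metric{gpu_index="0"} 45.0` becomes `('demo_metric{gpu_index="0"}', 45.0)`.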

On the management computer, install Prometheus from https://github.com/prometheus/prometheus

Edit prometheus_targets.json to add the compute nodes running the plugin, then start Prometheus:

$ prometheus --config.file=<full path of the rdc_prometheus_example.yml>
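
For clusters with many nodes, the targets file can be generated rather than edited by hand. The sketch below assumes prometheus_targets.json follows Prometheus's file-based service discovery format (a JSON list of objects with a `targets` list and optional `labels`); check the shipped file before overwriting it. The host names are placeholders, and port 5000 is taken from the curl check above.

```python
import json


def build_file_sd_targets(hosts, port=5000, labels=None):
    """Build a file_sd-style target list: one group of host:port strings."""
    entry = {"targets": [f"{host}:{port}" for host in hosts]}
    if labels:
        entry["labels"] = dict(labels)
    return [entry]


# Placeholder host names; replace with the compute nodes running the plugin.
targets = build_file_sd_targets(["node1.example.com", "node2.example.com"])
print(json.dumps(targets, indent=2))
```

Redirecting the printed JSON into prometheus_targets.json would replace its contents, so keep a copy of the original.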

On the management computer, browse to localhost:9090 to view the metrics from RDC.
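
Metrics can also be read programmatically through the Prometheus HTTP API instead of the web UI. This is a minimal sketch, assuming Prometheus is reachable at localhost:9090 and the query returns an instant vector. The helper names are illustrative; `up` is Prometheus's built-in per-target health series, used here because this README does not list the RDC metric names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_query_url(expr, base="http://localhost:9090"):
    """Return the instant-query URL for a PromQL expression."""
    return f"{base}/api/v1/query?{urlencode({'query': expr})}"


def extract_values(payload):
    """Return (labels, value) pairs from an instant-vector query response."""
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    return [
        (item["metric"], float(item["value"][1]))
        for item in payload["data"]["result"]
    ]


def query_prometheus(expr, base="http://localhost:9090", timeout=5.0):
    """Run an instant query against a live Prometheus server."""
    with urlopen(build_query_url(expr, base), timeout=timeout) as resp:
        return extract_values(json.load(resp))
```

Calling `query_prometheus("up")` should list each scraped plugin endpoint with a value of 1.0 when its last scrape succeeded and 0.0 when it failed.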