Dosyalar
rocm-systems/projects/rdc/python_binding
Bill(Shuzhou) Liu d1efa59fe8 Fallback to junction temperature and socket power
If the card does not have edge temperature, fallback to junction
temperature. If the card only have socket power, then use socket
power instead.

Change-Id: I053a67a89cf3b29a34e82123f522c08d7dd68916


[ROCm/rdc commit: 5cfe2b4169]
2024-02-05 10:10:26 -06:00
..
2020-08-17 14:09:37 -05:00
2020-11-10 14:26:49 -05:00
2020-08-17 14:09:37 -05:00
2022-04-27 14:38:48 -04:00

Quick start

If you do not have the RDC installed, please specify the RDC library path using:

$ export LD_LIBRARY_PATH=<rdc_libs_path>

Then you can run RdcReader in python_binding folder:

$ python RdcReader.py

Prometheus plugin

Install the prometheus_client:

$ pip install prometheus_client

Start the rdcd with auth and then run plugin to connect to it:

$ python rdc_prometheus.py

Check the options of the plugin:

$ python rdc_prometheus.py --help

Verify the plugin is running:

$ curl localhost:5000

In the managment computer, install the Prometheus from https://github.com/prometheus/prometheus

Modify the file prometheus_targets.json to add the compute nodes running the plugin. Start the Prometheus

$ prometheus --config.file=<full path of the rdc_prometheus_example.yml>

Browse to localhost:9090 in the managment computer for metrics from RDC.