Arquivos
rocm-systems/projects/rdc/python_binding
Bill(Shuzhou) Liu 1602a481cf Integrate RDC with Grafana
A new Grafana dashboard file rdc_grafana_dashboard_example.json
has been added to the folder python_binding. User can import
this dashboard to monitor multiple compute nodes.

To display the host name only in the dashboard, the
rdc_prometheus_example.yml is also changed to create a new label
short_instance which will not have the port number.

Change-Id: I9ab91838006d59c8dcb5fea01decb8c799484e1d


[ROCm/rdc commit: aeba7b0f91]
2020-10-15 14:12:15 -04:00
..
2020-08-17 14:09:37 -05:00
2020-08-17 14:09:37 -05:00
2020-08-17 14:09:37 -05:00
2020-08-17 14:09:37 -05:00
2020-08-17 14:09:37 -05:00

  • Quick start If you do not have the RDC installed, please specify the RDC library path using: export LD_LIBRARY_PATH=<rdc_libs_path>

Then you can run RdcReader in python_binding folder: python RdcReader.py

  • Prometheus plugin Install the prometheus_client: % pip install prometheus_client

Start the rdcd with auth and then run plugin to connect to it: % python rdc_prometheus.py

Check the options of the plugin: % python rdc_prometheus.py --help

Verify the plugin is running: % curl localhost:5000

In the managment computer, install the Prometheus from https://github.com/prometheus/prometheus

Modify the file prometheus_targets.json to add the compute nodes running the plugin. Start the Prometheus % prometheus --config.file=

Browse to localhost:9090 in the managment computer for metrics from RDC.