e2e4fc65c1
Changes:
- Update violation status logic and metric naming for XCP/XCC metrics (thrm/thm consistency)
- Add an XCP identifier to the monitor so partition metrics can be shown with the applicable APIs
(violation status is the first example of this in the monitor)
- Improve CLI monitor output:
support multiple lines per GPU, add new columns, and improve formatting
- Refactor helpers and logger for flexible unit formatting and table rendering
- Add examples for the new metrics APIs amdsmi_get_gpu_pm_metrics_info() and
amdsmi_get_gpu_reg_table_info() to the C++ example
- Sync Python/C++ interface and structures for new metrics fields and naming
- Remove deprecated/unused RSMI activity APIs; no documentation is needed since
the APIs no longer exist in ROCm SMI either.
- Clean up metric violations and fix handle watch arguments
- Improve handling and documentation of average_flattened_ints()
- Group XCP metrics with brackets in human-readable output and adjust output size
Signed-off-by: Poag, Charis <Charis.Poag@amd.com>
AMD SMI CLI tool
A command-line tool for manipulating and monitoring the amdgpu kernel driver.
amd-smi is intended to replace and deprecate the existing
rocm-smi CLI tool.
To use the CLI tool, you need at least one AMD GPU with the driver installed.
Note
The AMD SMI CLI tool is provided as example code to aid the development of telemetry tools. The Python or C++ library is recommended as a robust data source.
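As a rough illustration of reading telemetry from the Python library instead of parsing CLI output, the sketch below queries engine activity for each GPU. It assumes the amdsmi Python package is installed alongside the driver. The helper name read_gpu_activity is hypothetical and not part of the library, and the function returns an empty list when the package or driver is unavailable so that it can run on any machine.

```python
def read_gpu_activity():
    """Return one activity reading per GPU, or [] if amdsmi is unavailable.

    read_gpu_activity is a hypothetical helper written for this example.
    """
    try:
        import amdsmi  # assumption: the amdsmi Python package is installed
    except Exception:
        # Package missing or its shared library could not be loaded.
        return []

    try:
        amdsmi.amdsmi_init()
    except Exception:
        # No driver or no supported GPU present.
        return []

    readings = []
    try:
        for handle in amdsmi.amdsmi_get_processor_handles():
            try:
                # Dict with engine activity values as reported by the library.
                readings.append(amdsmi.amdsmi_get_gpu_activity(handle))
            except amdsmi.AmdSmiException:
                # Keep the GPU's position in the list even if the query fails.
                readings.append(None)
    except amdsmi.AmdSmiException:
        # Enumerating devices failed; return whatever was collected.
        pass
    finally:
        try:
            amdsmi.amdsmi_shut_down()
        except Exception:
            pass
    return readings


if __name__ == "__main__":
    for index, activity in enumerate(read_gpu_activity()):
        print(f"GPU {index}: {activity}")
```

Pairing amdsmi_init() with amdsmi_shut_down() in a finally block releases the library even when a query raises, which matters for long-running telemetry collectors.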
Find the documentation in the docs/ directory.
Online documentation
Explore the latest documentation on the ROCm documentation portal.