fa6f071751
Mode-1 GPU reset affects entire XGMI hive. Added xgmi_hive_id check to reset only once for same-hive GPUs while preserving separate resets for different hives or no hives. - Example: `sudo amd-smi reset -G` or `sudo amd-smi reset -G -g 0` on MI300 will reset all GPU's only once. Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
AMD SMI CLI tool
A command line tool for manipulating and monitoring the amdgpu kernel;
amd-smi is intended to replace and deprecate the existing
rocm-smi CLI tool.
When using the CLI tool, you should have at least one AMD GPU and the driver installed.
Note
The AMD SMI CLI tool is provided as an example code to aid the development of telemetry tools. The Python or C++ library is recommended as a robust data source.
Find the documentation in the docs/ directory.
Online documentation
Explore the latest documentation on the ROCm documentation portal.