Commit-Graf

1343 Incheckningar

Upphovsman SHA1 Meddelande Datum
Williams, Justin e488e8dc33 [DCSM-524] ESMI version fix
Adjusted ESMI version logic

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ie6c480d7175b7bffcce183e9104419e87481aa89

[ROCm/amdsmi commit: 5f56d97d57]
2025-01-30 02:38:20 -06:00
Ramalingam, Muthusamy 746b1404f9 amdsmi: Adding Support to get hsmp Driver version
* amdsmi: Adding Support to get hsmp Driver version

Adding Support to fetch hsmp driver version from ESmi Interfaces.
Adding Support to fetch memory bandwidth per socket.

Signed-off-by: muthusamy <muthusamy.ramalingam@amd.com>

[ROCm/amdsmi commit: ced110dbb6]
2025-01-29 13:45:02 -06:00
Williams, Justin 79ffdfb12f [DCSM-524] ESMI build fix (#72)
Fix amd_hsmp failure to copy new version

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Co-authored-by: Arif, Maisam <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 21841f44a5]
2025-01-29 13:39:19 -06:00
Mallya, Ameya Keshava 16a2c1f9de Added !verify trigger
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>

[ROCm/amdsmi commit: 98052605c9]
2025-01-28 20:08:25 -08:00
Williams, Justin 80aeafa808 [SWDEV-500518] RHEL8 fix (#71)
Fix ordering of RHEL 8 build process

Signed-off-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/amdsmi commit: 782dd075ef]
2025-01-28 16:42:14 -06:00
Arif, Maisam da7e32834b Updated Import Error Logging
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ief4a5f100f54668c5bce001ea051136738fbc468

[ROCm/amdsmi commit: 703415cb1f]
2025-01-28 15:56:49 -06:00
Scaffidi, Salvatore ab20142e0a [SWDEV-511296] Update violation_timestamp to read timestamp from firmware
Updated violation_status->violation_timestamp to read values from firmware timestamp

Signed-off-by: Greg Scaffidi <salvatore.scaffidi@amd.com>
Change-Id: I567f824a9ace09a780bca8bb182d45bed681e9ce


[ROCm/amdsmi commit: 87834bf829]
2025-01-28 15:43:06 -06:00
Joseph Narlo ab88f38fa0 [SWDEV-504583] Resolve Additional Compiler Warnings
Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>


[ROCm/amdsmi commit: dc228398d0]
2025-01-28 15:36:44 -06:00
Joseph Narlo cdf612c53c [SWDEV-504389] Sync Comments in Linux BM
Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>


[ROCm/amdsmi commit: 3d12d64c9b]
2025-01-24 07:24:11 -06:00
Maisam Arif b7a2c6200a Dropped count from amdsmi_get_link_topology_nearest() python API
The count field was not pythonic nor needed

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I212f43dc11f2f2c7eddd39900e6e3aaec03f3f8f


[ROCm/amdsmi commit: 803b18fe95]
2025-01-22 19:07:01 -06:00
Kanangot Balakrishnan, Bindhiya a725ae078c [SWDEV-508042] Fix TypeError in specific clocks csv logging (#57)
Logging specific clocks in csv format was causing TypeError as the levels were int.
Fixed this by appending Level string at the beginning.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: e3e11835e4]
2025-01-22 18:06:13 -06:00
Pham, Gabriel 9617b9c269 [SWDEV-493207] Added amdgpu version to version command
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>

[ROCm/amdsmi commit: b779ce2831]
2025-01-22 18:05:25 -06:00
Williams, Justin 9f0f7eb940 [SWDEV-500518] Removed Artifacts
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/amdsmi commit: a86da2d5d5]
2025-01-22 15:18:57 -06:00
Park, Peter 35b58c8fb4 Update copyright year to 2025
Signed-off-by: Peter Park <peter.park@amd.com>

[ROCm/amdsmi commit: 63af5cf611]
2025-01-22 15:06:09 -06:00
Williams, Justin 9834db174b [SWDEV-500518] Added Tests to CI (#42)
Added all python, amdsmi, and build example script tests.

Signed-off-by: Justin Williams <Justin.Williams@amd.com>

[ROCm/amdsmi commit: bb11b4e2b4]
2025-01-22 10:10:32 -06:00
Kanangot Balakrishnan, Bindhiya 0adb298195 [SWDEV-481004] Update Changelog for gfx_version number fix (#54)
Updated changelog with an example showing correct gfx version.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: 4b74badb00]
2025-01-22 08:49:41 -06:00
Kanangot Balakrishnan, Bindhiya a0c409be01 [SWDEV-481004] Fix for incorrect gfx_version number (#52)
The target_graphics_version was not formatted properly and was
showing incorrect Target Name. Corrected this by fomatting
major, minor and revision numbers.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: 6fa991c39c]
2025-01-21 15:42:05 -06:00
Castillo, Juan 489105e734 [SWDEV-508173] [AMDSMI] Python API missing function errors (#46)
* [SWDEV-508173] Updates include:
- Updating py-interface to import amdsmi_get_gpu_reg_table_info and amdsmi_get_gpu_pm_metrics_info.
- Updating the ctypes from byref to pointer.

Signed-off-by: Castillo, Juan <Juan.Castillo@amd.com>

[ROCm/amdsmi commit: 9cc5c303a2]
2025-01-21 14:11:41 -06:00
Arif, Maisam 60c0561419 [SWDEV-509389] AMD-SMI crash when multiple threads call SMI APIs (#53)
Multi-threaded application rsmi_dev_gpu_metrics_info_get() causes crash

Code changes related to the following:
  * API implementation changes

Change-Id: I1f1fb39c1125569ec5d534b37fd6f68c8829eef7

Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
Authored-by: Oliveira, Daniel <daniel.oliveira@amd.com>

[ROCm/amdsmi commit: 03a2368655]
2025-01-21 14:00:15 -06:00
Kanangot Balakrishnan, Bindhiya d31107d110 SWDEV-457845: Fix Linux VM clean_local_data error on set
Corrected clean_local_data error in Linux VM's while doing
amd-smi set without args.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: 834993e1c3]
2025-01-20 14:45:27 -06:00
Mallya, Ameya Keshava d981f5db8e Fixed Workflow for updated KWS structure
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>

[ROCm/amdsmi commit: c74fde2415]
2025-01-17 08:17:43 -08:00
Poag, Charis fa81bcb513 [SWDEV-488276] Add partition 2.0 functionality (#44)
Changes:
* CLI:
  - Updated amd-smi partition
  - Updated amd-smi partition -c
  - Updated amd-smi partition -m
  - Updated amd-smi partition -a
  - Updated amd-smi set -M <NPS1/NPS2/NPS4/NPS8>
  - Updated amd-smi set -C <SPX/DPX/QPX/TPX/CPX>
  - Updated amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>
    Where PROFILE_INDEX = available ACCELERATOR_TYPES
  - Updated amd-smi set --help, now includes more detail for
    amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>

* API:
  - Added amdsmi_get_gpu_memory_partition_config
  - Added amdsmi_set_gpu_memory_partition_mode
  - Added amdsmi_get_gpu_accelerator_partition_profile_config
  - Updated amdsmi_get_gpu_accelerator_partition_profile_config
  - Added amdsmi_set_gpu_accelerator_partition_profile

Signed-off-by: Charis Poag <Charis.Poag@amd.com>

[ROCm/amdsmi commit: c1cd2b46ef]
2025-01-16 00:53:46 -06:00
Mallya, Ameya Keshava 8f203f8bca Added KWS check
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>


[ROCm/amdsmi commit: c6bb6cafd0]
2025-01-16 00:53:38 -06:00
Scaffidi, Salvatore 6eeb06927d [SWDEV-463406] Update API with fields for gfx_clock_below_host_limit and low_utilization violations
Updated API with fields for gfx_clock_below_host_limit and low_utilization violations
Change-Id: I25647bae6e7b785f44dab024272767658688bcad

---------
Signed-off-by: Scaffidi, Salvatore <Salvatore.Scaffidi@amd.com>
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
Co-authored-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 3793be7735]
2025-01-08 22:07:23 -06:00
Arif, Maisam c563c9c8d5 Corrected spacing and simplified logic
Change-Id: I51c98339367d1cb9470a00ee05463ac8662d6b01

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 490132748f]
2025-01-08 20:18:24 -06:00
Kanangot Balakrishnan, Bindhiya 67d5b8568a SWDEV-504130 Add temperature violation status to amd-smi monitor (#2)
Added boolean temperature violation status to amd-smi monitor.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: d0e770ffbc]
2025-01-08 16:35:53 -06:00
Pham, Gabriel 9e4aa3a2e1 [SWDEV-502523] Made amd-smi reset command arguments mutually exclusive
Made reset arguments mutually exclusive so that users can only 
select one option at a time to prevent throwing of errors.

---------
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>

[ROCm/amdsmi commit: 129ad8ffad]
2025-01-08 16:24:05 -06:00
Castillo, Juan 90e148b499 [SWDEV-495169] Update err output to log_err (#24)
Update status type for EPERM and ENOENT based on feedback from ticket.
Update error output to LOG_ERR.

---------

Signed-off-by: Juan Castillo <juan.castillo@amd.com>

[ROCm/amdsmi commit: 60492e754f]
2025-01-07 17:35:39 -06:00
Williams, Justin bfcf37705d [SWDEV-479339/SWDEV-498804] Added AMDSMI Dockerfile (#35)
Updated AMDSMI Dockerfile

Signed-off-by: Justin Williams <Justin.Williams@amd.com>


[ROCm/amdsmi commit: 7323881957]
2025-01-07 17:28:16 -06:00
Meng, Li (Jassmine) 1872c123d8 [SWDEV-230863] add two new interfaces for background health check (#4)
1. Get the bad pages threshold of a processor.
2. Verify the checksum of RAS EEPROM

Signed-off-by: Meng Li <li.meng@amd.com>

[ROCm/amdsmi commit: dc400d916e]
2025-01-07 17:26:55 -06:00
Huang, Tim 5e82aac4f8 Add rust bindings for amdsmi c interface (#14)
It consists of two main steps:

1. Generating Bindings with `bindgen`:
- The `build.rs` script uses `bindgen` to generate Rust FFI (Foreign
Function Interface) bindings for the AMD SMI C library. This step
automatically exports all enums, structs, unions, and unsafe functions
from the C library into Rust. This provides a comprehensive low-level
interface to the AMD SMI library.

2. Implementing Safe Rust Wrappers:
- The generated bindings are then wrapped in safe Rust functions. These
safe wrappers handle error checking, resource management, and provide a
more idiomatic Rust interface. This ensures that users of the library
can interact with the AMD SMI functions without dealing with unsafe code
directly.

Change-Id: I7d5e49e59826164fc911ced04ef7ca5706b7cc05

Signed-off-by: Tim Huang <tim.huang@amd.com>

[ROCm/amdsmi commit: d32f2a109a]
2025-01-07 17:19:46 -06:00
Kanangot Balakrishnan, Bindhiya a1d60ef088 [SWDEV-439701] Fix wrong error handling in MissingParameterValue (#32)
Error handling was not displaying the missing parameter details in
argument type validator functions. Fixed this by passing param name to
AmdSmiMissingParameterValueException.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: 3897670757]
2025-01-07 17:13:00 -06:00
Pham, Gabriel e9fbe5bc66 [SWDEV-502523] made set gpu arguments mutually exclusive (#31)
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>

[ROCm/amdsmi commit: 5ed340c08b]
2025-01-07 16:48:01 -06:00
Narlo, Joseph 2578ed922b [SWDEV-506030] Fix Errors when generating python wrapper
Update wrapper script is not generating the python wrapper properly.
Fixed how the structure names are parsed.

Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>

[ROCm/amdsmi commit: 84f4d397aa]
2025-01-07 16:12:37 -06:00
Park, Peter 3968f88b4c [SWDEV-503717] Remove occurrences of "Fusion" in docs
Tiny PR to remove occurrences of "Kernel **Fusion** Driver" in
public-facing docs.

Signed-off-by: Peter Park <peter.park@amd.com>

[ROCm/amdsmi commit: d9bba639df]
2025-01-07 16:11:46 -06:00
Maisam Arif e32383b6c5 Deprecated amdsmi_get_energy_count() power field
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I1b5fe8e278b797458e57dff689e692347901bbfd


[ROCm/amdsmi commit: 8ca2c6e247]
2025-01-07 12:45:55 -06:00
Arif, Maisam b7727b1138 [SWDEV-500518] Added amdsmi-build.yml (#29)
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
Authored-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/amdsmi commit: e47add5a1b]
2024-12-21 00:57:03 -06:00
Pham, Gabriel 0c4db04140 [SWDEV-476303] Exposed valid values for set command (#8)
Updated amd-smi set help text
---------

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>

[ROCm/amdsmi commit: 93a027ec95]
2024-12-20 15:32:10 -06:00
gabrpham 11d7fb9508 Additional fixes for amd-smi static --clock
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: 23da950ef0]
2024-12-20 14:45:20 -06:00
Charis Poag bf4bbef085 [SWDEV-484382] Fix VCLK/DCLK outputs for monitor, static, metric
Units were off and VCLK/DCLK outputs were not coming in
properly through amdsmi_get_clk_freq()

Now we match units sent back through rsmi_dev_gpu_clk_freq_get (MHz).

CLI now shows maximum of 2 VCLK/DCLKs otherwise shows N/A if there
is no current_freq listed.

Change-Id: I8a7b66cbb5263e8d396f8568c104e1ce3512923d
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 3226a1d0ea]
2024-12-20 14:11:08 -06:00
Choudhary, Rahul f017a4fecd Update rocm_ci_caller.yml - Enabling PSDB pipeline for pull request
pull request in amd-staging will trigger staging pipeline - compute-psdb-staging-smi-libs-ghemu

pull request in amd-mainline will trigger mainline psdb - compute-psdb-no-npi-ghemu

Signed-off-by: Choudhary, Rahul <Rahul.Choudhary@amd.com>

[ROCm/amdsmi commit: 3a7b8bb664]
2024-12-19 23:17:54 -08:00
Maisam Arif 1f3da7e0a9 [SWDEV-481702] Update marketing name source
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>


[ROCm/amdsmi commit: 338cdd63ce]
2024-12-19 18:32:51 -06:00
Maisam Arif 04b864bd98 Bump Version to 24.7.2.0
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>


[ROCm/amdsmi commit: 6dcbff866b]
2024-12-19 17:04:55 -06:00
Arif, Maisam b4ad0a48c6 Update CHANGELOG.md
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 34f9edd2fc]
2024-12-19 16:47:31 -06:00
Justin Williams a803595fe0 Fixed Python module error
[ROCm/amdsmi commit: 747834624b]
2024-12-19 11:19:01 -06:00
Williams, Justin 321aa328de Update README.md
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/amdsmi commit: 71d8fa1c0f]
2024-12-18 15:36:26 -06:00
Justin Williams 83718095a4 Removed Wheel & Setuptools dependency
[ROCm/amdsmi commit: eb5f64a5e2]
2024-12-18 15:35:28 -06:00
Juan Castillo 2ddb2ef032 [SWDEV-496693]GPU Metrics 1.7
Features added:
- [SWDEV-475244] Add new interface to get max memory bandwidth
Updated API: amdsmi_get_gpu_vram_info
Updated: struct amdsmi_vram_info_t to include vram_max_bandwidth
CLI: amd-smi static --vram

- [SWDEV-488349] Add new interface for XGMI link status
New API: amdsmi_get_gpu_xgmi_link_status
CLI: amd-smi xgmi --link-status

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
Change-Id: I1aa35b741136eb4f02f7ea9a95b865886273eb72


[ROCm/amdsmi commit: f8b8347627]
2024-12-18 10:57:06 -06:00
Joe Narlo 01d303806a SWDEV-504389 [AMD-SMI] Synching Comments in Linux BM
Sync comments from Unified Header to Linux BM

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I9b1ae94db68761a7963ad87cd60177a57e93ad85


[ROCm/amdsmi commit: ef31bb7166]
2024-12-18 10:57:06 -06:00
Choudhary, Rahul 0375bc03b3 Update rocm_ci_caller.yml fixing base ref
base ref to cover both pull and push request

[ROCm/amdsmi commit: 6ffe28fb47]
2024-12-17 12:19:06 -08:00