Commit Graph

1545 Commits

Author SHA1 Message Date
Williams, Justin a86da2d5d5 [SWDEV-500518] Removed Artifacts
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
2025-01-22 15:18:57 -06:00
Park, Peter 63af5cf611 Update copyright year to 2025
Signed-off-by: Peter Park <peter.park@amd.com>
2025-01-22 15:06:09 -06:00
Williams, Justin bb11b4e2b4 [SWDEV-500518] Added Tests to CI (#42)
Added all python, amdsmi, and build example script tests.

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
2025-01-22 10:10:32 -06:00
Kanangot Balakrishnan, Bindhiya 4b74badb00 [SWDEV-481004] Update Changelog for gfx_version number fix (#54)
Updated changelog with an example showing correct gfx version.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-22 08:49:41 -06:00
Kanangot Balakrishnan, Bindhiya 6fa991c39c [SWDEV-481004] Fix for incorrect gfx_version number (#52)
The target_graphics_version was not formatted properly and was
showing incorrect Target Name. Corrected this by fomatting
major, minor and revision numbers.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-21 15:42:05 -06:00
Castillo, Juan 9cc5c303a2 [SWDEV-508173] [AMDSMI] Python API missing function errors (#46)
* [SWDEV-508173] Updates include:
- Updating py-interface to import amdsmi_get_gpu_reg_table_info and amdsmi_get_gpu_pm_metrics_info.
- Updating the ctypes from byref to pointer.

Signed-off-by: Castillo, Juan <Juan.Castillo@amd.com>
2025-01-21 14:11:41 -06:00
Arif, Maisam 03a2368655 [SWDEV-509389] AMD-SMI crash when multiple threads call SMI APIs (#53)
Multi-threaded application rsmi_dev_gpu_metrics_info_get() causes crash

Code changes related to the following:
  * API implementation changes

Change-Id: I1f1fb39c1125569ec5d534b37fd6f68c8829eef7

Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
Authored-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2025-01-21 14:00:15 -06:00
Kanangot Balakrishnan, Bindhiya 834993e1c3 SWDEV-457845: Fix Linux VM clean_local_data error on set
Corrected clean_local_data error in Linux VM's while doing
amd-smi set without args.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-20 14:45:27 -06:00
Mallya, Ameya Keshava c74fde2415 Fixed Workflow for updated KWS structure
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>
2025-01-17 08:17:43 -08:00
Poag, Charis c1cd2b46ef [SWDEV-488276] Add partition 2.0 functionality (#44)
Changes:
* CLI:
  - Updated amd-smi partition
  - Updated amd-smi partition -c
  - Updated amd-smi partition -m
  - Updated amd-smi partition -a
  - Updated amd-smi set -M <NPS1/NPS2/NPS4/NPS8>
  - Updated amd-smi set -C <SPX/DPX/QPX/TPX/CPX>
  - Updated amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>
    Where PROFILE_INDEX = available ACCELERATOR_TYPES
  - Updated amd-smi set --help, now includes more detail for
    amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>

* API:
  - Added amdsmi_get_gpu_memory_partition_config
  - Added amdsmi_set_gpu_memory_partition_mode
  - Added amdsmi_get_gpu_accelerator_partition_profile_config
  - Updated amdsmi_get_gpu_accelerator_partition_profile_config
  - Added amdsmi_set_gpu_accelerator_partition_profile

Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2025-01-16 00:53:46 -06:00
Mallya, Ameya Keshava c6bb6cafd0 Added KWS check
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>
2025-01-16 00:53:38 -06:00
Scaffidi, Salvatore 3793be7735 [SWDEV-463406] Update API with fields for gfx_clock_below_host_limit and low_utilization violations
Updated API with fields for gfx_clock_below_host_limit and low_utilization violations
Change-Id: I25647bae6e7b785f44dab024272767658688bcad

---------
Signed-off-by: Scaffidi, Salvatore <Salvatore.Scaffidi@amd.com>
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
Co-authored-by: Charis Poag <Charis.Poag@amd.com>
2025-01-08 22:07:23 -06:00
Arif, Maisam 490132748f Corrected spacing and simplified logic
Change-Id: I51c98339367d1cb9470a00ee05463ac8662d6b01

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2025-01-08 20:18:24 -06:00
Kanangot Balakrishnan, Bindhiya d0e770ffbc SWDEV-504130 Add temperature violation status to amd-smi monitor (#2)
Added boolean temperature violation status to amd-smi monitor.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-08 16:35:53 -06:00
Pham, Gabriel 129ad8ffad [SWDEV-502523] Made amd-smi reset command arguments mutually exclusive
Made reset arguments mutually exclusive so that users can only 
select one option at a time to prevent throwing of errors.

---------
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2025-01-08 16:24:05 -06:00
Castillo, Juan 60492e754f [SWDEV-495169] Update err output to log_err (#24)
Update status type for EPERM and ENOENT based on feedback from ticket.
Update error output to LOG_ERR.

---------

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
2025-01-07 17:35:39 -06:00
Williams, Justin 7323881957 [SWDEV-479339/SWDEV-498804] Added AMDSMI Dockerfile (#35)
Updated AMDSMI Dockerfile

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
2025-01-07 17:28:16 -06:00
Meng, Li (Jassmine) dc400d916e [SWDEV-230863] add two new interfaces for background health check (#4)
1. Get the bad pages threshold of a processor.
2. Verify the checksum of RAS EEPROM

Signed-off-by: Meng Li <li.meng@amd.com>
2025-01-07 17:26:55 -06:00
Huang, Tim d32f2a109a Add rust bindings for amdsmi c interface (#14)
It consists of two main steps:

1. Generating Bindings with `bindgen`:
- The `build.rs` script uses `bindgen` to generate Rust FFI (Foreign
Function Interface) bindings for the AMD SMI C library. This step
automatically exports all enums, structs, unions, and unsafe functions
from the C library into Rust. This provides a comprehensive low-level
interface to the AMD SMI library.

2. Implementing Safe Rust Wrappers:
- The generated bindings are then wrapped in safe Rust functions. These
safe wrappers handle error checking, resource management, and provide a
more idiomatic Rust interface. This ensures that users of the library
can interact with the AMD SMI functions without dealing with unsafe code
directly.

Change-Id: I7d5e49e59826164fc911ced04ef7ca5706b7cc05

Signed-off-by: Tim Huang <tim.huang@amd.com>
2025-01-07 17:19:46 -06:00
Kanangot Balakrishnan, Bindhiya 3897670757 [SWDEV-439701] Fix wrong error handling in MissingParameterValue (#32)
Error handling was not displaying the missing parameter details in
argument type validator functions. Fixed this by passing param name to
AmdSmiMissingParameterValueException.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-07 17:13:00 -06:00
Pham, Gabriel 5ed340c08b [SWDEV-502523] made set gpu arguments mutually exclusive (#31)
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2025-01-07 16:48:01 -06:00
Narlo, Joseph 84f4d397aa [SWDEV-506030] Fix Errors when generating python wrapper
Update wrapper script is not generating the python wrapper properly.
Fixed how the structure names are parsed.

Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>
2025-01-07 16:12:37 -06:00
Park, Peter d9bba639df [SWDEV-503717] Remove occurrences of "Fusion" in docs
Tiny PR to remove occurrences of "Kernel **Fusion** Driver" in
public-facing docs.

Signed-off-by: Peter Park <peter.park@amd.com>
2025-01-07 16:11:46 -06:00
Maisam Arif 8ca2c6e247 Deprecated amdsmi_get_energy_count() power field
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I1b5fe8e278b797458e57dff689e692347901bbfd
2025-01-07 12:45:55 -06:00
Arif, Maisam e47add5a1b [SWDEV-500518] Added amdsmi-build.yml (#29)
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
Authored-by: Williams, Justin <Justin.Williams@amd.com>
2024-12-21 00:57:03 -06:00
Pham, Gabriel 93a027ec95 [SWDEV-476303] Exposed valid values for set command (#8)
Updated amd-smi set help text
---------

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2024-12-20 15:32:10 -06:00
gabrpham 23da950ef0 Additional fixes for amd-smi static --clock
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
2024-12-20 14:45:20 -06:00
Charis Poag 3226a1d0ea [SWDEV-484382] Fix VCLK/DCLK outputs for monitor, static, metric
Units were off and VCLK/DCLK outputs were not coming in
properly through amdsmi_get_clk_freq()

Now we match units sent back through rsmi_dev_gpu_clk_freq_get (MHz).

CLI now shows maximum of 2 VCLK/DCLKs otherwise shows N/A if there
is no current_freq listed.

Change-Id: I8a7b66cbb5263e8d396f8568c104e1ce3512923d
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-20 14:11:08 -06:00
Choudhary, Rahul 23a21ea202 Update rocm_ci_caller.yml - Enabling PSDB pipeline for pull request (#18) 2024-12-20 00:28:50 -08:00
Choudhary, Rahul 3a7b8bb664 Update rocm_ci_caller.yml - Enabling PSDB pipeline for pull request
pull request in amd-staging will trigger staging pipeline - compute-psdb-staging-smi-libs-ghemu

pull request in amd-mainline will trigger mainline psdb - compute-psdb-no-npi-ghemu

Signed-off-by: Choudhary, Rahul <Rahul.Choudhary@amd.com>
2024-12-19 23:17:54 -08:00
Maisam Arif 338cdd63ce [SWDEV-481702] Update marketing name source
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-19 18:32:51 -06:00
Maisam Arif 6dcbff866b Bump Version to 24.7.2.0
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-19 17:04:55 -06:00
Arif, Maisam 8c579316b8 Update CHANGELOG.md (#20) 2024-12-19 16:50:04 -06:00
Arif, Maisam 34f9edd2fc Update CHANGELOG.md
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
2024-12-19 16:47:31 -06:00
Justin Williams 747834624b Fixed Python module error 2024-12-19 11:19:01 -06:00
Arif, Maisam 0bd1112b6b Merge branch 'amd-mainline' into amd-staging 2024-12-19 11:11:49 -06:00
Williams, Justin 71d8fa1c0f Update README.md
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
2024-12-18 15:36:26 -06:00
Justin Williams eb5f64a5e2 Removed Wheel & Setuptools dependency 2024-12-18 15:35:28 -06:00
Juan Castillo f8b8347627 [SWDEV-496693]GPU Metrics 1.7
Features added:
- [SWDEV-475244] Add new interface to get max memory bandwidth
Updated API: amdsmi_get_gpu_vram_info
Updated: struct amdsmi_vram_info_t to include vram_max_bandwidth
CLI: amd-smi static --vram

- [SWDEV-488349] Add new interface for XGMI link status
New API: amdsmi_get_gpu_xgmi_link_status
CLI: amd-smi xgmi --link-status

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
Change-Id: I1aa35b741136eb4f02f7ea9a95b865886273eb72
2024-12-18 10:57:06 -06:00
Joe Narlo ef31bb7166 SWDEV-504389 [AMD-SMI] Synching Comments in Linux BM
Sync comments from Unified Header to Linux BM

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I9b1ae94db68761a7963ad87cd60177a57e93ad85
2024-12-18 10:57:06 -06:00
Choudhary, Rahul 30e08eed1c Create rocm_ci_caller.yml enabling OSDB for push on mainline 2024-12-17 12:21:17 -08:00
Choudhary, Rahul 6ffe28fb47 Update rocm_ci_caller.yml fixing base ref
base ref to cover both pull and push request
2024-12-17 12:19:06 -08:00
Choudhary, Rahul 2c36a327de Create rocm_ci_caller.yml adding workflow caller for PSDB and OSDB 2024-12-16 22:06:03 -08:00
Choudhary, Rahul c11a7f6eb9 Create codeql.yml
copied from previous repo
2024-12-16 22:03:54 -08:00
Maisam Arif 89ecf1c444 Merge amd-dev into amd-master 20241213
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ie13efc3ba253965dfe32a93438fe39b0509e305f
2024-12-13 12:08:19 -06:00
Maisam Arif 41561ec3a0 Revert "[SWDEV-502001] Fix link for amd_hsmp.h"
This reverts commit a4574a7d13.

Change-Id: I583e3e49989955f03adea84c965503288c0934ee
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-13 12:01:26 -06:00
Joe Narlo d0a7332d32 SWDEV-492272 [AMDSMI] Build/Compiler warnings messages
Fix compiler warnings

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I10657b8f3ef18a9b45311e8f6509958297a57823
2024-12-13 00:38:07 -05:00
gabrpham fe290a2056 [SWDEV-484382] Added fclk and socclk to amd-smi metric -c
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ie7e19c757b05455693c0d26eeb5e8b6c1e238375
2024-12-13 00:33:12 -05:00
gabrpham 5f9c2db6f3 [SWDEV-484382] Added new command amd-smi set -c/--clk-level
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: If45152e3a3c94f65b6a8a960601b9ed16fa3d0d7
2024-12-13 00:32:19 -05:00
gabrpham bc16e1a5da [SWDEV-484382] Added new command amd-smi static --clock
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I49e1aa2e699734d81c40c76c62da1cecc5bd3c0e
2024-12-13 00:30:29 -05:00