Wykres commitów

1558 Commity

Autor SHA1 Wiadomość Data
Scaffidi, Salvatore 9fbdaa66ab [SWDEV-463406] Updating Violations Documentation
Signed-off-by: Greg Scaffidi <salvatore.scaffidi@amd.com>
2025-01-30 02:45:13 -06:00
Williams, Justin 5f56d97d57 [DCSM-524] ESMI version fix
Adjusted ESMI version logic

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ie6c480d7175b7bffcce183e9104419e87481aa89
2025-01-30 02:38:20 -06:00
Ramalingam, Muthusamy ced110dbb6 amdsmi: Adding Support to get hsmp Driver version
* amdsmi: Adding Support to get hsmp Driver version

Adding Support to fetch hsmp driver version from ESmi Interfaces.
Adding Support to fetch memory bandwidth per socket.

Signed-off-by: muthusamy <muthusamy.ramalingam@amd.com>
2025-01-29 13:45:02 -06:00
Williams, Justin 21841f44a5 [DCSM-524] ESMI build fix (#72)
Fix amd_hsmp failure to copy new version

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Co-authored-by: Arif, Maisam <Maisam.Arif@amd.com>
2025-01-29 13:39:19 -06:00
Mallya, Ameya Keshava 98052605c9 Added !verify trigger
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>
2025-01-28 20:08:25 -08:00
Williams, Justin 782dd075ef [SWDEV-500518] RHEL8 fix (#71)
Fix ordering of RHEL 8 build process

Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
2025-01-28 16:42:14 -06:00
Arif, Maisam 703415cb1f Updated Import Error Logging
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ief4a5f100f54668c5bce001ea051136738fbc468
2025-01-28 15:56:49 -06:00
Scaffidi, Salvatore 87834bf829 [SWDEV-511296] Update violation_timestamp to read timestamp from firmware
Updated violation_status->violation_timestamp to read values from firmware timestamp

Signed-off-by: Greg Scaffidi <salvatore.scaffidi@amd.com>
Change-Id: I567f824a9ace09a780bca8bb182d45bed681e9ce
2025-01-28 15:43:06 -06:00
Joseph Narlo dc228398d0 [SWDEV-504583] Resolve Additional Compiler Warnings
Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>
2025-01-28 15:36:44 -06:00
Joseph Narlo 3d12d64c9b [SWDEV-504389] Sync Comments in Linux BM
Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>
2025-01-24 07:24:11 -06:00
Maisam Arif 803b18fe95 Dropped count from amdsmi_get_link_topology_nearest() python API
The count field was not pythonic nor needed

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I212f43dc11f2f2c7eddd39900e6e3aaec03f3f8f
2025-01-22 19:07:01 -06:00
Kanangot Balakrishnan, Bindhiya e3e11835e4 [SWDEV-508042] Fix TypeError in specific clocks csv logging (#57)
Logging specific clocks in csv format was causing TypeError as the levels were int.
Fixed this by appending Level string at the beginning.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-22 18:06:13 -06:00
Pham, Gabriel b779ce2831 [SWDEV-493207] Added amdgpu version to version command
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
2025-01-22 18:05:25 -06:00
Williams, Justin a86da2d5d5 [SWDEV-500518] Removed Artifacts
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
2025-01-22 15:18:57 -06:00
Park, Peter 63af5cf611 Update copyright year to 2025
Signed-off-by: Peter Park <peter.park@amd.com>
2025-01-22 15:06:09 -06:00
Williams, Justin bb11b4e2b4 [SWDEV-500518] Added Tests to CI (#42)
Added all python, amdsmi, and build example script tests.

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
2025-01-22 10:10:32 -06:00
Kanangot Balakrishnan, Bindhiya 4b74badb00 [SWDEV-481004] Update Changelog for gfx_version number fix (#54)
Updated changelog with an example showing correct gfx version.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-22 08:49:41 -06:00
Kanangot Balakrishnan, Bindhiya 6fa991c39c [SWDEV-481004] Fix for incorrect gfx_version number (#52)
The target_graphics_version was not formatted properly and was
showing incorrect Target Name. Corrected this by fomatting
major, minor and revision numbers.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-21 15:42:05 -06:00
Castillo, Juan 9cc5c303a2 [SWDEV-508173] [AMDSMI] Python API missing function errors (#46)
* [SWDEV-508173] Updates include:
- Updating py-interface to import amdsmi_get_gpu_reg_table_info and amdsmi_get_gpu_pm_metrics_info.
- Updating the ctypes from byref to pointer.

Signed-off-by: Castillo, Juan <Juan.Castillo@amd.com>
2025-01-21 14:11:41 -06:00
Arif, Maisam 03a2368655 [SWDEV-509389] AMD-SMI crash when multiple threads call SMI APIs (#53)
Multi-threaded application rsmi_dev_gpu_metrics_info_get() causes crash

Code changes related to the following:
  * API implementation changes

Change-Id: I1f1fb39c1125569ec5d534b37fd6f68c8829eef7

Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
Authored-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2025-01-21 14:00:15 -06:00
Kanangot Balakrishnan, Bindhiya 834993e1c3 SWDEV-457845: Fix Linux VM clean_local_data error on set
Corrected clean_local_data error in Linux VM's while doing
amd-smi set without args.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-20 14:45:27 -06:00
Mallya, Ameya Keshava c74fde2415 Fixed Workflow for updated KWS structure
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>
2025-01-17 08:17:43 -08:00
Poag, Charis c1cd2b46ef [SWDEV-488276] Add partition 2.0 functionality (#44)
Changes:
* CLI:
  - Updated amd-smi partition
  - Updated amd-smi partition -c
  - Updated amd-smi partition -m
  - Updated amd-smi partition -a
  - Updated amd-smi set -M <NPS1/NPS2/NPS4/NPS8>
  - Updated amd-smi set -C <SPX/DPX/QPX/TPX/CPX>
  - Updated amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>
    Where PROFILE_INDEX = available ACCELERATOR_TYPES
  - Updated amd-smi set --help, now includes more detail for
    amd-smi set -C <ACCELERATOR_TYPE> or <PROFILE_INDEX>

* API:
  - Added amdsmi_get_gpu_memory_partition_config
  - Added amdsmi_set_gpu_memory_partition_mode
  - Added amdsmi_get_gpu_accelerator_partition_profile_config
  - Updated amdsmi_get_gpu_accelerator_partition_profile_config
  - Added amdsmi_set_gpu_accelerator_partition_profile

Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2025-01-16 00:53:46 -06:00
Mallya, Ameya Keshava c6bb6cafd0 Added KWS check
Signed-off-by: Mallya, Ameya Keshava <AmeyaKeshava.Mallya@amd.com>
2025-01-16 00:53:38 -06:00
Scaffidi, Salvatore 3793be7735 [SWDEV-463406] Update API with fields for gfx_clock_below_host_limit and low_utilization violations
Updated API with fields for gfx_clock_below_host_limit and low_utilization violations
Change-Id: I25647bae6e7b785f44dab024272767658688bcad

---------
Signed-off-by: Scaffidi, Salvatore <Salvatore.Scaffidi@amd.com>
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
Co-authored-by: Charis Poag <Charis.Poag@amd.com>
2025-01-08 22:07:23 -06:00
Arif, Maisam 490132748f Corrected spacing and simplified logic
Change-Id: I51c98339367d1cb9470a00ee05463ac8662d6b01

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2025-01-08 20:18:24 -06:00
Kanangot Balakrishnan, Bindhiya d0e770ffbc SWDEV-504130 Add temperature violation status to amd-smi monitor (#2)
Added boolean temperature violation status to amd-smi monitor.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-08 16:35:53 -06:00
Pham, Gabriel 129ad8ffad [SWDEV-502523] Made amd-smi reset command arguments mutually exclusive
Made reset arguments mutually exclusive so that users can only 
select one option at a time to prevent throwing of errors.

---------
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2025-01-08 16:24:05 -06:00
Castillo, Juan 60492e754f [SWDEV-495169] Update err output to log_err (#24)
Update status type for EPERM and ENOENT based on feedback from ticket.
Update error output to LOG_ERR.

---------

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
2025-01-07 17:35:39 -06:00
Williams, Justin 7323881957 [SWDEV-479339/SWDEV-498804] Added AMDSMI Dockerfile (#35)
Updated AMDSMI Dockerfile

Signed-off-by: Justin Williams <Justin.Williams@amd.com>
2025-01-07 17:28:16 -06:00
Meng, Li (Jassmine) dc400d916e [SWDEV-230863] add two new interfaces for background health check (#4)
1. Get the bad pages threshold of a processor.
2. Verify the checksum of RAS EEPROM

Signed-off-by: Meng Li <li.meng@amd.com>
2025-01-07 17:26:55 -06:00
Huang, Tim d32f2a109a Add rust bindings for amdsmi c interface (#14)
It consists of two main steps:

1. Generating Bindings with `bindgen`:
- The `build.rs` script uses `bindgen` to generate Rust FFI (Foreign
Function Interface) bindings for the AMD SMI C library. This step
automatically exports all enums, structs, unions, and unsafe functions
from the C library into Rust. This provides a comprehensive low-level
interface to the AMD SMI library.

2. Implementing Safe Rust Wrappers:
- The generated bindings are then wrapped in safe Rust functions. These
safe wrappers handle error checking, resource management, and provide a
more idiomatic Rust interface. This ensures that users of the library
can interact with the AMD SMI functions without dealing with unsafe code
directly.

Change-Id: I7d5e49e59826164fc911ced04ef7ca5706b7cc05

Signed-off-by: Tim Huang <tim.huang@amd.com>
2025-01-07 17:19:46 -06:00
Kanangot Balakrishnan, Bindhiya 3897670757 [SWDEV-439701] Fix wrong error handling in MissingParameterValue (#32)
Error handling was not displaying the missing parameter details in
argument type validator functions. Fixed this by passing param name to
AmdSmiMissingParameterValueException.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
2025-01-07 17:13:00 -06:00
Pham, Gabriel 5ed340c08b [SWDEV-502523] made set gpu arguments mutually exclusive (#31)
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2025-01-07 16:48:01 -06:00
Narlo, Joseph 84f4d397aa [SWDEV-506030] Fix Errors when generating python wrapper
Update wrapper script is not generating the python wrapper properly.
Fixed how the structure names are parsed.

Signed-off-by: Joseph Narlo <joseph.narlo@amd.com>
2025-01-07 16:12:37 -06:00
Park, Peter d9bba639df [SWDEV-503717] Remove occurrences of "Fusion" in docs
Tiny PR to remove occurrences of "Kernel **Fusion** Driver" in
public-facing docs.

Signed-off-by: Peter Park <peter.park@amd.com>
2025-01-07 16:11:46 -06:00
Maisam Arif 8ca2c6e247 Deprecated amdsmi_get_energy_count() power field
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I1b5fe8e278b797458e57dff689e692347901bbfd
2025-01-07 12:45:55 -06:00
Arif, Maisam e47add5a1b [SWDEV-500518] Added amdsmi-build.yml (#29)
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
Authored-by: Williams, Justin <Justin.Williams@amd.com>
2024-12-21 00:57:03 -06:00
Pham, Gabriel 93a027ec95 [SWDEV-476303] Exposed valid values for set command (#8)
Updated amd-smi set help text
---------

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com>
2024-12-20 15:32:10 -06:00
gabrpham 23da950ef0 Additional fixes for amd-smi static --clock
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
2024-12-20 14:45:20 -06:00
Charis Poag 3226a1d0ea [SWDEV-484382] Fix VCLK/DCLK outputs for monitor, static, metric
Units were off and VCLK/DCLK outputs were not coming in
properly through amdsmi_get_clk_freq()

Now we match units sent back through rsmi_dev_gpu_clk_freq_get (MHz).

CLI now shows maximum of 2 VCLK/DCLKs otherwise shows N/A if there
is no current_freq listed.

Change-Id: I8a7b66cbb5263e8d396f8568c104e1ce3512923d
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-20 14:11:08 -06:00
Choudhary, Rahul 23a21ea202 Update rocm_ci_caller.yml - Enabling PSDB pipeline for pull request (#18) 2024-12-20 00:28:50 -08:00
Choudhary, Rahul 3a7b8bb664 Update rocm_ci_caller.yml - Enabling PSDB pipeline for pull request
pull request in amd-staging will trigger staging pipeline - compute-psdb-staging-smi-libs-ghemu

pull request in amd-mainline will trigger mainline psdb - compute-psdb-no-npi-ghemu

Signed-off-by: Choudhary, Rahul <Rahul.Choudhary@amd.com>
2024-12-19 23:17:54 -08:00
Maisam Arif 338cdd63ce [SWDEV-481702] Update marketing name source
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-19 18:32:51 -06:00
Maisam Arif 6dcbff866b Bump Version to 24.7.2.0
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-19 17:04:55 -06:00
Arif, Maisam 8c579316b8 Update CHANGELOG.md (#20) 2024-12-19 16:50:04 -06:00
Arif, Maisam 34f9edd2fc Update CHANGELOG.md
Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
2024-12-19 16:47:31 -06:00
Justin Williams 747834624b Fixed Python module error 2024-12-19 11:19:01 -06:00
Arif, Maisam 0bd1112b6b Merge branch 'amd-mainline' into amd-staging 2024-12-19 11:11:49 -06:00
Williams, Justin 71d8fa1c0f Update README.md
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>
2024-12-18 15:36:26 -06:00