提交線圖

1558 次程式碼提交

作者 SHA1 備註 日期
Maisam Arif ae2c713d67 Update market name device ids
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I10ce84c8466ff30e2486ed3664a9fe1b57d9c9e4
2024-09-04 10:33:43 -05:00
Maisam Arif 1efb5e9910 Updated cli init functions to not intersect with lib init functions
Added Quick start script to quickly test python APIs
"python3 -i tools/amdsmi_quick_start.py"
Fixed ESMI lib macros

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I55370a0cb79d631f7f2f2b91568f089b503ebfad
2024-09-04 10:23:36 -04:00
Xiaodong Wang 2066872297 Fix ASAN issue in DiscoverAmdgpuDevices
I ran a test that exercised this code in dev mode and ASAN found a memory access issue due to the iterator returned by lower_bound being dereferenced unconditionally.  I believe the right fix is to check if the iterator is within the map and if not go to the else branch

Change-Id: I34fdce634791a09a89eee76c8b2b64a9607d57f9
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-09-04 10:14:10 -04:00
Maisam Arif d40e4d18a0 Added Example commands for amd-smi CLI
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I4a0211f7dd54de9b225e4546509134bb45c97956
2024-09-04 09:39:21 -04:00
gabrpham 95ca2b83a1 Changed power parameter in amdsmi_get_energy_count() to energy_accumulator
Issue linked here: https://github.com/ROCm/amdsmi/issues/38

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I622236eb3f0144aefeb6c82d2713b4822bfeeb11
2024-09-04 09:38:08 -04:00
muthusamy 3c954e78fc [SWDEV-481002] Fix in Update Market Names
Signed-off-by: muthusamy <muthusamy.ramalingam@amd.com>
Change-Id: I16ea6bdd70f7ed847ef56ddf99dfe66d42c7942a
2024-09-03 11:42:24 +00:00
Maisam Arif b5424c1c7e [SWDEV-481002] Update Market Names
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I23d129712fd7d7a0d9de73511c71a2eeeb3ec183
2024-08-30 13:31:25 -04:00
Charis Poag d9d6637cb7 [SWDEV-451960] [WIP] Add Pytest
Updates:
- Added pytest to shared/pytest folder
- User can execute tests:

[pytest]
python3 -m pytest -p no:cacheprovider /opt/rocm/share/amd_smi/tests/pytest/unit_tests.py -s -v
python3 -m pytest -p no:cacheprovider /opt/rocm/share/amd_smi/tests/pytest/integration_test.py -s -v

[unittest]
/opt/rocm/share/amd_smi/tests/pytest/unit_tests.py -v
/opt/rocm/share/amd_smi/tests/pytest/integration_test.py -v

- Automatically installs pytest

Change-Id: Ia3281a9608aeeb803b91f8b83f87ff84b01037f4
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-08-29 10:09:29 -04:00
Zhang Ava db6edd71a2 Merge amd-dev into amd-master 20240828
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: Ice54ff21b716f137e764687270585239706ea639
2024-08-29 20:05:51 +08:00
Oliveira, Daniel b05849dad0 SWDEV-463401: amdsmi_get_gpu_asic_info() adds num_of_compute_units
number of compute units `amdgpu_gpu_info.num_of_compute_units` is exposed through amdsmi_get_gpu_asic_info().

Code changes related to the following:
  * API
  * CLI
  * Unit tests
  * Examples

Change-Id: Ibeb612d079ed87437a0e56124b8504098fc2dcfd
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-08-28 10:15:07 -04:00
Oliveira, Daniel 893f13ab98 SWDEV-463399: amdsmi_get_gpu_vram_info() adds bit-width
Driver info `amdgpu_gpu_info.vram_bit_width` is exposed through amdsmi_get_gpu_vram_info().

Code changes related to the following:
  * API
  * CLI
  * Unit tests
  * Examples

Change-Id: I8abd8db7a603078b2b1c008b2685cecf35caf3d2
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-08-27 18:22:50 -04:00
Oliveira, Daniel af3670d758 SWDEV-463372: amdsmi_get_utilization_count() adds decoder_activity
GPU Metrics info `gpu_metrics.vcn_activity` is exposed through amdsmi_get_utilization_count().

Code changes related to the following:
  * API
  * CLI
  * Unit tests

Change-Id: I831b2a81bdc0e090a6698dcb689d10f91ed87dd9
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-08-27 16:58:34 -05:00
Maisam Arif 89cbecd0b6 Merge amd-dev into amd-master 20240823
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ibb71d60b3fc87b2fa268f8a95b2fba4e9019ecba
2024-08-23 19:22:15 -05:00
Maisam Arif 7ac0a49470 Removed extra print statement
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I0043567f4cc17d69860b0c77f42fa77fd41e354d
2024-08-23 19:22:05 -05:00
Charis Poag c46eab4e9e [SWDEV-478807] Add leading 0s to amdsmi_get_fw_info()
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
Change-Id: Ie535dcb8cb44138c115e29a4bc41db3cc488097f
2024-08-23 19:37:32 -04:00
Charis Poag d7c583d422 [SWDEV-478807] Fix incorrect firmware versions and names
- Fix updates API to have correct enum names (PM->SMU)
 - Python API/CLI now reports correct versions and names for
    SMC/TA_XGMI/TA_RAS

Change-Id: Icbe115b3070b9f252ef15b09b781b9b3f5861e50
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-08-23 18:03:13 -05:00
Charis Poag a2dc934b05 Fix amdsmi_reg_type_t not defined
Latest updates need to use a wrapper defined value. Breaks basic CLI
functionality.

$ /opt/rocm/bin/amd-smi list
Traceback (most recent call last):
  File "/opt/rocm/bin/amd-smi", line 44, in <module>
    from amdsmi_commands import AMDSMICommands
  File "/opt/rocm/libexec/amdsmi_cli/amdsmi_commands.py", line 30, in <module>
    from amdsmi_helpers import AMDSMIHelpers
  File "/opt/rocm/libexec/amdsmi_cli/amdsmi_helpers.py", line 35, in <module>
    from amdsmi_init import *
  File "/opt/rocm/libexec/amdsmi_cli/amdsmi_init.py", line 35, in <module>
    from amdsmi import amdsmi_interface
  File "/usr/local/lib/python3.8/dist-packages/amdsmi/__init__.py", line 26, in <module>
    from .amdsmi_interface import amdsmi_init
  File "/usr/local/lib/python3.8/dist-packages/amdsmi/amdsmi_interface.py", line 1725, in <module>
    reg_type: amdsmi_reg_type_t,
NameError: name 'amdsmi_reg_type_t' is not defined

Change-Id: I628c811c137f57f3177a718c9bce859bc553bf7d
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-08-22 21:57:36 -05:00
Zhang Ava 8c5942db3a Merge amd-dev into amd-master 20240821
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: I9e5be746557fc7e3c7fdee2d91d0b3981945a75f
2024-08-23 09:45:42 +08:00
Tom St Denis f4506cfd65 Add amdsmi_get_gpu_pm_metrics_info and amdsmi_get_gpu_reg_table_info to py-interface (v3)
v2: drop depend on libc
v3: whitespace

Signed-off-by: Tom St Denis <tom.stdenis@amd.com>
Change-Id: I2eff7aa9d4f0ca8635796f82b106ac0d36176346
2024-08-21 08:38:14 -04:00
Bill(Shuzhou) Liu 97e70d44cf Set soft min or max clock
Add the API to support set soft min or max clock.

Change-Id: Ia34381a721ef3c3d894d5a89d25afa757be46a79
2024-08-20 13:22:32 -04:00
Maisam Arif 2a11d82ab9 Merge amd-dev into amd-master 20240820
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I33f3e9c430c039131a4e81224eb561269aecbe32
2024-08-20 03:17:30 -05:00
Maisam Arif 78373cb5f7 [SWDEV-479989] - Fixed if statement for filtering ecc blocks
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I4d0c7579257c98be8a4ba8e5a31b5d9db4305844
2024-08-20 03:16:49 -05:00
Maisam Arif 9253a941ca Merge amd-dev into amd-master 20240819
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I124d1e5d0896ae539636f72dcc8cc2182cac6954
2024-08-19 17:21:24 -05:00
Maisam Arif c934291940 Revert "Do not automatically download kernel header amd_hsmp.h"
This reverts commit f3cb51c08e.

Change-Id: I48ef2a6df69e7b8bc4e66009e6ee2987af8448fc
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-08-19 17:20:20 -05:00
Zhang Ava 0d6cb91a1d Merge amd-dev into amd-master 20240814
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: I377d25c7fbabe9d177303f0efe077f75be1be29b
2024-08-15 18:48:41 +08:00
Tom Rix f3cb51c08e Do not automatically download kernel header amd_hsmp.h
First look locally following these heuristics
Either as a user specified option -DBUILD_KERNEL_ASM_DIR=<PATH>
or from the running kernel's src
and then from hints at where it could be.

When these fail, download from the upstream kernel

Change-Id: If8d62a4f84a929f550e4a83cda93e4d671e92d02
Signed-off-by: Tom Rix <Tom.Rix@amd.com>
2024-08-13 15:42:20 -05:00
Maisam Arif 1a8462f364 Merge amd-dev into amd-master 20240813
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ic744285e2203fad663a2f4a9f722c2bad1e623c8
2024-08-13 15:35:06 -05:00
Maisam Arif b49c5596b5 [SWDEV-478576] Adjusted Disclaimer
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I0dbdcd1c8ff200336e7ee0e8ca88a5eba1b41057
2024-08-09 18:50:55 -04:00
Maisam Arif 037f8689fe Fixed Guest VM registering as Passthrough VM
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I0edc36e1a114166647dc10ebc646665b62c5d88e
2024-08-09 18:44:44 -04:00
Maisam Arif 210680b570 Removed metric --ecc & --ecc-blocks commands from VM
ecc is not supported on VM
	Added static --ras because ras features are still detectable

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ied4132b863989dfd67897e00904f04d140fd2773
2024-08-09 18:44:44 -04:00
Maisam Arif cec3e4c2a0 [SWDEV-478576] Added Disclaimer for CLI Tool
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I3d432ac3f8c9663365921591d183a5d1f35c4707
2024-08-09 14:14:50 -04:00
Galantsev, Dmitrii d0ff27181b Merge amd-dev into amd-master 20240808
Change-Id: I353b1f5219dd67d1d066fcc51d27677447726776
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-08-08 16:42:06 -05:00
Maisam Arif 40112f5b17 Bump Version to 24.6.3.0
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I902da5e5e9e7441002420afaaef01ca9c6c9666f
2024-08-08 01:30:51 -05:00
Ranjith Ramakrishnan 7591eec971 SWDEV-469004 - Append additonal path to system path
amd-smi is installed in /opt/rocm-ver/bin, but not as a soft link in wheel package
For amd-smi to work from bin directory, it need the extra path to find the dependent python scripts in /opt/rocm-ver/libexec/amd_smi

Change-Id: I4ff63a8f55949aaac51d85eae849ecc890f4c694
2024-08-08 02:15:04 -04:00
Ranjith Ramakrishnan 92a4093256 SWDEV-476075 - Prevent the modification of interpreter directives
CPACK is converting /usr/bin/env python3 to /usr/libexec/platform-python in RHEL8.
Undefining __brp_mangle_shebangs will prevent the same

Change-Id: I5120274b90aeaf783b62414ac2aeba9e84029205
2024-08-08 02:04:37 -04:00
Maisam Arif 574712386f Fixed handling in GPU/CPU/CORE select functions
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I83d78a8d6cdcbd54e5c79330be577b3a06a00985
2024-08-05 18:27:30 -04:00
gabrpham 0143041262 Fixed cli issue with empty cpu/core parameter
Change-Id: Id0fee74357a56baaec59ca5359eb00a65cfd6185
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
2024-08-05 16:37:36 -05:00
gabrpham fe1dc23ade Fixed 'amd-smi process -G'
Issue linked here: https://github.com/ROCm/amdsmi/issues/23

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I73c2dede8634b21a5dfe0245a202e883fa856de2
2024-08-02 16:42:08 -04:00
Galantsev, Dmitrii 3784f37a3a Cleanup convert_SI_unit and misc linter warnings
Change-Id: I000ba548b79a7023aabad653125842064fa2e7cb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-08-02 10:29:06 -04:00
Zhang Ava ce0c45ebca Merge amd-dev into amd-master 20240731
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: Iad8ae44b4d92afc76ecb5afdd4ee57e9c981b6a8
2024-08-02 13:07:51 +08:00
gabrpham de8145387d [SWDEV-439701] Additional GPU error handling
Change-Id: Ieb35e9712f2a78acef8961d865dba1d824969ef3
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
2024-07-30 16:19:10 -05:00
Zhang Ava 813ed1bf92 Merge amd-dev into amd-master 20240724
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: I0f646a946999682c2128c5322447a64ae40286b2
2024-07-26 13:42:16 +08:00
Galantsev, Dmitrii f3426ced06 Docs - Switch to amd-staging branch
Change-Id: I1a26542b3a7831c1f5efea6d6b4084f77b0a7cdb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-07-23 17:10:34 -05:00
Galantsev, Dmitrii ceac87ef5a Azure - Switch to amd-staging branch
Change-Id: I5cc2316427631fc17990506c4234163302febd3d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-07-23 17:04:37 -05:00
Maisam Arif 23fc9e4ea5 Merge amd-dev into amd-master 20240719
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Id47efb0e83a5bfcf2fdc996ab2d0e4e20bc0ab9b
2024-07-19 18:14:33 -05:00
Galantsev, Dmitrii 47c8cd10cf Fix missing c_str() introduced in 8bc8307
Change-Id: Ife778276aaebd109a413efb3db703de36b730613
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-07-19 19:12:17 -04:00
Charis Poag ac40e963d3 Fix TypeError: 'type' object is not subscriptable
Python 3.8 requires typing import to specify.
Python 3.10, no longer requires typing import.

Change-Id: I5d9844c91932bc3af53acc6dd56eb258f4d18d9b
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-07-19 16:33:41 -05:00
Maisam Arif c9a113eecf Merge amd-dev into amd-master 20240719
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: If86d0b42820c0faf1a6ea4525aec6d11bf57a510
2024-07-19 12:47:11 -05:00
Maisam Arif 8bc8307c60 [SWDEV-474450] Removed DEVICE_MUTEX from gpu_reset
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I706fb47288738bfbde94b56fee66bbf807b3c0cb
2024-07-19 11:47:52 -04:00
Maisam Arif 3a9c93bfa6 Updated Changelog with Mutex Fix
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I0aee284ce7600efc66b0ad5392c11bb6a502a929
2024-07-19 11:18:09 -04:00