Deepak Mewar
1bbb19c8b7
Added amdsmi cpu family & cpu model
...
- Updated header and source files
- Updated python interface
- Generated python wrapper for updated header
- Updated the CLI to have cpu family & cpu model
as part of metric table
Change-Id: Iea440251797270d5d29ffe883b0ad6db790be658
[ROCm/amdsmi commit: 6f7273fda5 ]
2024-02-06 18:46:27 -05:00
Deepak Mewar
c3ed6ffbfe
Updated python interface for metric table readable timestamp
...
Change-Id: I251d9ee9c908413e29dd4ce6bf46acb448867a43
[ROCm/amdsmi commit: 8adf3eba1b ]
2024-02-06 18:46:27 -05:00
Maisam Arif
39537d999d
SWDEV-436533 - Cache Info Struct Update
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Ic640fa657cdcc32d7b00ff78fc9452ec7e05dd07
[ROCm/amdsmi commit: 88192d8b6b ]
2024-02-05 16:51:04 -05:00
Bill(Shuzhou) Liu
058e3449a3
UMC ECC count error
...
Remove another place which will check the size of the file.
Change-Id: If681a84253a8594c9f6d44d84dbf5d04a5e35612
[ROCm/amdsmi commit: 9683fc26ca ]
2024-02-05 11:07:29 -06:00
Maisam Arif
d5f2a6770a
Fixed gpu_metric and cache cli checks
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Ic71e2b50dfa8fc106a17079842a7564a8e24b69d
[ROCm/amdsmi commit: 59d885a9ca ]
2024-02-01 05:47:18 -05:00
khashaik
fa2920f4dd
amdsmi_cli: Update the smu fw version data output
...
Change-Id: I8b28f7b5cf32530f5e6b3af4c342d7b6c150012b
[ROCm/amdsmi commit: 1e708aaad9 ]
2024-01-31 15:55:59 -05:00
Bill(Shuzhou) Liu
25a7ea7c06
UMC ECC count return not supported
...
The current code assume err_count sysfs only have 2 lines, which is
changed for umc_err_count by adding extra line for defer errors.
The code is changed to relax such check.
Change-Id: I7129bfcf9ef47346bca3c5a9535bc8c16c670f88
[ROCm/amdsmi commit: b125f3ee8f ]
2024-01-30 11:12:40 -06:00
Galantsev, Dmitrii
2d808e23bc
Update wrapper
...
Change-Id: Ic3ff273851bde9a8986719a668a3135fe7dd715f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/amdsmi commit: fe94be55dc ]
2024-01-29 17:30:26 -06:00
Oliveira, Daniel
a2f04dd3bc
fix: [rocm/amd_smi_lib] header cleanup Remove non-unified headers
...
Cleans up individual gpu metric APIs which will be implemented according to 'unified-headers' standards
Code changes related to the following:
* '_get_gpu_metrics_' APIs
* Functional tests
Change-Id: I2dd2ecde11c1d77e343e0ae0e10aeb9120ae9b99
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com >
[ROCm/amdsmi commit: 55734d2d7a ]
2024-01-26 10:38:48 -05:00
Deepak Mewar
8eac06c5fd
amdsmi README updated for python interface
...
Change-Id: I92c1e8eb646488a9cdc32d0933f27e5db8c172ef
[ROCm/amdsmi commit: 3aabb927b4 ]
2024-01-25 02:19:38 -05:00
Maisam Arif
95f4b4eaf3
Updated engine_activity api
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: I3f62e093fdc0254015c0837dca59763551d3659c
[ROCm/amdsmi commit: 0550c9352c ]
2024-01-24 22:23:48 -05:00
Charis Poag
f357c180e7
Fix metric type error output + re-align with ROCm SMI metrics
...
Changes:
* [CLI] Provide fix for "/opt/rocm/bin/amd-smi metric
TypeError: '>' not supported between instances of 'str' and 'i"
--> Python API was updated, CLI needed to reflect these changes
* [API] Updated amdsmi.h's with ROCm SMI
--> Incorrectly added mem_bandwidth_acc & mem_max_bandwidth
--> Realigned wrapper with updates
* [Test] Added metrics not shown in gpu_metrics_read.cc
Change-Id: Ia3a172377fd5a582254dd5a46d81dbec7e763cd9
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[ROCm/amdsmi commit: 34bd26c68e ]
2024-01-24 21:23:40 -06:00
Bill(Shuzhou) Liu
25ffbb0304
Unified API
...
amdsmi_get_link_metrics() and amdsmi_get_pcie_info()
Change-Id: Iea060e449813b842236243b772e8809497ce98fe
[ROCm/amdsmi commit: 0b67c2ccc4 ]
2024-01-24 18:27:20 -05:00
Deepak Mewar
a8b48ff1e5
amdsmi README updated for esmi library usage
...
Change-Id: I1406f0b0434e735b7d1cc1d931e7a2c92dfba728
[ROCm/amdsmi commit: 9375b6f820 ]
2024-01-24 14:30:26 -05:00
Maisam Arif
7e831b1992
24.2.0 Version update
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Ied7c24d63ca38c2e5ea5eca6b411e0156f61a403
[ROCm/amdsmi commit: c400a22d4d ]
2024-01-24 11:13:02 -06:00
Maisam Arif
9eef868334
24.1.0 Version update
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Ibfe92d199b10dc48ece85dfdeda1041f5ea98626
[ROCm/amdsmi commit: c48c989bbc ]
2024-01-24 12:09:48 -05:00
Maisam Arif
d269c35312
Fix subvendor_id error handling
...
Change-Id: Ibb2e8e329233221e72247674b4f2fbaef51baa32
Signed-off-by: Maisam Arif <maisarif@amd.com >
[ROCm/amdsmi commit: 5e25c0771b ]
2024-01-24 10:59:14 -06:00
Maisam Arif
b6a3bb8109
Corrected AmdSmiCacheTypeNames interface class
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Iec9c6097aec460b180a112be2d24293a40bde125
[ROCm/amdsmi commit: 94f41f2b70 ]
2024-01-24 07:48:30 -06:00
Maisam Arif
084d8f89d1
SWDEV-434348: Corrected Guest Vendor Name values
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: Iee0d45fc64386f0417a0e30cce05608ca2186990
[ROCm/amdsmi commit: 53177525bf ]
2024-01-24 07:34:06 -06:00
Maisam Arif
5edb7a559f
Corrected Cache Type Enum
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: I7d861d022e2855c35e4a79681f83977cc633d1c6
[ROCm/amdsmi commit: 2c87d95ffb ]
2024-01-24 07:28:04 -06:00
Maisam Arif
3273fb6239
SWDEV-440760: Removed specific gpu_metric calls & fixed pcie metrics
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: I679ecede4825c119925de3c9140453653f3f84aa
[ROCm/amdsmi commit: fec1173321 ]
2024-01-24 05:51:36 -06:00
Maisam Arif
037a4283cd
SWDEV-441635: Updated amdsmi_get_utilization_count python API
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: I588e1a61e000d9a5f77f0e8c63f4fef1ec76063e
[ROCm/amdsmi commit: 1ed5080433 ]
2024-01-24 05:51:36 -06:00
Maisam Arif
56f96b613e
Handled unkown vram type out of bounds error
...
Signed-off-by: Maisam Arif <maisarif@amd.com >
Change-Id: I2d32c7043c78c0651f1b4db565a299b6b96abbcc
[ROCm/amdsmi commit: ee80c2cac4 ]
2024-01-24 06:50:17 -05:00
Charis Poag
f2587543e8
SWDEV-436533 [CLI/Python API] Align Cache Info BM UI to Host
...
- [CLI] Refactored cache info to display
cache flags as "cache_properties" names.
Names are displayed as a list of comma-separated
cache type strings. Previously, values
were shown one by one as ENABLED.
ex.
CACHE_PROPERTIES = <a,b,c>
- [JSON] mirrors CLI fields.
No longer display "cache_flags", renamed
field as "cache_properties" dictionary. This
allows users to better understand the
list of names provided.
- [Python API] Updated amdsmi_get_gpu_cache_info
to mirror Host return.
README.md - updated to reflect all changes.
Change-Id: Ife2ef5adcef30058937d1376efb01749e45c02fb
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[ROCm/amdsmi commit: fe86afed8c ]
2024-01-24 06:21:55 -05:00
Charis Poag
6dc774c275
Add ROCm 6.0 change log
...
Update our change log to reflect a few of the major updates
for ROCm 6.0.
Change-Id: I82157fcfad22e63b62d2409bdc979b312356abe8
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[ROCm/amdsmi commit: c260819003 ]
2024-01-23 17:48:13 -05:00
Maisam Arif
05210b2c16
SWDEV-440462: Fixed metric functionality to Linux Guest
...
Change-Id: Ia69d01251d1e9bb3717bda3a7d0f752c739393a6
Signed-off-by: Maisam Arif <maisarif@amd.com >
[ROCm/amdsmi commit: 6292ac513c ]
2024-01-21 02:46:15 -06:00
Charis Poag
c5ba765be0
Merge rocm-smi/amd-staging into amd-dev 20240119
...
Change-Id: Ie706473ff92a91b19e95d2d58f64904cad73a89a
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
[ROCm/amdsmi commit: 6132074089 ]
2024-01-19 03:57:00 -05:00
Deepak Mewar
c081e9e6f8
amdsmi wrapper generated for updated amdsmi_get_esmi_err_msg
...
Change-Id: I2388cd75111774852ae6426071d890bbf2d9d0c9
[ROCm/amdsmi commit: 730a82417e ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
6ef2131a21
amdsmi library updated for esmi error status mapping to amdsmi
...
Change-Id: I7e4dd146a1a9af496556efcf811b2e1ed565b09e
[ROCm/amdsmi commit: 5d0b479661 ]
2024-01-16 11:41:22 -06:00
khashaik
a08809a3ca
amdsmi_cli: Update help section
...
- Update help section
Change-Id: Ida8022a27ecc9df3ebef94e27e89624c18a9cf46
[ROCm/amdsmi commit: 27fbbc3388 ]
2024-01-16 11:41:22 -06:00
khashaik
6395351ac3
amdsmi_cli: Updated README.md file in the amdsmi_cli
...
- Update the README.md file in amdsmi_cli folder to include information
for CPU's and CORE's along with the GPU's
Change-Id: I7670811696bc5299a287a6bc8883afe40eeeb557
[ROCm/amdsmi commit: 994b956d5e ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
1f7c6771eb
amdsmi wrapper generated for updated hsmp metric table
...
Change-Id: I18c795e18d9c95320826cb965f36d3fb5546ea5c
[ROCm/amdsmi commit: 19451cc508 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
171f4818f4
amdsmi library updated for metric table structure
...
Change-Id: Ie8a9840a9020282599dd413e964d86bfb8850f6a
[ROCm/amdsmi commit: a0c95e855b ]
2024-01-16 11:41:22 -06:00
khashaik
a66efce2da
amdsmi_cli: Add checks for no gpu devices, cpu and core devices
...
- Add checks for no gpu devices, cpu and core devices
- Update units for core energy and cpu energy
Change-Id: Ieea43f1bb7fc303ebbbdf72f1ab22644a28df25c
[ROCm/amdsmi commit: 18d8087711 ]
2024-01-16 11:41:22 -06:00
khashaik
c500be9b35
amdsmi_cli: Update parser to add neg values check for the cpu and core arguments
...
Change-Id: Ia7959826637e7749d999a6570df590221e85cf50
[ROCm/amdsmi commit: 108ae03c23 ]
2024-01-16 11:41:22 -06:00
khashaik
47ca69f2a6
amdsmi_cli: Fix issues for CPU related API's for DIMM
...
- Fix interface issues for dimm temperature, dimm refresh rate and dimm power consumption
Change-Id: I998209c8314e4d78a842187c5a0b127aea7dbef2
[ROCm/amdsmi commit: 4971466c22 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
1a2b556dce
amdsmi interface updated to additionally return the freq src
...
from amdsmi_get_cpu_socket_current_active_freq_limit
Change-Id: I48f1026474115848a30352637415e7a1a52f3481
[ROCm/amdsmi commit: 7dcd5a3fd6 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
148ecb1805
amdsmi interface updated for amdsmi_get_metrics_table units
...
Change-Id: If211292e894df9d832b879252bebf91c17112d14
[ROCm/amdsmi commit: 898c4bc06f ]
2024-01-16 11:41:22 -06:00
khashaik
323cf14a9c
amdsmi_cli: Fix issues in cpu API "cpu_lclk_dpm_level"
...
- Fix issues in cpu API "cpu_lclk_dpm_level"
- Fix issue for invalid core id
- Update the error message for invalid devices
Change-Id: I71216ff72f89cfe0c86928ae3dce1f88eae91665
[ROCm/amdsmi commit: 256907989b ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
8bd95a26b4
amdsmi_cli: Enabled hsmp metric table from CLI
...
Change-Id: I7f9c13255f952136438249f5180dec5586d01bd7
[ROCm/amdsmi commit: c74f01f401 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
5d5bb11625
amdsmi interface updated for amdsmi_get_metrics_table encodings
...
Change-Id: Iffed4071d5b2b5645f8118f3fbce26ab258e7882
[ROCm/amdsmi commit: 1b1591571b ]
2024-01-16 11:41:22 -06:00
khashaik
68a49c6c27
amdsmi_cli: Fix issues in "cpu_enable_apb" API
...
Change-Id: I8237fb4641f1a6aecec815fdc020abbf9a3195ba
[ROCm/amdsmi commit: 087a0d3ead ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
52c1014196
amdsmi interface updated for amdsmi_get_metrics_table
...
Change-Id: I0618dd411caf6d30f74793e937984273f9c5b70e
[ROCm/amdsmi commit: 31dc8d0ee8 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
a45d2e1684
amdsmi wrapper generated for updated amdsmi_get_metrics_table
...
Change-Id: Id55a5647064998d8f546c806f857a8745afe52ea
[ROCm/amdsmi commit: 4ecf25e882 ]
2024-01-16 11:41:22 -06:00
Deepak Mewar
3a00172186
amdsmi library and sample code updated for amdsmi_get_metrics_table
...
Change-Id: Ie03c556f5c38fe4a0365743d3a94220e3aa62b23
[ROCm/amdsmi commit: 9f3a6dbd29 ]
2024-01-16 11:41:22 -06:00
Charis Poag
23a0cb827f
GPU Usage/activity update
...
CLI:
Every usage field is notated by "activity"
gfx_usage -> gfx_activity
umc_usage -> umc_activity
vcn_activities -> vcn_activity
jpeg_activities[AID#] -> jpeg_activity
Wrapper: fixed metric output, misalignment
with generator
update_wrapper.sh:
DOCKER_BUILDKIT to 0 (if unset)
API:
amdsmi_get_gpu_metrics_info:
1.3: Removed commenting out avg socket power
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Change-Id: Id3fcc20aef420c7b7a90ba22fa3bc643b2716333
[ROCm/amdsmi commit: 4575990ae7 ]
2024-01-15 23:34:08 -06:00
Bill(Shuzhou) Liu
28f354796d
Use the same mutex as rocm-smi
...
Share the same mutex as rocm-smi implementation. Handle the crash
when a user is not in render group.
Change-Id: I486b26569f9b523b41bbdaf95d51f4a730978cfd
[ROCm/amdsmi commit: 5a6b5d2a0a ]
2024-01-15 13:12:49 -05:00
Charis Poag
31081fa8b0
Fix AMD-SMI test segmentation fault TestGpuMetricsRead
...
Issue: need to return on any failure.
The nullptr check test would segfault without-
all values in struct are not initialized.
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
Change-Id: I4987fb73ba9bcb182de7a439a4286333a41bf7eb
[ROCm/amdsmi commit: d74be3120e ]
2024-01-14 19:27:34 -06:00
Galantsev, Dmitrii
64969f2c61
SWDEV-409184 - Exclude some tests in VM
...
Change-Id: Ic196a113426fc63a0b2aadfa04ab4b10ed6434e3
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
[ROCm/amdsmi commit: a60f5d2d4c ]
2024-01-11 01:38:15 -06:00
Bill(Shuzhou) Liu
29f1584b9c
Add the ROCm version in CLI
...
Print the ROCm version in CLI
Change-Id: I529201274e114bde44722aa9a6aec13c2bedecf7
[ROCm/amdsmi commit: 9dd24a2b5a ]
2024-01-04 15:25:08 -06:00