Bill(Shuzhou) Liu
97e70d44cf
Set soft min or max clock
...
Add an API to set the soft min or max clock.
Change-Id: Ia34381a721ef3c3d894d5a89d25afa757be46a79
2024-08-20 13:22:32 -04:00
Maisam Arif
40112f5b17
Bump Version to 24.6.3.0
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I902da5e5e9e7441002420afaaef01ca9c6c9666f
2024-08-08 01:30:51 -05:00
Maisam Arif
548938389d
Bump Version to 24.6.2.0
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ic389b6783514e88c43958ff5d3413a4c4a8a884f
2024-07-10 19:15:17 -05:00
Oliveira, Daniel
a20db864b8
fix: [SWDEV-466302] [rocm/amd_smi_lib]
...
Fixes: `amdsmi_get_gpu_process_list` now requires sudo to access PID and memory information
Code changes related to the following:
* amdsmi_get_gpu_process_list()
* CLI
Change-Id: I72b154c220276b354c350fcc067c9a7c32e6c173
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-06-24 00:38:17 -04:00
muthusamy
057d688b55
amd-smi [CPU]: Added Support to get number of threads per core
...
Change-Id: I7e6500f3f53068a3483b64a54d78ac9e1d9cd183
2024-06-21 17:22:55 -04:00
Maisam Arif
a3497702cb
Bump Version to 24.6.1.0 and Update Changelog
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I88b8ad1341d7f1a2e14517db82979bb6b28575e8
2024-06-18 23:54:26 -05:00
Bill(Shuzhou) Liu
e3c63628e5
Change the clean shader API to clean local data
...
To align with the unified API.
Change-Id: I2819339fba6f528204cebd3e9605109e82cbc5b4
2024-06-17 16:23:33 -05:00
Dalibor Stanisavljevic
80043adb81
Changed type of oam_id to uint32_t due to header unification
...
Signed-off-by: Dalibor Stanisavljevic <Dalibor.Stanisavljevic@amd.com>
Change-Id: I351415f4a766ad6aa0c2e81adf8b416d066048ea
2024-06-13 05:12:55 -04:00
Bill(Shuzhou) Liu
4cf59c4edb
Change the name of clear sram to run cleaner shader
...
The function cleans the local data in LDS/GPRs, so the name "clear sram"
is misleading.
Change-Id: I0385e6d6348602fe0f347d17e48ed8983f7ceb87
2024-06-05 12:07:39 -05:00
Maisam Arif
68d8c1ab46
Bump Version to 24.6.0.0 and Update Changelog for 6.2.0
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I7f20094514cbfa32e40a6e4da36785d94839768c
2024-05-31 03:05:42 -05:00
Maisam Arif
e5d1ba4621
Use different sysfs for soc_pstate and xgmi_plpd
...
The sysfs is changed to use the pm_policy folder with multiple
dpm_policy files.
Change-Id: I40fac8de2d0cb127950d238b8196f6d2416778d0
2024-05-31 01:38:41 -04:00
Dalibor Stanisavljevic
458dc8f180
SWDEV-457337 - Header alignment
...
Missing AMDSMI_STATUS prefix
Change-Id: I15d050a146c92f6897d48317d8fec51d046535d1
Signed-off-by: Dalibor Stanisavljevic <Dalibor.Stanisavljevic@amd.com>
2024-05-30 15:35:38 -04:00
Dalibor Stanisavljevic
7b2463abe0
SWDEV-457337 - Fix header alignment
...
Change-Id: I9f25f6c4f0d00c76b66d13162f30be11368f5b59
Signed-off-by: Dalibor Stanisavljevic <Dalibor.Stanisavljevic@amd.com>
2024-05-23 04:41:57 -04:00
Maisam Arif
721e3ed3ea
Bump Version to 24.5.3.0
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I0d1ecddd650320287446a06cd8ce680c52a89342
2024-05-15 04:28:27 -04:00
Maisam Arif
7d999aa34c
SWDEV-458102 - Updates to pp_od_clk_voltage parsing
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I650dae1a99856dcde914fe66917cf9111f3ce0e2
2024-05-15 03:18:24 -05:00
Charis Poag
4295bba37f
[SWDEV-451104] Update static --board + amdsmi_get_gpu_board_info()
...
Updates:
* Expanded `amdsmi_get_gpu_board_info()` amdsmi_board_info_t structure size
Updated sizes that work for retrieving relevant board
information across AMD's ASIC products.
* Fixed `amdsmi_get_gpu_board_info()` to no longer return junk char strings
Change-Id: Ie1553c6109d678d283d82c24e9284f8e19cd6ccc
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-05-13 23:05:32 -05:00
Maisam Arif
52843152a5
SWDEV-444567 - Added Ring Hang Event
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I2e73ba08ee0004f6f30660b2fa425ea94bafceca
2024-05-03 17:21:28 -04:00
Maisam Arif
11c72946eb
Revert "SWDEV-458102 - Deprecated Voltage Curve API"
...
This reverts commit 1423fb632e.
Change-Id: I8a3eaf0a9f28200e09fb35d5260fbc070fe8a4a9
2024-05-02 15:27:16 -05:00
Charis Poag
c24d66740e
SWDEV-450580 - Fix powercap set
...
Updates:
* CLI - Added AMDSMIHelpers.convert_SI_unit() to help with unit conversion
* API - Reverted to uW for power cap limits
* CLI - amd-smi static --limit now includes MIN_POWER
* Tests now all use uW units so that conversion to W happens only in the CLI
* Python API now reports the same uW units as the amdgpu driver
* CLI - amd-smi metric --power: Fixed power seen on gpu_metrics v1.3
Change-Id: I32d9ba78d0d8806772f0860f9a803a885b3f316a
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-05-02 10:13:39 -05:00
Maisam Arif
051d5a4d42
Bump Version to 24.5.2.0
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I2f51ed93a356e55156983c56bac293a5d7d3b5c1
2024-05-02 02:53:48 -04:00
Maisam Arif
1423fb632e
SWDEV-458102 - Deprecated Voltage Curve API
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I111c3ce26d2ab66d5e755432f4b8a9bfa631f805
2024-05-02 02:53:29 -04:00
Bill(Shuzhou) Liu
7d2ab7970d
Process isolation and clean shader
...
A few APIs and command line options are added to support process
isolation and clean shader.
Change-Id: I98ad3fc9fc7429799a21798b7fca1c307de7f403
2024-04-24 13:22:20 -04:00
Maisam Arif
1bd18c1a65
Added new ecc blocks and adjusted metric --ecc-block filtering
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ib2f69c7d59ee5108024794434fb202b5e4f58738
2024-04-18 15:01:41 -04:00
Maisam Arif
092908daee
Bump Version to 24.5.1.0
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I842e223b78f337a39098f652fa6e7ef51948fbaf
2024-04-05 02:31:08 -05:00
Oliveira, Daniel
08e2e21bab
fix: [SWDEV-442525] [rocm/amd_smi_lib]
...
Fixes gpu_process_list
Code changes related to the following:
* amdsmi_get_gpu_process_list()
* CLI
* Examples
* Unit tests
* Changelog
* Readme
* rocm_smi_lib commit: 677433b367
Change-Id: I9210fbca7a5da92d0a8b472b72ca82597c8e4fb5
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-03-27 16:48:24 -05:00
Maisam Arif
51b3f8cccb
SWDEV-452739 - Add CEM slot type to amd-smi
...
Updated CHANGELOG.md and re-added spaces after bolded lines
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic728b3e9b083c62fe4c9791b8ede991f5dacc1ca
2024-03-27 02:01:25 -04:00
Bill(Shuzhou) Liu
e4085c6414
Get and set the XGMI PLPD
...
Update the API and CLI to support XGMI Per-Link Power Down Policy.
Change-Id: Iaf04a771eb8bb0829a5b3088d803a7355a8dfd0b
2024-03-26 01:48:14 -05:00
Oliveira, Daniel
1310c767ce
fix: [SWDEV-448201] [rocm/amd_smi_lib]
...
Adds PCIe errors
Code changes related to the following:
* amdsmi_get_pcie_info()
* CLI
* examples
Change-Id: Ie0b7053e77c88fb18309c16e74bce75d862c45a9
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-03-24 23:33:32 -04:00
Bill(Shuzhou) Liu
108e6d4ae6
Set and get DPM policy for GPU device
...
Add new APIs to set and get dpm policy for the GPU device.
Change-Id: I26fa49cd17d0ce66bda3446c38945a6cf35717ff
2024-03-12 10:32:31 -04:00
Bill(Shuzhou) Liu
c489cb8f3f
Add support for deferred RAS errors in API
...
The API will support the deferred errors
Change-Id: I221a146f09fefde1fc31e5f746d0870e07c93561
2024-03-04 22:46:44 -05:00
Maisam Arif
69caba8727
Bump Version to 24.4.0.0 & Corrected argument checks for set subcommand
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I651f8ca652c764f30845503dd869f435f728d5ba
2024-02-23 20:47:19 -06:00
Bill(Shuzhou) Liu
db33cda0c1
Unify the amdsmi_get_pcie_info python interface
...
Make the python interface consistent with the C interface.
Change-Id: Idda08f888947c757e475d5a024b0ec3d8e1d846a
2024-02-22 03:33:59 -05:00
Maisam Arif
f58613561c
Refactor ESMI Initialization and Argument Parsing
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iefab3a8110e0d3c525ee0cef1bdef9101550e9de
2024-02-21 19:02:14 -05:00
Deepak Mewar
84608807da
Fix for multiple hsmp freq sources not reported on some setups
...
Change-Id: I8afe7076bd7790cf408ef104c50ac8d258b7d3fc
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-02-21 06:30:03 -06:00
Maisam Arif
703fdb0ed2
Aligned cache property enum with Host
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ie64a33f55c9a9a7cc8c806419509897351f37c70
2024-02-20 05:48:53 -06:00
Maisam Arif
61f8888488
24.3.0 Version update
...
Change-Id: I936c896117ad64d06ea919a8b7bd6ba4cc388592
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-02-15 17:21:24 -05:00
Maisam Arif
77710921a4
Align list and cache_info to Host
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I4fa55b360b74d5a202d0b9b4eb7aee660b0a1bcf
2024-02-15 01:47:59 -05:00
Deepak Mewar
34ccbb5d1b
Updated amdsmi header for ESMI doxygen formatting
...
Referencing https://github.com/ROCm/amdsmi/pull/10
Change-Id: I516e3643130db8a4213aee7dfcaca27363e3171e
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-02-14 02:03:05 -06:00
Oliveira, Daniel
78074d7d77
fix: [rocm/amd_smi_lib] amdsmi_get_gpu_activity gfx/memory activity does not update
...
Checks and forces rereading gpu metrics unconditionally
Code changes related to the following:
* Device::dev_log_gpu_metrics()
* amdsmi_get_gpu_metrics_header_info()
Removed unintentionally during work on 'header cleanup Remove non-unified headers'
* Examples
* Unit tests
Change-Id: I83710e173c0f7102d0b7f865c18474c979a95cd8
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-02-13 10:15:17 -06:00
Maisam Arif
f831cf49f7
Renamed amdsmi_get_metrics_table to amdsmi_get_cpu_metrics_table
...
Renamed structs to be more consistent with what they are calling
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I6f2be2fcb76f004aa592f0dad8545565700ccd4b
2024-02-12 16:30:18 -06:00
Bill(Shuzhou) Liu
86d025daaa
Add @platform doxygen alias
...
The @platform alias describes which platforms (for example,
gpu_baremetal and/or host) an API can be used on.
The get_platform.py script is a tool to compare APIs across platforms.
Change-Id: I902bc4fea048269eace6e9f3f4a8e93f3adb7f87
2024-02-07 07:28:38 -05:00
Deepak Mewar
6f7273fda5
Added amdsmi cpu family & cpu model
...
- Updated header and source files
- Updated python interface
- Generated python wrapper for updated header
- Updated the CLI to have cpu family & cpu model
as part of metric table
Change-Id: Iea440251797270d5d29ffe883b0ad6db790be658
2024-02-06 18:46:27 -05:00
Maisam Arif
88192d8b6b
SWDEV-436533 - Cache Info Struct Update
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic640fa657cdcc32d7b00ff78fc9452ec7e05dd07
2024-02-05 16:51:04 -05:00
Maisam Arif
59d885a9ca
Fixed gpu_metric and cache cli checks
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic71e2b50dfa8fc106a17079842a7564a8e24b69d
2024-02-01 05:47:18 -05:00
Oliveira, Daniel
55734d2d7a
fix: [rocm/amd_smi_lib] header cleanup Remove non-unified headers
...
Cleans up individual gpu metric APIs which will be implemented according to 'unified-headers' standards
Code changes related to the following:
* '_get_gpu_metrics_' APIs
* Functional tests
Change-Id: I2dd2ecde11c1d77e343e0ae0e10aeb9120ae9b99
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-01-26 10:38:48 -05:00
Charis Poag
34bd26c68e
Fix metric type error output + re-align with ROCm SMI metrics
...
Changes:
* [CLI] Provide fix for "/opt/rocm/bin/amd-smi metric
TypeError: '>' not supported between instances of 'str' and 'i"
--> Python API was updated, CLI needed to reflect these changes
* [API] Updated amdsmi.h's with ROCm SMI
--> Incorrectly added mem_bandwidth_acc & mem_max_bandwidth
--> Realigned wrapper with updates
* [Test] Added metrics not shown in gpu_metrics_read.cc
Change-Id: Ia3a172377fd5a582254dd5a46d81dbec7e763cd9
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-01-24 21:23:40 -06:00
Bill(Shuzhou) Liu
0b67c2ccc4
Unified API
...
amdsmi_get_link_metrics() and amdsmi_get_pcie_info()
Change-Id: Iea060e449813b842236243b772e8809497ce98fe
2024-01-24 18:27:20 -05:00
Maisam Arif
c400a22d4d
24.2.0 Version update
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ied7c24d63ca38c2e5ea5eca6b411e0156f61a403
2024-01-24 11:13:02 -06:00
Maisam Arif
c48c989bbc
24.1.0 Version update
...
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ibfe92d199b10dc48ece85dfdeda1041f5ea98626
2024-01-24 12:09:48 -05:00
Deepak Mewar
5d0b479661
amdsmi library updated for esmi error status mapping to amdsmi
...
Change-Id: I7e4dd146a1a9af496556efcf811b2e1ed565b09e
2024-01-16 11:41:22 -06:00