Граф коммитов

361 Коммитов

Автор SHA1 Сообщение Дата
Bill(Shuzhou) Liu 108e6d4ae6 Set and get DPM policy for GPU device
Add new APIs to set and get dpm policy for the GPU device.

Change-Id: I26fa49cd17d0ce66bda3446c38945a6cf35717ff
2024-03-12 10:32:31 -04:00
Maisam Arif 55cfcf11d6 Added __version__ attr to Python Library
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ibbd90eaa60cc8b9dd0387d7fac8aef06a3a43375
2024-02-28 16:27:33 -05:00
Maisam Arif 57a43babad Removed old Python API function documentation
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ib145fae98f1e99ab474b86ec4f6ddc2c8c44126e
2024-02-26 14:10:49 -06:00
Deepak Mewar f48e3f48a3 DCSM-371 - Observing previous mode details as null for amdsmi_set_cpu_pcie_link_rate
Signed-off-by: Deepak Mewar <deepak.mewar@amd.com>
Change-Id: I79a61d7b10aaff27b07e3d108a9b817c5ead6cf3
2024-02-22 16:30:18 -05:00
Maisam Arif a719ae9707 SWDEV-445396 - Aligned Static Command with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I4182b9104e173f54830fc44819a61d74d31d65d7
2024-02-22 03:35:00 -05:00
Bill(Shuzhou) Liu db33cda0c1 Unify the amdsmi_get_pcie_info python interface
Make the python interface consistent with the C interface.

Change-Id: Idda08f888947c757e475d5a024b0ec3d8e1d846a
2024-02-22 03:33:59 -05:00
Maisam Arif f58613561c Refactor ESMI Initialization and Argument Parsing
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iefab3a8110e0d3c525ee0cef1bdef9101550e9de
2024-02-21 19:02:14 -05:00
Deepak Mewar 84608807da Fix for multiple hsmp freq sources not reported on some setups
Change-Id: I8afe7076bd7790cf408ef104c50ac8d258b7d3fc
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-02-21 06:30:03 -06:00
Maisam Arif 703fdb0ed2 Aligned cache property enum with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ie64a33f55c9a9a7cc8c806419509897351f37c70
2024-02-20 05:48:53 -06:00
Maisam Arif 482930eee3 Cleanup & rev_id fix
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I497037939b3f13b4f835762f2cdd42261e7f26f4
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-02-15 16:50:10 -06:00
Maisam Arif 77710921a4 Align list and cache_info to Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I4fa55b360b74d5a202d0b9b4eb7aee660b0a1bcf
2024-02-15 01:47:59 -05:00
Oliveira, Daniel 78074d7d77 fix: [rocm/amd_smi_lib] amdsmi_get_gpu_activity gfx/memory activity does not update
Checks and forces rereading gpu metrics unconditionally

Code changes related to the following:
  * Device::dev_log_gpu_metrics()
  * amdsmi_get_gpu_metrics_header_info()
    Removed unintentionally during work on 'header cleanup Remove non-unified headers'
  * Examples
  * Unit tests

Change-Id: I83710e173c0f7102d0b7f865c18474c979a95cd8
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>
2024-02-13 10:15:17 -06:00
Maisam Arif f831cf49f7 Renamed amdsmi_get_metrics_table to amdsmi_get_cpu_metrics_table
Renamed structs to be more conistent with what they are calling

Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I6f2be2fcb76f004aa592f0dad8545565700ccd4b
2024-02-12 16:30:18 -06:00
Maisam Arif 7a19dbbfe6 SWDEV-439217 - Updated amdsmi_get_gpu_asic_info python api
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iafcfb10bec9a9a04574afdd95f10971f537e433b
2024-02-12 15:37:45 -05:00
Deepak Mewar c533085a6a Updated python interface for renaming hsmp metric table elements as per PPR
Change-Id: Ic91ba09af396cc16f3e6461f49f985a034edbe09
2024-02-06 18:46:27 -05:00
Deepak Mewar 6f7273fda5 Added amdsmi cpu family & cpu model
- Updated header and source files
- Updated python interface
- Generated python wrapper for updated header
- Updated the CLI to have cpu family & cpu model
  as part of metric table

Change-Id: Iea440251797270d5d29ffe883b0ad6db790be658
2024-02-06 18:46:27 -05:00
Deepak Mewar 8adf3eba1b Updated python interface for metric table readable timestamp
Change-Id: I251d9ee9c908413e29dd4ce6bf46acb448867a43
2024-02-06 18:46:27 -05:00
Maisam Arif 88192d8b6b SWDEV-436533 - Cache Info Struct Update
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic640fa657cdcc32d7b00ff78fc9452ec7e05dd07
2024-02-05 16:51:04 -05:00
Maisam Arif 59d885a9ca Fixed gpu_metric and cache cli checks
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic71e2b50dfa8fc106a17079842a7564a8e24b69d
2024-02-01 05:47:18 -05:00
Galantsev, Dmitrii fe94be55dc Update wrapper
Change-Id: Ic3ff273851bde9a8986719a668a3135fe7dd715f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-29 17:30:26 -06:00
Deepak Mewar 3aabb927b4 amdsmi README updated for python interface
Change-Id: I92c1e8eb646488a9cdc32d0933f27e5db8c172ef
2024-01-25 02:19:38 -05:00
Maisam Arif 0550c9352c Updated engine_activity api
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I3f62e093fdc0254015c0837dca59763551d3659c
2024-01-24 22:23:48 -05:00
Charis Poag 34bd26c68e Fix metric type error output + re-align with ROCm SMI metrics
Changes:
* [CLI] Provide fix for "/opt/rocm/bin/amd-smi metric
TypeError: '>' not supported between instances of 'str' and 'i"
--> Python API was updated, CLI needed to reflect these changes
* [API] Updated amdsmi.h's with ROCm SMI
--> Incorrectly added mem_bandwidth_acc & mem_max_bandwidth
--> Realigned wrapper with updates
* [Test] Added metrics not shown in gpu_metrics_read.cc

Change-Id: Ia3a172377fd5a582254dd5a46d81dbec7e763cd9
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-01-24 21:23:40 -06:00
Bill(Shuzhou) Liu 0b67c2ccc4 Unified API
amdsmi_get_link_metrics() and amdsmi_get_pcie_info()

Change-Id: Iea060e449813b842236243b772e8809497ce98fe
2024-01-24 18:27:20 -05:00
Maisam Arif 5e25c0771b Fix subvendor_id error handling
Change-Id: Ibb2e8e329233221e72247674b4f2fbaef51baa32
Signed-off-by: Maisam Arif <maisarif@amd.com>
2024-01-24 10:59:14 -06:00
Maisam Arif 94f41f2b70 Corrected AmdSmiCacheTypeNames interface class
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iec9c6097aec460b180a112be2d24293a40bde125
2024-01-24 07:48:30 -06:00
Maisam Arif 53177525bf SWDEV-434348: Corrected Guest Vendor Name values
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iee0d45fc64386f0417a0e30cce05608ca2186990
2024-01-24 07:34:06 -06:00
Maisam Arif 2c87d95ffb Corrected Cache Type Enum
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I7d861d022e2855c35e4a79681f83977cc633d1c6
2024-01-24 07:28:04 -06:00
Maisam Arif fec1173321 SWDEV-440760: Removed specific gpu_metric calls & fixed pcie metrics
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I679ecede4825c119925de3c9140453653f3f84aa
2024-01-24 05:51:36 -06:00
Maisam Arif 1ed5080433 SWDEV-441635: Updated amdsmi_get_utilization_count python API
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I588e1a61e000d9a5f77f0e8c63f4fef1ec76063e
2024-01-24 05:51:36 -06:00
Charis Poag fe86afed8c SWDEV-436533 [CLI/Python API] Align Cache Info BM UI to Host
- [CLI] Refactored cache info to display
cache flags as "cache_properties" names.
Names are displayed as a list of comma-separated
cache type strings. Previously, values
were shown one by one as ENABLED.

ex.
CACHE_PROPERTIES = <a,b,c>

- [JSON] mirrors CLI fields.
No longer display "cache_flags", renamed
field as "cache_properties" dictionary. This
allows users to better understand the
list of names provided.

- [Python API] Updated amdsmi_get_gpu_cache_info
to mirror Host return.

README.md - updated to reflect all changes.

Change-Id: Ife2ef5adcef30058937d1376efb01749e45c02fb
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-01-24 06:21:55 -05:00
Charis Poag 6132074089 Merge rocm-smi/amd-staging into amd-dev 20240119
Change-Id: Ie706473ff92a91b19e95d2d58f64904cad73a89a
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-01-19 03:57:00 -05:00
Deepak Mewar 730a82417e amdsmi wrapper generated for updated amdsmi_get_esmi_err_msg
Change-Id: I2388cd75111774852ae6426071d890bbf2d9d0c9
2024-01-16 11:41:22 -06:00
Deepak Mewar 19451cc508 amdsmi wrapper generated for updated hsmp metric table
Change-Id: I18c795e18d9c95320826cb965f36d3fb5546ea5c
2024-01-16 11:41:22 -06:00
khashaik 18d8087711 amdsmi_cli: Add checks for no gpu devices, cpu and core devices
- Add checks for no gpu devices, cpu and core devices
  - Update units for core energy and cpu energy

Change-Id: Ieea43f1bb7fc303ebbbdf72f1ab22644a28df25c
2024-01-16 11:41:22 -06:00
khashaik 4971466c22 amdsmi_cli: Fix issues for CPU related API's for DIMM
- Fix interface issues for dimm temperature, dimm refresh rate and dimm power consumption

Change-Id: I998209c8314e4d78a842187c5a0b127aea7dbef2
2024-01-16 11:41:22 -06:00
Deepak Mewar 7dcd5a3fd6 amdsmi interface updated to additionally return the freq src
from amdsmi_get_cpu_socket_current_active_freq_limit

Change-Id: I48f1026474115848a30352637415e7a1a52f3481
2024-01-16 11:41:22 -06:00
Deepak Mewar 898c4bc06f amdsmi interface updated for amdsmi_get_metrics_table units
Change-Id: If211292e894df9d832b879252bebf91c17112d14
2024-01-16 11:41:22 -06:00
khashaik 256907989b amdsmi_cli: Fix issues in cpu API "cpu_lclk_dpm_level"
- Fix issues in cpu API "cpu_lclk_dpm_level"
  - Fix issue for invalid core id
  - Update the error message for invalid devices

Change-Id: I71216ff72f89cfe0c86928ae3dce1f88eae91665
2024-01-16 11:41:22 -06:00
Deepak Mewar 1b1591571b amdsmi interface updated for amdsmi_get_metrics_table encodings
Change-Id: Iffed4071d5b2b5645f8118f3fbce26ab258e7882
2024-01-16 11:41:22 -06:00
Deepak Mewar 31dc8d0ee8 amdsmi interface updated for amdsmi_get_metrics_table
Change-Id: I0618dd411caf6d30f74793e937984273f9c5b70e
2024-01-16 11:41:22 -06:00
Deepak Mewar 4ecf25e882 amdsmi wrapper generated for updated amdsmi_get_metrics_table
Change-Id: Id55a5647064998d8f546c806f857a8745afe52ea
2024-01-16 11:41:22 -06:00
Charis Poag 4575990ae7 GPU Usage/activity update
CLI:
Every usage field is notated by "activity"
gfx_usage -> gfx_activity
umc_usage -> umc_activity
vcn_activities -> vcn_activity
jpeg_activities[AID#] -> jpeg_activity

Wrapper: fixed metric output, misalignment
with generator

update_wrapper.sh:
DOCKER_BUILDKIT to 0 (if unset)

API:
amdsmi_get_gpu_metrics_info:
1.3: Removed commenting out avg socket power

Signed-off-by: Charis Poag <Charis.Poag@amd.com>
Change-Id: Id3fcc20aef420c7b7a90ba22fa3bc643b2716333
2024-01-15 23:34:08 -06:00
khashaik cdf31b8d6a amdsmi: Interface: Add units to the cpu related interfaces.
Change-Id: I294439c345a3e4ca399eb6b3f53eb1f18777180a
2023-12-21 00:09:23 -05:00
Naveen Krishna Chatradhi 94d3c563a3 amdsmi: py-interface: Add python interface for esmi api
Change-Id: I4a3ab1168a7d1bf011ecc9c508e111c281503520
2023-12-18 06:31:35 -05:00
Charis Poag 8f3861e1d9 Add vcn and jpeg activity
Changes:
    - Add new engine field vcn_activity (from 1.4/1.5
      gpu_metrics
    - Updated log output to enhance view of gpu_metric
      data as json pretty print
    - Added new fields provided in 1.5
    - Added unit overview in python API, CLI is WIP

Signed-off-by: Charis Poag <Charis.Poag@amd.com>
Change-Id: I7d9f29e7ecc35dcd0697814c222cdd02b0d5518e
2023-12-15 22:18:46 -05:00
Bill(Shuzhou) Liu 59b510de2b Support max_num_cu_shared and num_cache_instance
Add above fields for cache info. Remove driver_date in CLI and
Remove the disable properties of cache.

Change-Id: I80672490908d9e32a149076cc37459fa56b8b0bf
2023-12-14 09:59:35 -05:00
Maisam Arif e9a6153836 Fix imports for partition API's
Change-Id: Ic3bc0230405ee5e662bfd2d5c6d0ed5bca42a671
2023-12-13 23:52:54 -06:00
Maisam Arif f91fc97fed SWDEV-413122 - Initial Monitor subcommand
Change-Id: Iaeaef77efeaa4289b19f1f676dcae6245f0e0c9e
2023-12-13 23:43:43 -06:00
Maisam Arif b54086a037 Change xgmi_physical_id to oam_id
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I35fb36ec0e9f72a7135d8bb9070dbdc0e956b93a
2023-11-22 12:16:38 -06:00