Commit Graph

1558 Commits

Author SHA1 Message Date
Justin Williams eb5f64a5e2 Removed Wheel & Setuptools dependency 2024-12-18 15:35:28 -06:00
Juan Castillo f8b8347627 [SWDEV-496693]GPU Metrics 1.7
Features added:
- [SWDEV-475244] Add new interface to get max memory bandwidth
Updated API: amdsmi_get_gpu_vram_info
Updated: struct amdsmi_vram_info_t to include vram_max_bandwidth
CLI: amd-smi static --vram

- [SWDEV-488349] Add new interface for XGMI link status
New API: amdsmi_get_gpu_xgmi_link_status
CLI: amd-smi xgmi --link-status

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
Change-Id: I1aa35b741136eb4f02f7ea9a95b865886273eb72
2024-12-18 10:57:06 -06:00
Joe Narlo ef31bb7166 SWDEV-504389 [AMD-SMI] Synching Comments in Linux BM
Sync comments from Unified Header to Linux BM

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I9b1ae94db68761a7963ad87cd60177a57e93ad85
2024-12-18 10:57:06 -06:00
Choudhary, Rahul 30e08eed1c Create rocm_ci_caller.yml enabling OSDB for push on mainline 2024-12-17 12:21:17 -08:00
Choudhary, Rahul 6ffe28fb47 Update rocm_ci_caller.yml fixing base ref
base ref to cover both pull and push request
2024-12-17 12:19:06 -08:00
Choudhary, Rahul 2c36a327de Create rocm_ci_caller.yml adding workflow caller for PSDB and OSDB 2024-12-16 22:06:03 -08:00
Choudhary, Rahul c11a7f6eb9 Create codeql.yml
copied from previous repo
2024-12-16 22:03:54 -08:00
Maisam Arif 89ecf1c444 Merge amd-dev into amd-master 20241213
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ie13efc3ba253965dfe32a93438fe39b0509e305f
2024-12-13 12:08:19 -06:00
Maisam Arif 41561ec3a0 Revert "[SWDEV-502001] Fix link for amd_hsmp.h"
This reverts commit a4574a7d13.

Change-Id: I583e3e49989955f03adea84c965503288c0934ee
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
2024-12-13 12:01:26 -06:00
Joe Narlo d0a7332d32 SWDEV-492272 [AMDSMI] Build/Compiler warnings messages
Fix compiler warnings

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I10657b8f3ef18a9b45311e8f6509958297a57823
2024-12-13 00:38:07 -05:00
gabrpham fe290a2056 [SWDEV-484382] Added fclk and socclk to amd-smi metric -c
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ie7e19c757b05455693c0d26eeb5e8b6c1e238375
2024-12-13 00:33:12 -05:00
gabrpham 5f9c2db6f3 [SWDEV-484382] Added new command amd-smi set -c/--clk-level
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: If45152e3a3c94f65b6a8a960601b9ed16fa3d0d7
2024-12-13 00:32:19 -05:00
gabrpham bc16e1a5da [SWDEV-484382] Added new command amd-smi static --clock
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I49e1aa2e699734d81c40c76c62da1cecc5bd3c0e
2024-12-13 00:30:29 -05:00
Maisam Arif aed7749a2c [SWDEV-489060] Added python3-setuptools & python3-wheel for base images
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I222395e656469f67405bc94a86ab7f8fd1ed34a2
2024-12-11 16:40:51 -06:00
Charis Poag 57f45954b7 Fix amd-smi firmware not printing YAML-like dictionary correctly
List string should take into account dictionary value types

Change-Id: Icc08288cb0007d43eacd1aff6d44c40a84ea9448
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-11 10:48:43 -05:00
Justin Williams 2a1e2eed18 [SWDEV-479339/498804] Added AMDSMI Dockerfile
Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Change-Id: Ic7cc6eb6417708cff3f4a33b91a8ef6dcd2b2807
2024-12-10 16:18:42 -05:00
Maisam Arif 554203c13a Fixed spacing in amd-smi --xgmi
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I9fbd20c50a25aa3be80c8aa68eea37b81a74dc67
2024-12-10 15:45:06 -05:00
Charis Poag 7543a058ea [SWDEV-475712] Fix MI2x target_graphics_version
Stopped correcting target_graphics_version by
product name. Instead, detect a target_graphics_version that
needs to be corrected and populate it accordingly.

Change-Id: Ie9240a049313d9338f831ef47be973cd5c228612
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-10 13:43:02 -05:00
Charis Poag bc0015fd36 [SWDEV-488288] Remove GFX_BUSY_ACC from amd-smi metric --usage
Output is not helpful to users.

Change-Id: I12a60e28b8eab2fc3ffca4ea88f03018bf0ef3ce
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-10 13:37:36 -05:00
Charis Poag b911a0606a [SWDEV-495824] AMD SMI reporting CPX partitions incorrectly
Updated changelog to provide options to users on how to fix.

Change-Id: I4fd04b1e65ff9d678b2d13109599f57a03c84d41
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-10 11:20:03 -05:00
Maisam Arif ddcfe28520 [SWDEV-503491] Updated Market Names
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ib56c4c96190e18708ef4d0d6358dd8d5b1ee9e6a
2024-12-09 15:40:06 -05:00
Bindhiya Kanangot Balakrishnan 288b11df37 [SWDEV-496639] Align amd-smi xgmi statistics
The xgmi read and write values were displayed in KB, and the numbers
became unreadable due to misalignment. Converted the read and write
values to readable units using a helper function. Updated Changelog.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
Change-Id: I4c90a1de8a58c29cbdf43fe3480a1546f3946673
2024-12-09 12:57:45 -05:00
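
The unit conversion described in the commit above can be sketched roughly as follows. This is only an illustration, not the helper from the amd-smi code base: the function name `format_kb`, the 1024-based unit steps, and the one-decimal formatting are all assumptions made for this example.

```python
def format_kb(value_kb: float) -> str:
    """Convert a size given in KB into a short human-readable string.

    Hypothetical helper: the name, the 1024 step, and the formatting
    are assumptions, not taken from the amd-smi sources.
    """
    units = ["KB", "MB", "GB", "TB", "PB"]
    value = float(value_kb)
    for unit in units:
        # Stop once the value fits below the next unit, or units run out.
        if value < 1024 or unit == units[-1]:
            return f"{value:.1f} {unit}"
        value /= 1024


# Right-align the converted values so the columns line up.
for label, kb in [("read", 512), ("write", 3 * 1024 * 1024)]:
    print(f"{label:<6}{format_kb(kb):>10}")
```

Keeping every value to a short, fixed-precision string is what makes column alignment practical; raw KB counts vary too much in width.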
Charis Poag d323ecff97 [SWDEV-502744] Fix "amd-smi monitor" shows VCN ENC utilization & clock but not VCN DEC
Reason for this fix:
Navi products use vclk and dclk for both encode and decode.
On MI products, only decode is supported.
Navi products cannot support displaying ENC_UTIL % at this time.

Change-Id: I107bb761794ae4724949ac21c110b23a4f616700
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-12-07 12:11:10 -05:00
gabrpham bd01cfc203 Fixed post reset and ring_hang issues
Issues include:
	SWDEV-480250
	SWDEV-480255
	SWDEV-480248
Known issue:
	`amd-smi event` has threads taking events from the same device
which, in the case of resetting GPUs, makes it seem like some GPUs have
reset multiple times and others have not reset at all.

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ic7dcc214e0366fc1532ece579d915d34d35d5407
2024-12-06 17:46:00 -05:00
Bindhiya Kanangot Balakrishnan 1586005a5b [SWDEV-457845] Error code unification for amd-smi set
Previously, amd-smi set returned different outputs on Linux
and Windows; on Linux it returned ValueError. As part of
error code unification, this output message was corrected.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
Change-Id: Iba9ddd9c5b2bed0456f303e4373f6771c93608be
2024-12-06 14:21:31 -05:00
Justin Williams 2c24cab86c [SWDEV-502001] Added amd_hsmp.h locally
Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Change-Id: I28e48913743f86fb5fc9082307ec326830d55960
2024-12-05 17:02:48 -05:00
Maisam Arif bc3ac61641 Added gpu_metrics table debug logs in monitor
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I8aa96629a65df7a2d52ef9ed42a884732d097a54
2024-12-05 15:18:13 -06:00
Joe Narlo 547db10384 SWDEV-502330 [AMD-SMI][Unified Header] Convert struct to typedef struct
Change struct to a typedef struct

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I6f3b22a5219c0db0aab2c308b71213ae75334476
2024-12-04 09:14:05 -05:00
Justin Williams 2370aa1b40 [SWDEV-469278] Removed PyYAML Dependency
Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Change-Id: Idec32cfb0de84cc255b506d7f972e2750992745e
2024-12-03 15:40:44 -05:00
Bindhiya Kanangot Balakrishnan bc77330a74 [SWDEV-499030] Fix truncated FRU_ID
The FRU_ID was truncated because the string copied from sysfs
was limited to 32 characters. This limit has been increased to
AMDSMI_MAX_STRING_LENGTH to accommodate longer FRU_IDs. Also
updated the deprecated string length macros.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
Change-Id: I8becaf9f37609b2e5aecdf92b6ae60f4419ad8ef
2024-12-03 13:43:53 -06:00
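
The truncation described in the commit above can be illustrated with a small sketch. It mimics a bounded string copy in Python rather than reproducing the library's C code. The helper `copy_bounded`, the sample FRU_ID, and the assumption that one byte is reserved for a terminating NUL are all hypothetical; the limits 32 and 256 come from the commit messages in this log.

```python
AMDSMI_MAX_STRING_LENGTH = 256  # value given in the "Consistent string lengths" commit
OLD_LIMIT = 32  # previous limit named in the commit message


def copy_bounded(text: str, buffer_size: int) -> str:
    """Mimic a C-style bounded copy: keep at most buffer_size - 1 bytes.

    Reserving one byte for a terminating NUL is an assumption here.
    """
    raw = text.encode("ascii")
    return raw[: buffer_size - 1].decode("ascii")


# Hypothetical 40-character FRU_ID, invented for this example.
fru_id = "FRU-" + "0123456789" * 3 + "-ABCDE"

print(copy_bounded(fru_id, OLD_LIMIT))                 # cut short by the old limit
print(copy_bounded(fru_id, AMDSMI_MAX_STRING_LENGTH))  # fits in the larger buffer
```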
Bindhiya Kanangot Balakrishnan fc7e1ddb4a [SWDEV-498507] Tool amd-smi could be more case insensitive
Modified amdsmi_cli to accept case-insensitive arguments if
the argument does not start with a single dash (-).

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
Change-Id: I1b6320db0afaad0900d5a2049206002c3899fa71
2024-12-02 18:09:45 -05:00
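
One way to read the rule in the commit above is sketched below. It is not the amdsmi_cli implementation: the function `normalize_args` is hypothetical, and it assumes that "does not start with a single dash" means short options such as `-c` stay case-sensitive while subcommands and double-dash options are lowercased.

```python
from typing import List


def normalize_args(argv: List[str]) -> List[str]:
    """Lowercase every token except short options such as "-c".

    Hypothetical sketch; the real CLI may treat option values differently.
    """
    normalized = []
    for token in argv:
        # A short option starts with exactly one dash, e.g. "-c" but not "--clock".
        is_short_option = token.startswith("-") and not token.startswith("--")
        normalized.append(token if is_short_option else token.lower())
    return normalized


print(normalize_args(["STATIC", "--VRAM", "-C"]))  # ['static', '--vram', '-C']
```

Leaving short options untouched matters when a tool gives `-c` and `-C` different meanings. A real parser would also need to avoid lowercasing option values such as file paths, which this sketch does not attempt.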
Maisam Arif a4574a7d13 [SWDEV-502001] Fix link for amd_hsmp.h
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I402ee539cdd4c896acd7ccc83f3090c3a5eeba12
(cherry picked from commit 664ade7354)
2024-12-02 17:33:07 -05:00
Maisam Arif 664ade7354 [SWDEV-502001] Fix link for amd_hsmp.h
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I402ee539cdd4c896acd7ccc83f3090c3a5eeba12
2024-12-02 16:30:06 -06:00
Charis Poag ed29393037 Merge amd-dev into amd-master 20241125
Change-Id: I8f8e03b2b68bc7b1afb0010cbd4ad722cbffcb41
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-11-25 08:44:36 -06:00
Charis Poag 7d061f9ae4 [SWDEV-499029] Fix unable to change memory partition modes
Changes:
  * [API] Removed checking board name, fixes for other MI ASICs
  * [API] Fixed inability to restart AMD GPU; libdrm blocked
    this operation
  * [API] Added ability to unload/reload libdrm
    from within AMD SMI APIs
  * [CLI] Increased progress bar to change memory partition modes
    to 140 seconds, since driver reload is variable per system

Change-Id: I52f227f2ab850c4a6332ff3ecdc899903b1080f1
Signed-off-by: Charis Poag <Charis.Poag@amd.com>
2024-11-25 09:28:02 -05:00
Joe Narlo 35d8e827b9 SWDEV-497305 [AMDSMI] Consistent string lengths
Unify max string length to AMDSMI_MAX_STRING_LENGTH 256
Replace AMDSMI_NORMAL_STRING_LENGTH, AMDSMI_256_LENGTH

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: Ia81d738be0eefb9683ee53d51c969598fe587f50
2024-11-22 15:37:24 -05:00
Joe Narlo 3052ad4220 SWDEV-495787 [AMDSMI] Different license headers
Change copyrights to MIT and remove date

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I16f5b412f2b9ddefaaa1771aa714cc18829a1be4
2024-11-22 08:55:28 -05:00
gabrpham 50eaf14b9e [SWDEV-498453] Enabled 'amd-smi set --clk-limit' for virtual environments
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I23e994502d4abc1a602d2341e77ad9c50fcf4839
2024-11-19 16:17:29 -06:00
gabrpham fc9d18dd3e [SWDEV-498453] Enabled for virtual environments
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ic7b274cf8e579b733515efe84fc0f325256ef8b1
2024-11-18 11:57:04 -05:00
Maisam Arif ed58196e35 Revert "[SWDEV-446215] Update cmake to put test libs in proper lib dir"
This reverts commit 6e01df00ca.

Reason for revert: Incorrect Path

Change-Id: I88bb304cfab997460a916e1a130fdb75435c648b
2024-11-18 11:15:22 -05:00
Galantsev, Dmitrii 76a92ca691 Merge amd-dev into amd-master 20241115
Change-Id: I11ef3a88e26aa5d83347202ea329b50c87ed7883
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-11-15 16:50:17 -06:00
Adam Pryor b7789d4699 Revert "[SWDEV-446215] Update cmake to put test libs in proper lib dir"
This reverts commit 6e01df00ca.

Reason for revert: The gtest of amdsmi differs from that of other components, so it was installed in a share/amdsmi/lib folder. It cannot be installed in a common folder such as /usr/local/bin or /usr/bin because all other components search those folders first.

This is breaking ROCmValidationSuite and other tools. Per Wang, Yanyao this should be reverted.

Change-Id: Id61bc6056fe41800e738616f39293e9b8762a377
2024-11-15 15:08:12 -05:00
Maisam Arif f1c3fbf226 Updated CLI exceptions
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I5c68eed7719c093727afa434e25ba2560dde894a
2024-11-15 11:44:51 -05:00
Zhang Ava b3b421407f Merge amd-dev into amd-master 20241115
Signed-off-by: Zhang Ava <niandong.zhang@amd.com>
Change-Id: I355cb3b84641ac58d98b2bc1a0e75e3e74570ca7
2024-11-15 09:04:59 +08:00
Maisam Arif afd06950c1 Revert "SWDEV-489696 [AMD SMI] Update python integration test"
This reverts commit 06e7bf8a98.

Reason for revert: Changes needed

Change-Id: I96cc956a2f1c73a2828c70ec9aa22931ba570d8f
2024-11-14 18:54:48 -05:00
Joe Narlo 06e7bf8a98 SWDEV-489696 [AMD SMI] Update python integration test
Initial update

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I7c5777159f591f8b402168576b14ef8c1157e8d9
2024-11-14 17:52:01 -05:00
Maisam Arif dfcf5b4ae5 Corrected pyyaml debian package name
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ice1541b4c1fc2297ee8bef5a7c7336c93267e01a
2024-11-14 14:42:50 -06:00
Maisam Arif 67b269d112 Merge amd-dev into amd-master 20241113
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Iefd0cca227d24ba87769a65584240cf55754cf20
2024-11-13 11:49:43 -06:00
Justin Williams d3d6157854 [SWDEV-492047] Removed setup.cfg.in
Signed-off-by: Justin Williams <Justin.Williams@amd.com>
Change-Id: I97b14d05b17fefbb87368824f57bc4ab690f1bf0
2024-11-13 12:45:09 -05:00
Peter Park cbfe403b1d remove duplicated changelog
black format docs/conf.py
add seealso to python api reference

Change-Id: I60fa754f0af662669282dc90eea4b7dc5c5030cc
Signed-off-by: Peter Park <peter.park@amd.com>
2024-11-13 11:46:47 -05:00