Commit Graph

95 Commits

Author SHA1 Message Date
Joe Narlo bad2cc9c23 SWDEV-495787 [AMDSMI] Different license headers
Change copyrights to MIT and remove date

Signed-off-by: Joe Narlo <Joseph.Narlo@amd.com>
Change-Id: I16f5b412f2b9ddefaaa1771aa714cc18829a1be4


[ROCm/amdsmi commit: 3052ad4220]
2024-11-22 08:55:28 -05:00
gabrpham f7a77c2539 [SWDEV-498453] Enabled for virtual environments
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ic7b274cf8e579b733515efe84fc0f325256ef8b1


[ROCm/amdsmi commit: fc9d18dd3e]
2024-11-18 11:57:04 -05:00
Charis Poag f01eea6077 [SWDEV-488276/SWDEV-497613] Update memory partition set functionality
Changes:
  - [CLI] Added warning screen to AMD SMI users
    setting memory partition
  - [CLI] Added a progress bar for CLI set operations, displayed for up to 40 seconds
  - [API] Updated to wait until the driver reloads with SYSFS files active
  - [CLI] Users can now set or reset without providing -g all:
    amd-smi set -g all <set arguments>
    or amd-smi reset -g all <set arguments>
    can now be called directly as -> sudo amd-smi set <set arguments>
    or sudo amd-smi reset <set arguments>
  - [SWDEV-475712][CLI/API] Fixed target_graphics_version field
    not properly displaying for older MI or Navi ASICs.
  - [All APIs] Added a catch for the driver reporting invalid arguments;
    these APIs now show AMDSMI_STATUS_INVAL
    (e.g. changing to NPS8 if the device does not support it)
  - [Install] Modified paths for Python install commands to support
    multi-ROCm installs

Change-Id: Id11f25d68a82d23c6b2d77ccb30b51e860dd0ca7
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 3ea4a42a6e]
2024-11-12 16:50:32 -04:00
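The [API] item above (waiting until the driver reloads with SYSFS files active) can be pictured as a bounded polling loop. This is only a minimal sketch, not the amdsmi implementation: the helper name `wait_for_paths`, its parameters, and the choice to poll for file existence are all assumptions made for illustration. The 40-second default simply mirrors the progress bar duration mentioned in the commit message.

```python
import os
import time
from typing import Iterable


def wait_for_paths(paths: Iterable[str],
                   timeout_s: float = 40.0,
                   poll_interval_s: float = 0.5) -> bool:
    """Hypothetical helper: poll until every path exists or the timeout expires.

    Returns True if all paths exist before the deadline, False otherwise.
    """
    wanted = list(paths)
    # time.monotonic() is unaffected by system clock adjustments.
    deadline = time.monotonic() + timeout_s
    while True:
        if all(os.path.exists(p) for p in wanted):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_interval_s)
```

A caller would pass the sysfs files it expects to reappear after the driver reload and treat a False result as a timeout.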
gabrpham 194c33852f [SWDEV-492739] Partial fix for sclk min/max out of bounds
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I1f0230955c890c11a735c8cb352c8a9ee4cebe27


[ROCm/amdsmi commit: 2273d95a6c]
2024-11-11 20:45:37 -05:00
gabrpham f9bfce707d updated cli tool examples doc to reflect current CLI
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Iab78a412464ba6d7919aeb7da04a031b063a7d09


[ROCm/amdsmi commit: 0f067488e1]
2024-11-11 17:12:40 -05:00
Maisam Arif 43efe1c39a Updated parser help text
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I8cc65edb1e629a55e0efbfc1109b1c549ed81101


[ROCm/amdsmi commit: 7932de967a]
2024-11-11 15:07:21 -06:00
Maisam Arif e9f43bc3dd Added ras and ecc counting back to Linux VMs
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ie981f7fe8f481f2137e95dda2e200d00ab4d92c8


[ROCm/amdsmi commit: abee26d4ab]
2024-11-08 11:05:15 -06:00
gabrpham 072e67c9c3 [SWDEV-490187] Reset GPU partition support was removed
Reset GPU partition support for both compute and memory was removed.

Code changes related to the following:
  * amdsmi_reset_gpu_compute_partition()
  * amdsmi_reset_gpu_memory_partition()
  * CLI

Change-Id: I372589074b4da172bedd39223edde18939e373ae
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>


[ROCm/amdsmi commit: f5b7761ac7]
2024-10-18 16:22:26 -05:00
gabrpham d0ad17d9d5 [SWDEV-488846] Removed '--ecc' option from 'amd-smi monitor' when platform is VM
Change-Id: I8f5d7771cbfac3fe5f52dbccbd9f28020adb5f6f


[ROCm/amdsmi commit: 27b5a35d65]
2024-10-16 10:34:19 -04:00
gabrpham 4b461904b2 [SWDEV-486872] Removed '--ras' from static command when platform is VM
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I0b03f168d7011428cfea3ab303865f4eaeea78ac


[ROCm/amdsmi commit: eb9116e8c2]
2024-10-16 09:29:24 -05:00
Maisam Arif 5e3d644769 Corrected clean local data partition indexing
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ib0eeb065f160fccd3c3f4a2d13f0869af01a74ae


[ROCm/amdsmi commit: 27a48e69d8]
2024-10-10 10:54:45 -05:00
Maisam Arif 0368ce662d [SWDEV-447451] Fix attribute error for set/reset on Linux Guest
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I5d55bef44d2eea75c33ba489a57544976900c4a4


[ROCm/amdsmi commit: 4fcf281f1d]
2024-10-09 12:59:19 -05:00
gabrpham 5ca4c2e976 Added amd-smi partition as preliminary command.
The new command includes the following arguments:
  - current - display the current partition information for the selected
    gpu(s)
  - memory - display memory partition information for the selected
    gpu(s)
  - accelerator - display accelerator partition information for the
    selected gpu(s)
Additional functionality will be added as more partition APIs are added.

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: Ica86160139002ef5213d6d4b0e390670aeef01c8


[ROCm/amdsmi commit: 4e2fc2d604]
2024-09-27 17:05:04 -05:00
Charis Poag 7a35c805b0 [SWDEV-422195/SWDEV-440985] GPU metrics 1.6
Changes:
    - Added new GPU metrics:
      1) Violation status (e.g. PVIOL/TVIOL) accumulators
      2) XCP (Graphics Compute Partitions) statistics
      3) PCIe other end recovery counter
    - CLI/API/tests changes were made accordingly

Change-Id: I589b9b1f570f25dda12d95bb501feca85da8b3bb
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 3a4abbd8c0]
2024-09-27 12:04:21 -05:00
Maisam Arif 82096d7f74 Moved KFD information to separate structure and API
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: If6eaea589edc704cf408d6391b5f2154134035e7


[ROCm/amdsmi commit: 3b7f661e71]
2024-09-20 03:48:42 -04:00
Charis Poag b6a68dd877 Fix amd-smi CLI calls returning TypeError
$ amd-smi version
TypeError: unsupported operand type(s) for |: 'type' and 'type'

---------------
Python 3 versions lower than 3.10
do not support str | None

Using typing Optional and Union, we can create equivalent logic for
str | None
and
str | list | None

Change-Id: I1f4a7ab67333914b33639dc62652881e1127411e
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 5c778cadf1]
2024-09-18 16:59:12 -05:00
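The fix described in the commit above can be illustrated with a short sketch. On Python versions before 3.10, an annotation such as `str | None` is evaluated when the function is defined and raises the TypeError quoted in the message; `typing.Optional` and `typing.Union` express the same types on older interpreters. The function `format_gpu_selection` is a hypothetical example and is not taken from the amd-smi sources.

```python
from typing import List, Optional, Union


# Equivalent to "str | list | None", but valid on Python versions before 3.10.
GpuSelection = Optional[Union[str, List[str]]]


def format_gpu_selection(gpu: GpuSelection = None) -> str:
    """Hypothetical helper: turn a GPU selection into a display string."""
    if gpu is None:
        return "all"
    if isinstance(gpu, str):
        return gpu
    return ",".join(gpu)
```

`Optional[str]` is the direct replacement for `str | None`; the `Union` form covers the case with more than one non-None type.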
gabrpham fbb1071046 [SWDEV-448738] Added rocmsmi extremum command as 'set -L'
Change-Id: I997c630bd20cc61673813a2301eb5e3002619a32
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>

Change-Id: Ifa884303f9a0fa058af093a23f5be449bba54f29


[ROCm/amdsmi commit: b7f779182d]
2024-09-18 14:51:01 -04:00
gabrpham d04eadec17 Removed _validate_positive function and replaced with _positive_int or _not_negative_int as appropriate
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: I01effcdf9bae31fd8bc926c5d4bdf58274838618


[ROCm/amdsmi commit: 0d4b332fe4]
2024-09-17 18:37:16 -04:00
Maisam Arif c2b9cdfd2e Updated License Dates
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I8ca199c129c06508bc3e23745ab5ac2d20dce928


[ROCm/amdsmi commit: 105db1afcd]
2024-09-16 16:14:47 -04:00
Tim Huang 202ddc01aa [SWDEV-463402] - Support retrieving connection type and P2P capabilities between two GPUs
1. Add an API interface, amdsmi_topo_get_p2p_status, to retrieve
the connection type and P2P capabilities between 2 GPUs.

2. Add a P2P status test in hw_topology_read
to print P2P capability information.

3. Add the following tables to the CLI topology subcommands:
  - CACHE COHERANCY TABLE
  - ATOMICS TABLE
  - DMA TABLE
  - BI-DIRECTIONAL TABLE

Change-Id: I199173030d4170115cea27c472958a4826e4e1bf
Signed-off-by: Tim Huang <tim.huang@amd.com>


[ROCm/amdsmi commit: 260edaa752]
2024-09-06 09:42:34 -04:00
gabrpham 614c89889c [SWDEV-450553] Added gpu memory overdrive to metric function
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>
Change-Id: If7bd6865d641a5a83c594a4d3c57938b1b6dc18e


[ROCm/amdsmi commit: 7d8e54d0e1]
2024-09-04 12:54:14 -04:00
Maisam Arif 1eaf218f2f Removed metric --ecc & --ecc-blocks commands from VM
ecc is not supported on VM
	Added static --ras because ras features are still detectable

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ied4132b863989dfd67897e00904f04d140fd2773


[ROCm/amdsmi commit: 210680b570]
2024-08-09 18:44:44 -04:00
Maisam Arif 7130dc9f04 Fixed handling in GPU/CPU/CORE select functions
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I83d78a8d6cdcbd54e5c79330be577b3a06a00985


[ROCm/amdsmi commit: 574712386f]
2024-08-05 18:27:30 -04:00
gabrpham 6975697b56 Fixed cli issue with empty cpu/core parameter
Change-Id: Id0fee74357a56baaec59ca5359eb00a65cfd6185
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: 0143041262]
2024-08-05 16:37:36 -05:00
gabrpham 5b44997728 [SWDEV-439701] Additional GPU error handling
Change-Id: Ieb35e9712f2a78acef8961d865dba1d824969ef3
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: de8145387d]
2024-07-30 16:19:10 -05:00
Maisam Arif a7ea536524 [SWDEV-439701] Add missing parameter CLI error cases
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I8a6623ccdd7b16be320ea0f6f48b7b6cb3511983


[ROCm/amdsmi commit: 0fc779b14f]
2024-07-18 14:24:00 -04:00
Maisam Arif 8462635270 Added dmon alias for 'amd-smi monitor'
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I4a787034bd7ab1a0d08d8cfdd038add5c35cdea4


[ROCm/amdsmi commit: 6e5c4b422a]
2024-07-02 11:05:59 -04:00
Maisam Arif 8c87907db9 Updated error code naming to be in line with Host
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I6c66640742a25361ad6399763ee442598ffb0ac6


[ROCm/amdsmi commit: 27cd092c6d]
2024-06-28 08:58:52 -05:00
Maisam Arif a449323329 SWDEV-457854 - Unified BM error codes
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I5b232de3b598bd3146eb0528f61c628da93278d9


[ROCm/amdsmi commit: a3758f82dc]
2024-06-25 18:33:40 -05:00
Maisam Arif 3ef31b3ed0 SWDEV-435197 - Add process table to CLI monitor subcommand
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Ibe06f4a4be619ae9cba909c2474b0e482eeb87d5


[ROCm/amdsmi commit: 92f014059e]
2024-06-19 23:36:55 -05:00
Bill(Shuzhou) Liu f86ba0a7c4 Change the clean shader API to clean local data
To align with the unified API.

Change-Id: I2819339fba6f528204cebd3e9605109e82cbc5b4


[ROCm/amdsmi commit: e3c63628e5]
2024-06-17 16:23:33 -05:00
Maisam Arif 807ca0ad89 SWDEV-466598 - Fixed CLI process outputs
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I902e82b6e78311e99542b109435346889daa49fc


[ROCm/amdsmi commit: 9fb2c967de]
2024-06-08 18:31:08 -05:00
Maisam Arif e154b3c89b Removed Throttle Status from CLI Tool
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I8eb8f30f821589003201d6d8bb96592ec5f8a483


[ROCm/amdsmi commit: 37c044696d]
2024-06-07 15:19:48 -05:00
Bill(Shuzhou) Liu b517f3c214 Change the name of clear sram to run cleaner shader
The function cleans the local data in LDS/GPRs. The name "clear sram"
is misleading.

Change-Id: I0385e6d6348602fe0f347d17e48ed8983f7ceb87


[ROCm/amdsmi commit: 4cf59c4edb]
2024-06-05 12:07:39 -05:00
Maisam Arif 7dfe4276cc Use different sysfs for soc_pstate and xgmi_plpd
The sysfs is changed to use the pm_policy folder with multiple
dpm_policy files.

Change-Id: I40fac8de2d0cb127950d238b8196f6d2416778d0


[ROCm/amdsmi commit: e5d1ba4621]
2024-05-31 01:38:41 -04:00
Maisam Arif 486db45f90 Moved --clear-sram-data to 'amd-smi reset'
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I46eaf7f887b15d6a8d8a31155bb3e448ef0ec04a


[ROCm/amdsmi commit: 927b9c644b]
2024-05-30 02:26:40 -05:00
Maisam Arif 4a3fb205dc Add Process Isolation and Clear SRAM to VM
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I7776e5b10efb2eea798e3e3d523ec5c01a162dc3


[ROCm/amdsmi commit: 3855fb2939]
2024-05-23 15:33:27 -04:00
Maisam Arif 48131e9481 Revert "SWDEV-458102 - Deprecated Voltage Curve API"
This reverts commit cc8eb17759.

Change-Id: I8a3eaf0a9f28200e09fb35d5260fbc070fe8a4a9


[ROCm/amdsmi commit: 11c72946eb]
2024-05-02 15:27:16 -05:00
Maisam Arif cc8eb17759 SWDEV-458102 - Deprecated Voltage Curve API
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I111c3ce26d2ab66d5e755432f4b8a9bfa631f805


[ROCm/amdsmi commit: 1423fb632e]
2024-05-02 02:53:29 -04:00
Maisam Arif 83d5f67750 Updated monitor --pcie to use gpu_metrics pcie bandwidth
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Id37aebc0297317edcd0f459a4817f56a6030d902


[ROCm/amdsmi commit: 25ef420407]
2024-04-25 20:10:02 -04:00
Bill(Shuzhou) Liu 5b0f4638c4 Process isolation and clean shader
A few APIs and command line options are added to support process
isolation and clean shader.

Change-Id: I98ad3fc9fc7429799a21798b7fca1c307de7f403


[ROCm/amdsmi commit: 7d2ab7970d]
2024-04-24 13:22:20 -04:00
Bill(Shuzhou) Liu b9b958b82c Get and set the XGMI PLPD
Update the API and CLI to support XGMI Per-Link Power Down Policy.

Change-Id: Iaf04a771eb8bb0829a5b3088d803a7355a8dfd0b


[ROCm/amdsmi commit: e4085c6414]
2024-03-26 01:48:14 -05:00
Bill(Shuzhou) Liu 46ab68f840 Set and get DPM policy for GPU device
Add new APIs to set and get dpm policy for the GPU device.

Change-Id: I26fa49cd17d0ce66bda3446c38945a6cf35717ff


[ROCm/amdsmi commit: 108e6d4ae6]
2024-03-12 10:32:31 -04:00
Maisam Arif 9326d1de19 Enabled ecc-blocks argument to linux VM
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I310c227ffa3ef45688a49cdedb43844aafe86339


[ROCm/amdsmi commit: dea4fac979]
2024-03-11 15:23:04 -04:00
Maisam Arif 45c9118db0 Updated README and removed cpu core option from Static subparser
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I039c0f0ed2f7094aafe8849baea3cec887b7e8ff


[ROCm/amdsmi commit: fa7a2838d8]
2024-02-23 00:41:17 -06:00
Maisam Arif 60a86065c0 SWDEV-436792 - Add XGMI Table
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ia7a43b2b6d01fd32ece00cc26c28ba3088f3aa9e


[ROCm/amdsmi commit: 4ca326d824]
2024-02-22 23:10:57 -06:00
Maisam Arif 8c4518eb66 SWDEV-445664 - Aligned Metric Command with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I905ee72272bb4c5ccde3e237d2663ec6e0e55034


[ROCm/amdsmi commit: 542bfc0c77]
2024-02-22 07:15:17 -05:00
Maisam Arif 2c3537e389 Refactor ESMI Initialization and Argument Parsing
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iefab3a8110e0d3c525ee0cef1bdef9101550e9de


[ROCm/amdsmi commit: f58613561c]
2024-02-21 19:02:14 -05:00
Maisam Arif 02a42f3d4b Added Monitor command to Guest Linux
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I378a1fcf49d7a69b09b6c93d77a4b084144a5633


[ROCm/amdsmi commit: dd18c117e6]
2024-02-15 09:45:42 -06:00
Maisam Arif 05210b2c16 SWDEV-440462: Fixed metric functionality to Linux Guest
Change-Id: Ia69d01251d1e9bb3717bda3a7d0f752c739393a6
Signed-off-by: Maisam Arif <maisarif@amd.com>


[ROCm/amdsmi commit: 6292ac513c]
2024-01-21 02:46:15 -06:00