2
0
Gráfico de cometimentos

1636 Cometimentos

Autor(a) SHA1 Mensagem Data
Kanangot Balakrishnan, Bindhiya a59cd4c25e [SWDEV-537852] Update process name help text (#517)
* [SWDEV-537852] Update process name help text

Currently process name displays N/A if that need elevated
permissions. Updated the default amd-smi, process and monitor
commands help texts to display elevated permission requirement.

---------

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: ce230efaaa]
2025-07-07 11:26:10 -05:00
Poag, Charis 92f926b43b [SWDEV-533305] Remove partition info from amd-smi static (-p/--partition still available) + CLI API call cleanup (#529)
Updates:
- Separate extra APIs calls from amd-smi CLI to target specific CLI commands that need them.
- Remove extra current_compute_partition SYSFS calls from amd-smi static.
- Remove the partition information from the default `amd-smi static` CLI command.
- Users must now use the `-p` argument to view partition information with `amd-smi static`.
- The help text for the `partition` argument has been updated to reflect this change.
- The partition information can still be accessed using the `amd-smi partition -c -m` or `sudo amd-smi partition -a` commands.

---------

Signed-off-by: Charis Poag <Charis.Poag@amd.com>

[ROCm/amdsmi commit: 88473b7fd0]
2025-07-07 11:21:46 -05:00
Park, Peter aecda316c4 Fix links in docs (#532)
* fix links in amdsmi_cli/README.md
* fix xrefs to install docs
* rm rocm-smi examples and add cli tutorial
* rm disclaimer and add amd smi contributing guidelines to index

Signed-off-by: Peter Park <Peter.Park@amd.com>

[ROCm/amdsmi commit: 8039ab9449]
2025-07-07 11:18:40 -05:00
Narlo, Joseph 540ecd41bd [SWDEV-541675] Remove Unnecessary API from amdsmi.h (#530)
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>

[ROCm/amdsmi commit: 2cf6272b53]
2025-07-07 11:14:27 -05:00
Saeed, Oosman 1c60502d5f [SWDEV-538308] CPER CLI 20 limit bug (#499)
The bug was reproduced like this.

In terminal #1, run command:
sudo amd-smi ras --cper --gpu 6 --severity all --folder /tmp/cper_dump --follow 

In terminal #2, inject errors:
while true; do sudo amdgpuras -b 7 -s 1 -m 6 -t 2; sleep 2; done

The terminal #1 starts dumping cper entry information that it captures. After 20 entries have been captured, open terminal #3 and run same command as terminal #1:
sudo amd-smi ras --cper --gpu 6 --severity all --folder /tmp/cper_dump --follow 

From terminal #3, there will be no output, even when terminal #1 continues capturing and printing information.

The fix:

Since we already have more than 20 CPER entries available in the GPU buffer, when we run the command from terminal #3 to start capturing from the beginning and pass 20 buffers to copy entries to, the C++ API returns a code saying there is more data available.

The Python CLI should not treat this as an error, but should continue to print what the API returned.

---------

Signed-off-by: Oosman Saeed <oossaeed@amd.com>

[ROCm/amdsmi commit: 5b95d227bc]
2025-07-07 11:11:13 -05:00
Cheruvally, Aravindan d25c73783e [SWDEV-530465] Update share/doc/<pkgnm> License Folder (#516)
Update share/doc/ folder for license/docs to reflect correct package name.
Signed-off-by: Cheruvally, Aravindan <Aravindan.Cheruvally@amd.com>

[ROCm/amdsmi commit: f559075a81]
2025-07-03 02:07:54 -05:00
gabrpham_amdeng e9f7fe2842 [SWDEV-539451] Adjusted reset command to prevent reset on partitions
Signed-off-by: gabrpham_amdeng <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: a2885d6e70]
2025-07-03 01:11:46 -05:00
Kanangot Balakrishnan, Bindhiya f1db852cba [SWDEV-530646] Update changelog for topology optimization (#523)
Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>

[ROCm/amdsmi commit: 80f7045f61]
2025-06-30 17:36:14 -05:00
Justin Williams 490a186dfc CI - Changed CI Runners
Signed-off-by: Justin Williams <Justin.Williams@amd.com>


[ROCm/amdsmi commit: abd3bf2dcf]
2025-06-30 14:23:43 -05:00
Bindhiya Kanangot Balakrishnan 38f59e353a [SWDEV-540014] Correct topology link_type check
Topology numa_bw checks for non-xgmi links to set as N/A.
The recent change in link_type enum mapping caused this
condition to check for PCIE instead of XGMI. Corrected
the same.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>


[ROCm/amdsmi commit: fa9ca21520]
2025-06-30 14:01:19 -05:00
Jeremy Newton 7b9ef0e406 Don't install asan docs if disabled
Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>


[ROCm/amdsmi commit: 529c6ee151]
2025-06-30 12:05:29 -05:00
Williams, Justin 21f5755794 Fixed NoDRM Failures
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Williams, Justin <Justin.Williams@amd.com>

[ROCm/amdsmi commit: 738627af29]
2025-06-25 13:18:25 -05:00
Justin Williams 010f95bfb7 Fixed NoDRM Failures
Signed-off-by: Justin Williams <Justin.Williams@amd.com>


[ROCm/amdsmi commit: bad4868f39]
2025-06-25 13:18:25 -05:00
josnarlo 3f6b0bb1c7 [SWDEV-539912] Add Skipping to Unit Tests
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: 5858d643f3]
2025-06-24 12:01:32 -05:00
Bindhiya Kanangot Balakrishnan 371b349f6c [SWDEV-530646] Reduce amdsmi_topo_get_p2p_status calls in topology
The topology method calls amdsmi_topo_get_p2p_status repeatedly
for the same GPU pairs across different table sections,
significantly impacting performance with 60+ GPUs. Reduced this
by implemeting result caching.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>


[ROCm/amdsmi commit: c3453f7c97]
2025-06-24 11:27:28 -05:00
Maisam Arif cd057e446f [SWDEV-533390] Removed kfd_ioctl.h from being copied on install
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I03cb03b5f034e822c8f3c2d1e11e8b4e57251905


[ROCm/amdsmi commit: 2d2e5fe692]
2025-06-20 14:32:16 -05:00
josnarlo 4c0c050962 [SWDEV-539591] Allow integration tests to skip Not Supported APIs
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: d8b8dc4116]
2025-06-20 14:19:56 -05:00
Galantsev, Dmitrii 44986cfbd4 DRM - Remove FD usage
Change-Id: I77dfa778ccd0d39a03289c2e11cf10357566ff16
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 9b5bbf555a]
2025-06-20 11:00:42 -05:00
Galantsev, Dmitrii 40228106b5 DRM - Remove caching
Change-Id: I21716cc953462e385e981024f75a9a7c2d76a466
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 202b46d96f]
2025-06-20 11:00:42 -05:00
Galantsev, Dmitrii ccdd52e9c0 DRM - Update to latest public
Change-Id: I9f7b46acbae654c377702a599c4b094fd621f101
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: cb2f152205]
2025-06-20 11:00:42 -05:00
Maisam Arif bc0c47c515 Fix subsystem_id str comparision
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Icbe2440884458b63b42cb653009e7df36eb31e0f


[ROCm/amdsmi commit: 28a7f536f9]
2025-06-19 17:21:17 -05:00
Narlo, Joseph c5e604f357 [SWDEV-489696] Improve AMD SMI Python APIs Functional and Unit Testing (#468)
* Adding python unit tests
* Remove duplicate functions definitions
* Added missing classes for __init__ for py-interface

---------

Signed-off-by: josnarlo <Joseph.Narlo@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 7c0802889b]
2025-06-19 16:38:34 -05:00
Arif, Maisam 6123abe733 [SWDEV-538786] Fix ecc counts returning file error (#494)
Change-Id: I5cea584289df95e89b6151d549bf69e4c3e50d22

Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 967e879861]
2025-06-19 15:24:03 -05:00
Castillo, Juan 4a55abaa05 [SWDEV-531904] - Added GPU Cache Read Tests (#464)
New:
- gpu_cache_read.h and gpu_cache_read.cc
- Test reads GPU cache info and asserts valid structure
Updated:
- integration_test.py
- Added test_gpu_cache_info() and asserts valid structure
- test_get_gpu_compute_partition() to loop through all devices when test fail/pass
Added:
- test_get_gpu_compute_partition_returns_string() to integration_test.py
- This test displays the current compute partition for each bdf

---------

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
Signed-off-by: Castillo, Juan <Juan.Castillo@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 470c62f887]
2025-06-19 15:23:34 -05:00
Narlo, Joseph f543f77e30 [SWDEV-537038] amd-smi-lib build failing Fix for integration_test.py (#496)
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>
Signed-off-by: Narlo, Joseph <Joseph.Narlo@amd.com>

[ROCm/amdsmi commit: 57a749f457]
2025-06-19 15:12:31 -05:00
Pham, Gabriel aa95feee60 [SWDEV-531386] Changed source of metric GFX and MEM min and max clk to pp_od_clk_voltage (#453)
* Made corrections to reading of pp_od_clk_voltage
* Added fall back to pp_dpm files if pp_od_clk_voltage doesn't exist

---------

Signed-off-by: gabrpham_amdeng <Gabriel.Pham@amd.com>
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>

[ROCm/amdsmi commit: 4262aee8f5]
2025-06-19 15:00:45 -05:00
Galantsev, Dmitrii a480b2869d rsmi_init: Do not complain loudly when no driver is found (#74)
Co-authored-by: Samuel Thibault <samuel.thibault@ens-lyon.org>


[ROCm/amdsmi commit: ca52da194d]
2025-06-19 13:22:48 -05:00
Narlo, Joseph 154d266abc [SWDEV-482203] amd-smi Usage basics for C Library Multiple doc errors (#477)
* Added finding rocm include and library paths in code examples

Signed-off-by: josnarlo <Joseph.Narlo@amd.com>

[ROCm/amdsmi commit: db3d763aad]
2025-06-19 11:25:57 -05:00
josnarlo 0862dd11fb [SWDEV-537038] amd_smi-lib build failing Fix for integration_test.py
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: 99b2bfbc61]
2025-06-19 11:23:25 -05:00
Justin Williams 31df8b46bd Adjusted amd-smi set --compute-partition docs
Signed-off-by: Justin Williams <juwillia@amd.com>


[ROCm/amdsmi commit: 81d58f06d1]
2025-06-19 10:58:04 -05:00
gabrpham_amdeng 771e3019ad Adjusted CU % logic to be more robust
[ROCm/amdsmi commit: 9729aba695]
2025-06-19 10:57:19 -05:00
gabrpham_amdeng d049815647 Changed NUM_CU to CU %
[ROCm/amdsmi commit: fd751ba918]
2025-06-19 10:57:19 -05:00
gabrpham 66d3ffe65a Added GTT Memory to process table of default command
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: 9e221a3f09]
2025-06-19 10:57:19 -05:00
gabrpham 0e30436a0f Added GTT Memory to default command and adjusted table format
Signed-off-by: gabrpham <Gabriel.Pham@amd.com>


[ROCm/amdsmi commit: 8a0e65d911]
2025-06-19 10:57:19 -05:00
Galantsev, Dmitrii 06b8484bbc CLI - Fix partition json output
Change-Id: I2b9e575cb960db7c136776bfe5c040b27feba727
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 4262802588]
2025-06-19 10:34:57 -05:00
josnarlo ed9086505d [SWDEV-538604] Sync Unified Header and AMDSMI Comments
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: 5ed9fba9be]
2025-06-18 09:13:01 -05:00
Deepak Mewar 63784f77f7 Updated display format of cpu & socket affinities
Signed-off-by: Deepak Mewar <deepak.mewar@amd.com>


[ROCm/amdsmi commit: 7571eb014f]
2025-06-13 17:37:00 -05:00
Bindhiya Kanangot Balakrishnan cd709e93d1 [SWDEV-512393] Print keys of lists in custom_dump
The custom_dump function was not printing list's key
and so static numa was not displaying list keys
CPU affinity and Socket affinity. Updated custom_dump
to print the keys.

Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>


[ROCm/amdsmi commit: 6fbda16098]
2025-06-13 17:37:00 -05:00
josnarlo 48ed5787a6 [SWDEV-537983] Fix comments about temperature units for amdsmi_get_temp_metric
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: d4a946717b]
2025-06-13 16:51:59 -05:00
josnarlo 986a2dd0b5 [SWDEV-537983] Fix comments about temperature units for amdsmi_get_temp_metric
Signed-off-by: josnarlo <Joseph.Narlo@amd.com>


[ROCm/amdsmi commit: 4aee30f49b]
2025-06-13 16:51:59 -05:00
Pham, Gabriel dfaf8386fa Added GTT Memory to default output process table (#480)
* Added GTT Memory to default command and adjusted table format

---------

Signed-off-by: gabrpham <Gabriel.Pham@amd.com>

[ROCm/amdsmi commit: 940ece6813]
2025-06-13 16:43:56 -05:00
dependabot[bot] b1753ad3b3 Bump rocm-docs-core[api_reference] from 1.17.0 to 1.20.1 in /docs/sphinx
Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.17.0 to 1.20.1.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.20.1/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.17.0...v1.20.1)

---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
  dependency-version: 1.20.1
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

[ROCm/amdsmi commit: 152184dd49]
2025-06-13 16:35:08 -05:00
Maisam Arif 34041504f9 Update workflows and Contrib docs
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I2ae31144ee1ab29c8bbba83f0c7eb0bb9dc079ba


[ROCm/amdsmi commit: 049c59c5bb]
2025-06-13 16:19:10 -05:00
Maisam Arif 6688ae237f Updated 6.4.2 Changelog
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I975f5db0bde9ebccec3756415cb1e7dc47e78988


[ROCm/amdsmi commit: 772b572913]
2025-06-12 17:17:13 -05:00
Maisam Arif 6e37490e87 [SWDEV-529665] PLDM Bundle naming
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: Id7f652ddc4e790027869683a4aaa3226ffc05c83


[ROCm/amdsmi commit: 6da33b8ded]
2025-06-12 02:19:37 -05:00
Maisam Arif 7be2218717 [SWDEV-537491] Updated Copyright to aca-decode files
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I9621e4c54f3b490c6eb4cfc3e9bdfb4d489f0052


[ROCm/amdsmi commit: 5763412f7d]
2025-06-11 20:51:51 -05:00
Arif, Maisam 2658f0fe20 Fixed type hinting & Added copy rights (#462)
* Added copyrights
* Fixed type hinting for processor_handle in python_interface
* Fixed Incorrect type hinting to actual return types

---------

Signed-off-by: Arif, Maisam <Maisam.Arif@amd.com>
Change-Id: Ie2a09acf628ed0c43eacc8ec78c159d125acbcdb

[ROCm/amdsmi commit: 23b9da656c]
2025-06-11 17:19:02 -05:00
Justin Williams 0c2228852a CI - Added Build Warnings
Signed-off-by: Justin Williams <Justin.Williams@amd.com>


[ROCm/amdsmi commit: 6d03ca79ff]
2025-06-11 13:13:38 -05:00
Maisam Arif b8caa120a8 [SWDEV-537062] Fixed CU Occupancy reporting UINT MAX
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>
Change-Id: I975579997a9e455eb930f6c0b8fc5f3dc3cbfae4


[ROCm/amdsmi commit: b579d89ae2]
2025-06-11 10:42:00 -05:00
dependabot[bot] aa35398722 Bump requests from 2.32.3 to 2.32.4 in /docs/sphinx (#471)
Bumps [requests](https://github.com/psf/requests) from 2.32.3 to 2.32.4.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.32.3...v2.32.4)

---
updated-dependencies:
- dependency-name: requests
  dependency-version: 2.32.4
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>

[ROCm/amdsmi commit: 7e956ce4f3]
2025-06-11 08:23:27 -05:00