Граф коммитов

1056 Коммитов

Автор SHA1 Сообщение Дата
Maisam Arif 9b4f0f1d2b SWDEV-455131 - Updated process APIs
- Removed amdsmi_get_gpu_process_info from python API
  - Updated documentation
  - Aligned process --json output format to unit & value format

Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I82bba1b6df71020b4a5995ff63b9aa62611ce4fe


[ROCm/amdsmi commit: c551c3caed]
2024-04-18 14:00:59 -05:00
Galantsev, Dmitrii 6d2aa6f7f8 GIT - Set dependabot checks to monthly
Change-Id: If4db71c0d7b68bc03ba302a01e6cf779a32e4c2b
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: c06be55b6a]
2024-04-05 11:14:52 -04:00
Maisam Arif 552a34f7d5 Bump Version to 24.5.1.0
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I842e223b78f337a39098f652fa6e7ef51948fbaf


[ROCm/amdsmi commit: 092908daee]
2024-04-05 02:31:08 -05:00
Maisam Arif 707730f33d Added amdsmi_get_gpu_process_info python library documentation
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I2218bf664a8a155e6b3085378db0fb20f3be3f70


[ROCm/amdsmi commit: 50450a2a69]
2024-04-05 02:30:13 -05:00
Maisam Arif bdf4a5da2f Removed fb_sharing fields from Linux BM
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ia2942b9d33699ced1683270454c479701bce1246


[ROCm/amdsmi commit: 9758a8bc33]
2024-04-05 03:01:24 -04:00
Charis Poag da77ac3975 SWDEV-445668 - Align topology JSON
Updates:
    - [CLI] Updated json output to provide format
      similar to host
      eg.
      [
    {
        "gpu": 0,
        "bdf": "0000:01:00.0",
        "links": [
            {
                "gpu": 0,
                "bdf": "0000:01:00.0",
                "weight": 0,
                "link_status": "ENABLED",
                "link_type": "SELF",
                "num_hops": 0,
                "bandwidth": "N/A",
                "fb_sharing": "ENABLED"
            },
            {
                "gpu": 1,
                "bdf": "0001:01:00.0",
                "weight": 15,
                "link_status": "ENABLED",
                "link_type": "XGMI",
                "num_hops": 1,
                "bandwidth": "50000-100000",
                "fb_sharing": "ENABLED"
            },
        ...
        ]
    },
    {
    ...

Change-Id: I63217f63a4d6ebc23a8a84eaac9dbb7aff5f4cb4
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 08a3e76b26]
2024-04-01 18:37:06 -04:00
Oliveira, Daniel 9e2b1d8a09 fix: [SWDEV-442525] [rocm/amd_smi_lib]
Fixes gpu_process_list

Code changes related to the following:
  * amdsmi_get_gpu_process_list()
  * CLI
  * Examples
  * Unit tests
  * Changelog
  * Readme
  * rocm_smi_lib commit: 677433b367

Change-Id: I9210fbca7a5da92d0a8b472b72ca82597c8e4fb5
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>


[ROCm/amdsmi commit: 08e2e21bab]
2024-03-27 16:48:24 -05:00
Maisam Arif dc0771d330 Bump Version to 24.5.0.0
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I2509c8c2df54f0c5e9376fc0a21c09adc74f0ea8


[ROCm/amdsmi commit: 9800156a7a]
2024-03-27 01:08:42 -05:00
Maisam Arif 144ddec250 SWDEV-452739 - Add CEM slot type to amd-smi
Updated CHANGELOG.md and re-added spaces after bolded lines

Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ic728b3e9b083c62fe4c9791b8ede991f5dacc1ca


[ROCm/amdsmi commit: 51b3f8cccb]
2024-03-27 02:01:25 -04:00
Maisam Arif 980da3b329 SWDEV-445664 - Aligned metric --ecc & --ecc-blocks with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I93cf2bdab8c4c066bacf0e910e5620d37b362b07


[ROCm/amdsmi commit: e2e4349bd2]
2024-03-26 16:30:31 -04:00
Maisam Arif 4d62fc8bd6 SWDEV-445664 - Aligned metric --clock with Host
Change-Id: Ib4dc372aed61f6301680ac746eccf448e9d0ed00
Signed-off-by: Maisam Arif <maisarif@amd.com>


[ROCm/amdsmi commit: 93b81e5012]
2024-03-26 16:30:31 -04:00
Maisam Arif 9fcbd0f477 SWDEV-447333 - Corrected amdsmi_init() python documentation
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: If46e7236316687cd97cf1a69770f87154e2681ff


[ROCm/amdsmi commit: 8bf2bd4b89]
2024-03-26 16:30:22 -04:00
Maisam Arif 0079a66c5b SWDEV-435406 - Corrected amdsmi_get_power_info() to return N/A for invalid values
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I2aeb6f6670f6f47cd496faf7fc41192647f7d58c


[ROCm/amdsmi commit: dad2c430ea]
2024-03-26 10:43:28 -04:00
Maisam Arif c872b4b1ea SWDEV-431924 - Corrected amdsmi_get_gpu_board_info() to return N/A for invalid values
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I3f7e7c873c24b8f5ddd6784700f193c2fdf199e0


[ROCm/amdsmi commit: 72b0a6efe5]
2024-03-26 10:43:16 -04:00
Bill(Shuzhou) Liu b9b958b82c Get and set the XGMI PLPD
Update the API and CLI to support XGMI Per-Link Power Down Policy.

Change-Id: Iaf04a771eb8bb0829a5b3088d803a7355a8dfd0b


[ROCm/amdsmi commit: e4085c6414]
2024-03-26 01:48:14 -05:00
Deepak Mewar 16b0ff1657 fix for cpu enable apb error
Signed-off-by: Deepak Mewar <deepak.mewar@amd.com>
Change-Id: I092b88484046671857c4adbbbeaba78180b103ab


[ROCm/amdsmi commit: 1ac1ee4b9a]
2024-03-25 06:46:42 -04:00
Oliveira, Daniel 51d25d8feb fix: [SWDEV-448201] [rocm/amd_smi_lib]
Adds Add PCIE Errors

Code changes related to the following:
  * amdsmi_get_pcie_info()
  * CLI
  * examples

Change-Id: Ie0b7053e77c88fb18309c16e74bce75d862c45a9
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>


[ROCm/amdsmi commit: 1310c767ce]
2024-03-24 23:33:32 -04:00
Maisam Arif 438fc0f692 SWDEV-438593 - Updated proccess output error handling
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I67747da06362428587dab7467d85d8c9296d442e


[ROCm/amdsmi commit: 06fa6580c4]
2024-03-21 15:34:36 -05:00
Deepak Mewar e1a0420ac0 Updated README with esmi sample code
Change-Id: I50de7926fd76757e5810e8c531bcb6f5770ff454


[ROCm/amdsmi commit: a3407090c3]
2024-03-21 15:51:51 -04:00
Charis Poag e9190173ea Update ROCm 6.0/6.1 CHANGELOG.md & README.md
* Updates:
    - [CHANGELOG.md] Add 6.1 and update 6.0 changes
    - [README.md] Update README.md with ROCm install instructions

Change-Id: Ic701ebcb00e5d0af54d8f97707c1cec71a0aac4c
Signed-off-by: Charis Poag <Charis.Poag@amd.com>


[ROCm/amdsmi commit: 583e5e99bf]
2024-03-19 19:54:01 -05:00
Galantsev, Dmitrii 3a2f4286cc SWDEV-449212 - Fix static build
Disable Python interface and CLI tool for static builds (when
-DBUILD_SHARED_LIBS=OFF is passed to cmake)

Change-Id: I32bbd94d70628a50029a748f7493b55c91d45e02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: eef4169a0f]
2024-03-14 13:31:34 -04:00
Maisam Arif 9bbd32046e SWDEV-449314 - Added pyyaml check before installing via pip
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ie6d0d664e74b47c1efce6e6fac19ee4a1bf0d5eb


[ROCm/amdsmi commit: 25c8ff6c2a]
2024-03-14 13:31:06 -04:00
Bill(Shuzhou) Liu 18ae8f3095 Unable to reset GPU from CLI
The CLI helper compares the hex vendor id string with the number
and never match it as AMD GPU.

Change-Id: I1ababdce3a3694a5e26e5b0feef4d3d8cd40df7a


[ROCm/amdsmi commit: b2690fdf1e]
2024-03-13 10:57:15 -04:00
Bill(Shuzhou) Liu 46ab68f840 Set and get DPM policy for GPU device
Add new APIs to set and get dpm policy for the GPU device.

Change-Id: I26fa49cd17d0ce66bda3446c38945a6cf35717ff


[ROCm/amdsmi commit: 108e6d4ae6]
2024-03-12 10:32:31 -04:00
Maisam Arif 010d839dca SWDEV-443112 - Ensured dictionary output when static --bus is empty
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ibd61eeec417a9ff40cb868073b3e1eed2a87cc59


[ROCm/amdsmi commit: 2f8f34946e]
2024-03-11 15:25:28 -04:00
Maisam Arif 9326d1de19 Enabled ecc-blocks argument to linux VM
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I310c227ffa3ef45688a49cdedb43844aafe86339


[ROCm/amdsmi commit: dea4fac979]
2024-03-11 15:23:04 -04:00
Lisa cf0de025c1 fix links
Change-Id: I23520f7abf5e67453a928a07b46f126bcd5c1469
Reviewed-By: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: ec56aba6c1]
2024-03-07 06:05:53 -06:00
Galantsev, Dmitrii 01dbe9de84 Fix misc memory leaks
Change-Id: I3dbf56e98d8c1312f9081956ed590962b2bdace3
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 44c189b9f5]
2024-03-07 04:56:16 -06:00
Galantsev, Dmitrii 54c7e6f4c7 Fix memory leak created by hanging opendir
Change-Id: I01e372c6a6b427f21e89cb5e4217f876346a35be
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 0155219389]
2024-03-07 04:35:30 -06:00
Galantsev, Dmitrii 583ff6d8cb Add .github/CONTRIBUTING.md
Change-Id: Ia7a2272516f2fed37dd38debad09b79484f04684
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>


[ROCm/amdsmi commit: 50740a3e91]
2024-03-06 19:14:09 -06:00
David Galiffi 3e56a34a92 Add Doc team to CODEOWNERS file
Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Change-Id: Iad8eea0645b63bddb835ed22080facc7d25c1bc0


[ROCm/amdsmi commit: 1b0e01d504]
2024-03-05 15:03:25 -05:00
Bill(Shuzhou) Liu f0e5bffab3 Add support for deferred RAS errors in API
The API will support the deferred errors

Change-Id: I221a146f09fefde1fc31e5f746d0870e07c93561


[ROCm/amdsmi commit: c489cb8f3f]
2024-03-04 22:46:44 -05:00
Maisam Arif a807b8dc37 Revert is not None check for static & metric arugment checks
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I351c88d53c9a626ad4305a7c61dc18b976b853f2


[ROCm/amdsmi commit: c8c03dfab0]
2024-03-04 11:02:50 -06:00
Deepak Mewar 40bffc8ced Updated as per latest esmi library changes in github
Change-Id: I949e1f2dcffc223274505764c84f2c6b9a533c98


[ROCm/amdsmi commit: cfb9b5e750]
2024-03-04 11:01:36 -05:00
Maisam Arif ab907b4fd8 SWDEV-448626 - Removed gpu prefix in non-csv formats
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I77fc58828a978080482e6ab01ff89f1f5a554dc5


[ROCm/amdsmi commit: 463817f344]
2024-03-01 09:09:23 -06:00
Maisam Arif 3c0b4b46c0 Added __version__ attr to Python Library
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ibbd90eaa60cc8b9dd0387d7fac8aef06a3a43375


[ROCm/amdsmi commit: 55cfcf11d6]
2024-02-28 16:27:33 -05:00
Oliveira, Daniel 94d5c85371 fix: [rocm/amd_smi_lib] Navi3X/Navi2X/MI100 amdsmitst 2 test cases fail when running
Checks returned error by get_gpu_pci_bandwith() before assert

Code changes related to the following:
  * Unit tests

Change-Id: I950eee5d92607eea08722af7d7c84e8457cd4e60
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>


[ROCm/amdsmi commit: c6208c0db0]
2024-02-28 15:11:22 -06:00
Maisam Arif 968b6aaf41 Removed old Python API function documentation
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ib145fae98f1e99ab474b86ec4f6ddc2c8c44126e


[ROCm/amdsmi commit: 57a43babad]
2024-02-26 14:10:49 -06:00
Maisam Arif b390bbec7a Bump Version to 24.4.0.0 & Corrected argument checks for set subcommand
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I651f8ca652c764f30845503dd869f435f728d5ba


[ROCm/amdsmi commit: 69caba8727]
2024-02-23 20:47:19 -06:00
Maisam Arif 45c9118db0 Updated README and removed cpu core option from Static subparser
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I039c0f0ed2f7094aafe8849baea3cec887b7e8ff


[ROCm/amdsmi commit: fa7a2838d8]
2024-02-23 00:41:17 -06:00
Maisam Arif 60a86065c0 SWDEV-436792 - Add XGMI Table
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Ia7a43b2b6d01fd32ece00cc26c28ba3088f3aa9e


[ROCm/amdsmi commit: 4ca326d824]
2024-02-22 23:10:57 -06:00
Maisam Arif 06d32ac405 Align to Host left adjusted topology output
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I8e56156200d5eface7f069ccf82a6b7503e1a48c


[ROCm/amdsmi commit: 16c34e91ac]
2024-02-22 23:10:57 -06:00
Deepak Mewar d41232363c DCSM-371 - Observing previous mode details as null for amdsmi_set_cpu_pcie_link_rate
Signed-off-by: Deepak Mewar <deepak.mewar@amd.com>
Change-Id: I79a61d7b10aaff27b07e3d108a9b817c5ead6cf3


[ROCm/amdsmi commit: f48e3f48a3]
2024-02-22 16:30:18 -05:00
Maisam Arif 4cf9c0eb03 SWDEV-436992 - Added Units of Measure to JSON Output
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I0aba1533bc2919b7354ef6cad5ae4ae5d23e92a7


[ROCm/amdsmi commit: 794354dad9]
2024-02-22 07:16:13 -05:00
Maisam Arif 8425ea9d50 JSON Alignment with Host for singular device output
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I12acbae8b385dac75ccc37e04d40a29153ba1944


[ROCm/amdsmi commit: 9146e9c6eb]
2024-02-22 07:16:01 -05:00
Maisam Arif 8c4518eb66 SWDEV-445664 - Aligned Metric Command with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I905ee72272bb4c5ccde3e237d2663ec6e0e55034


[ROCm/amdsmi commit: 542bfc0c77]
2024-02-22 07:15:17 -05:00
Oliveira, Daniel 49dd38f117 fix: [rocm/amd_smi_lib] TestFrequenciesRead & TestPciReadWrite test cases failed
Fixes asserts in unit tests, and 'pp_dpm_pcie' condition

Code changes related to the following:
  * rsmi_dev_pci_bandwidth_set()
  * Functional tests

Change-Id: Id5e6851393fa3b51bb8cad87daca1efaf500a7e0
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com>


[ROCm/amdsmi commit: 475424525e]
2024-02-22 03:40:50 -05:00
Maisam Arif 13e9ab4ad6 SWDEV-445396 - Aligned Static Command with Host
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: I4182b9104e173f54830fc44819a61d74d31d65d7


[ROCm/amdsmi commit: a719ae9707]
2024-02-22 03:35:00 -05:00
Bill(Shuzhou) Liu 21cf0c1b5c Unify the amdsmi_get_pcie_info python interface
Make the python interface consistent with the C interface.

Change-Id: Idda08f888947c757e475d5a024b0ec3d8e1d846a


[ROCm/amdsmi commit: db33cda0c1]
2024-02-22 03:33:59 -05:00
Maisam Arif 2c3537e389 Refactor ESMI Initialization and Argument Parsing
Signed-off-by: Maisam Arif <maisarif@amd.com>
Change-Id: Iefab3a8110e0d3c525ee0cef1bdef9101550e9de


[ROCm/amdsmi commit: f58613561c]
2024-02-21 19:02:14 -05:00