Maisam Arif
cec3e4c2a0
[SWDEV-478576] Added Disclaimer for CLI Tool
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I3d432ac3f8c9663365921591d183a5d1f35c4707
2024-08-09 14:14:50 -04:00
Maisam Arif
40112f5b17
Bump Version to 24.6.3.0
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I902da5e5e9e7441002420afaaef01ca9c6c9666f
2024-08-08 01:30:51 -05:00
Ranjith Ramakrishnan
7591eec971
SWDEV-469004 - Append additional path to system path
...
amd-smi is installed in /opt/rocm-ver/bin, but not as a soft link in the wheel package.
For amd-smi to work from the bin directory, it needs the extra path to find the dependent Python scripts in /opt/rocm-ver/libexec/amd_smi.
Change-Id: I4ff63a8f55949aaac51d85eae849ecc890f4c694
2024-08-08 02:15:04 -04:00
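The path fix described in the commit above can be sketched roughly as follows. This is an illustration only, not the actual amd-smi entry script: the helper name `add_libexec_to_path` is hypothetical, and the layout (a `bin` directory next to `libexec/amd_smi` under the same prefix) is assumed from the paths in the commit message.

```python
import sys
from pathlib import Path


def add_libexec_to_path(script_path, search_path=None):
    """Append <prefix>/libexec/amd_smi to the module search path.

    Hypothetical helper: assumes the script is a regular file in <prefix>/bin.
    """
    if search_path is None:
        search_path = sys.path
    # <prefix>/bin/amd-smi -> <prefix>; symlinks are deliberately not resolved
    prefix = Path(script_path).absolute().parent.parent
    libexec = str(prefix / "libexec" / "amd_smi")
    if libexec not in search_path:
        search_path.append(libexec)
    return libexec
```

Appending rather than prepending keeps any already-importable modules ahead of the extra directory; whether the real script appends or inserts is not stated in the commit.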
Ranjith Ramakrishnan
92a4093256
SWDEV-476075 - Prevent the modification of interpreter directives
...
CPack converts /usr/bin/env python3 to /usr/libexec/platform-python on RHEL8.
Undefining __brp_mangle_shebangs prevents this.
Change-Id: I5120274b90aeaf783b62414ac2aeba9e84029205
2024-08-08 02:04:37 -04:00
Maisam Arif
574712386f
Fixed handling in GPU/CPU/CORE select functions
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I83d78a8d6cdcbd54e5c79330be577b3a06a00985
2024-08-05 18:27:30 -04:00
gabrpham
0143041262
Fixed cli issue with empty cpu/core parameter
...
Change-Id: Id0fee74357a56baaec59ca5359eb00a65cfd6185
Signed-off-by: gabrpham <Gabriel.Pham@amd.com >
2024-08-05 16:37:36 -05:00
gabrpham
fe1dc23ade
Fixed 'amd-smi process -G'
...
Issue linked here: https://github.com/ROCm/amdsmi/issues/23
Signed-off-by: gabrpham <Gabriel.Pham@amd.com >
Change-Id: I73c2dede8634b21a5dfe0245a202e883fa856de2
2024-08-02 16:42:08 -04:00
Galantsev, Dmitrii
3784f37a3a
Cleanup convert_SI_unit and misc linter warnings
...
Change-Id: I000ba548b79a7023aabad653125842064fa2e7cb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-08-02 10:29:06 -04:00
gabrpham
de8145387d
[SWDEV-439701] Additional GPU error handling
...
Change-Id: Ieb35e9712f2a78acef8961d865dba1d824969ef3
Signed-off-by: gabrpham <Gabriel.Pham@amd.com >
2024-07-30 16:19:10 -05:00
Galantsev, Dmitrii
f3426ced06
Docs - Switch to amd-staging branch
...
Change-Id: I1a26542b3a7831c1f5efea6d6b4084f77b0a7cdb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-07-23 17:10:34 -05:00
Galantsev, Dmitrii
ceac87ef5a
Azure - Switch to amd-staging branch
...
Change-Id: I5cc2316427631fc17990506c4234163302febd3d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-07-23 17:04:37 -05:00
Galantsev, Dmitrii
47c8cd10cf
Fix missing c_str() introduced in 8bc8307
...
Change-Id: Ife778276aaebd109a413efb3db703de36b730613
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-07-19 19:12:17 -04:00
Charis Poag
ac40e963d3
Fix TypeError: 'type' object is not subscriptable
...
Python 3.8 requires importing generic types from typing in order to subscript them.
Python 3.10 no longer requires the typing import.
Change-Id: I5d9844c91932bc3af53acc6dd56eb258f4d18d9b
Signed-off-by: Charis Poag <Charis.Poag@amd.com >
2024-07-19 16:33:41 -05:00
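The error named in the commit above typically appears when a built-in container type is subscripted in an annotation on an older interpreter (built-in generics became subscriptable in Python 3.9). A minimal sketch of the portable form follows; the function `gpu_ids` is hypothetical and not taken from the amd-smi code base.

```python
from typing import Dict, List


def gpu_ids(devices: List[Dict[str, int]]) -> List[int]:
    """Return the 'id' field of each device record."""
    return [d["id"] for d in devices]


# On Python 3.8, annotating with list[dict[str, int]] instead raises
# "TypeError: 'type' object is not subscriptable" when the def is evaluated.
```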
Maisam Arif
8bc8307c60
[SWDEV-474450] Removed DEVICE_MUTEX from gpu_reset
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I706fb47288738bfbde94b56fee66bbf807b3c0cb
2024-07-19 11:47:52 -04:00
Maisam Arif
3a9c93bfa6
Updated Changelog with Mutex Fix
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I0aee284ce7600efc66b0ad5392c11bb6a502a929
2024-07-19 11:18:09 -04:00
Galantsev, Dmitrii
baaa1cc4b6
[SWDEV-471523] - Replace lspci and lscpu with sysfs reads
...
Change-Id: Ia5a12c04d3064787a0f36a961e68533b79e17ed0
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-07-19 00:29:36 -05:00
Galantsev, Dmitrii
0f6224cbe6
Merge branch 'dgalants/fix_mutex' into amd-dev
...
This change imports changes done to shared_mutex in rocm_smi_lib repo
Change-Id: I1412440b52a3419127b03003df3cc36fa71875d4
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-07-18 21:14:34 -05:00
Bill(Shuzhou) Liu
78ee3f5e5a
Remove thread safe only mutex warning message
...
In multi-GPU environments, too many warning messages are generated,
so the message is removed.
Change-Id: I275de2397eb0e6b189e2e17e94335cb1e8f97815
2024-07-18 20:43:57 -05:00
Bill(Shuzhou) Liu
dbba33d3f5
Support thread only mutex
...
Set the environment variable RSMI_MUTEX_THREAD_ONLY=1 to enable the thread-only mutex.
RSMI_INIT_FLAG_THRAD_ONLY_MUTEX can also be passed to rsmi_init()
to enable the thread-only mutex.
Change-Id: I2d9844039b774e386f03bb9bb130d8c342504ea6
2024-07-18 20:43:38 -05:00
Bill(Shuzhou) Liu
06b9232a56
Unlock the mutex when process is dead
...
After the dead process is detected, pthread_mutex_consistent() will
be called. After that, the pthread_mutex_unlock() should also be
called to unlock it: "It is the responsibility of the application to
recover the state so it can be reused."
Change-Id: I45d3e2e68c3b06779f3acb1e908dbec0c6a39297
2024-07-18 20:43:21 -05:00
Bill(Shuzhou) Liu
33dab0c232
Remove const to avoid compile error
...
Fix the compile error
Change-Id: I422b606b2b969b418c2e77b47a3afad0cfc732a1
2024-07-18 18:15:43 -04:00
Maisam Arif
8f15c22606
Fix amd-smi event from reading NONE event type
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I7acf91eb682b3f0873ca34a98191eafd3925a344
2024-07-18 17:54:08 -04:00
Maisam Arif
b5f9e6a91d
Corrected passthrough platform logic
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: Ibbc4f5dd6300efd16f26efe17bc68eb8c22ed8be
2024-07-18 17:54:02 -04:00
Maisam Arif
5b7be3bf99
[SWDEV-474474] - Changed Monitor PCIE_REPLAY count to use gpu_metrics
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I4351a23e8412875bb4b23b30747ac6d0bf3d3c56
2024-07-18 17:53:49 -04:00
Maisam Arif
0fc779b14f
[SWDEV-439701] Add missing parameter CLI error cases
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I8a6623ccdd7b16be320ea0f6f48b7b6cb3511983
2024-07-18 14:24:00 -04:00
Maisam Arif
c83bcb7ac9
[SWDEV-472641] Set file opening and writing encoding to utf-8
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: Ic5cdca41b588341f7894de72dfe3d949378f9a61
2024-07-12 09:46:44 -04:00
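Passing an explicit encoding to `open()` avoids depending on the platform's locale default, which is the kind of change the commit above describes. A minimal sketch, assuming nothing about the actual CLI code: `write_report`, `roundtrip_demo`, and the file name are hypothetical.

```python
import os
import tempfile


def write_report(path, text):
    """Write text with an explicit UTF-8 encoding, then read it back."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)
    with open(path, "r", encoding="utf-8") as f:
        return f.read()


def roundtrip_demo():
    """Round-trip a string containing a non-ASCII character."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "report.txt")
        return write_report(path, "edge temperature: 45 °C\n")
```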
Maisam Arif
2388ff7e3c
Whitespace
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I8335e617670a471a97bf54886b3221b6222e507f
2024-07-10 19:22:02 -05:00
Maisam Arif
548938389d
Bump Version to 24.6.2.0
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: Ic389b6783514e88c43958ff5d3413a4c4a8a884f
2024-07-10 19:15:17 -05:00
Charis Poag
7194aaebf3
[SWDEV-455442/SWDEV-464645] Add back voltage curve testing for MI300
...
Validation requires running tests on MI300 systems; this update
removes the exclusion for these systems.
Change-Id: Idacf3e8bf0bd569f1cfa6192af47993eb5440ee6
2024-07-08 14:24:26 -05:00
Maisam Arif
6e5c4b422a
Added dmon alias for 'amd-smi monitor'
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I4a787034bd7ab1a0d08d8cfdd038add5c35cdea4
2024-07-02 11:05:59 -04:00
Maisam Arif
27cd092c6d
Updated error code naming to be in line with Host
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I6c66640742a25361ad6399763ee442598ffb0ac6
2024-06-28 08:58:52 -05:00
Bill(Shuzhou) Liu
7a617e6ef2
Make the devInfoTypesStrings.at(type) exception safe
...
Wrap it in a function to make it exception safe.
Change-Id: I29835993ae4fe2b7aa1a7027fab88c05ba89e6e3
2024-06-26 08:33:44 -05:00
Maisam Arif
a3758f82dc
SWDEV-457854 - Unified BM error codes
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I5b232de3b598bd3146eb0528f61c628da93278d9
2024-06-25 18:33:40 -05:00
Maisam Arif
943c74b6ef
SWDEV-446724 - Version command human readable output file format fix
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I94de5b03355b503ade307f2a3881acd07266d6c7
2024-06-24 10:35:34 -05:00
Maisam Arif
413c9ef6fe
SWDEV-466302 - Changed blank processes to N/A & Updated Docs
...
Change-Id: I2d68430dda8036879f58b0f1dea5d2825b441179
2024-06-24 00:38:17 -04:00
Oliveira, Daniel
a20db864b8
fix: [SWDEV-466302] [rocm/amd_smi_lib]
...
Fixes the issue where `amdsmi_get_gpu_process_list` requires sudo to access pid and memory information
Code changes related to the following:
* amdsmi_get_gpu_process_list()
* CLI
Change-Id: I72b154c220276b354c350fcc067c9a7c32e6c173
Signed-off-by: Oliveira, Daniel <daniel.oliveira@amd.com >
2024-06-24 00:38:17 -04:00
muthusamy
057d688b55
amd-smi [CPU]: Added Support to get number of threads per core
...
Change-Id: I7e6500f3f53068a3483b64a54d78ac9e1d9cd183
2024-06-21 17:22:55 -04:00
Maisam Arif
92f014059e
SWDEV-435197 - Add process table to CLI monitor subcommand
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: Ibe06f4a4be619ae9cba909c2474b0e482eeb87d5
2024-06-19 23:36:55 -05:00
Maisam Arif
a3497702cb
Bump Version to 24.6.1.0 and Update Changelog
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I88b8ad1341d7f1a2e14517db82979bb6b28575e8
2024-06-18 23:54:26 -05:00
Bill(Shuzhou) Liu
e3c63628e5
Change the clean shader API to clean local data
...
To align with the unified API.
Change-Id: I2819339fba6f528204cebd3e9605109e82cbc5b4
2024-06-17 16:23:33 -05:00
Sam Wu
94cab382bf
[ROCDOC-593] Update Read the Docs documentation requirements
...
Change-Id: I48350a42a50f0e85d9f074295e27c20b4b66cdb4
2024-06-17 16:54:59 -04:00
Bill(Shuzhou) Liu
4441249ffa
Add return character when setting the PM policy using sysfs
...
When setting the PM policy in sysfs, the driver expects a return character.
Change-Id: I83cddb3cdb14c226e6e856776176000eea33b251
2024-06-13 11:02:13 -04:00
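The behavior described in the commit above amounts to terminating the written value with a newline. The sketch below is illustrative only: `write_sysfs_value` is a hypothetical helper, and the demo writes to a temporary file named `pm_policy` (an assumed name) instead of a real sysfs node, whose path the commit does not give.

```python
import os
import tempfile


def write_sysfs_value(path, value):
    """Write value followed by a newline, as the driver is said to expect."""
    data = f"{value}\n".encode("ascii")
    with open(path, "wb") as f:
        f.write(data)
    return data


def demo():
    """Write to a stand-in file and return the raw bytes that landed there."""
    with tempfile.TemporaryDirectory() as tmp:
        node = os.path.join(tmp, "pm_policy")  # assumed name, not a real node
        write_sysfs_value(node, 1)
        with open(node, "rb") as f:
            return f.read()
```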
Dalibor Stanisavljevic
80043adb81
Changed type to uint32_t oam_id due to header unification
...
Signed-off-by: Dalibor Stanisavljevic <Dalibor.Stanisavljevic@amd.com >
Change-Id: I351415f4a766ad6aa0c2e81adf8b416d066048ea
2024-06-13 05:12:55 -04:00
Maisam Arif
9fb2c967de
SWDEV-466598 - Fixed CLI process outputs
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I902e82b6e78311e99542b109435346889daa49fc
2024-06-08 18:31:08 -05:00
Maisam Arif
f9bfb746fb
Update Python API README example code
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I5dbb2e3cdab31b41e6f502d3257fe899eed1ee97
2024-06-07 16:20:00 -04:00
Maisam Arif
37c044696d
Removed Throttle Status from CLI Tool
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I8eb8f30f821589003201d6d8bb96592ec5f8a483
2024-06-07 15:19:48 -05:00
Maisam Arif
936451e286
Updated Changelog
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I1b2feca549931fa5f51c80022d6a5be3727d3261
2024-06-06 15:02:20 -05:00
Maisam Arif
2197a393f4
Corrected voltage curve cli output in N/A case
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Change-Id: I77e0888d94ba489eb4820cdd6c20e9f7de04241f
2024-06-06 15:55:22 -04:00
Galantsev, Dmitrii
23614524ac
update_wrapper.sh - Fix grep command
...
Change-Id: I497fd0d1ac28adc0975b3ab0ea74ce30a9f84f29
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2024-06-06 00:17:13 -04:00
Bill(Shuzhou) Liu
4cf59c4edb
Change the name of clear sram to run cleaner shader
...
The function cleans the local data in LDS/GPRs. The name "clear sram"
is misleading.
Change-Id: I0385e6d6348602fe0f347d17e48ed8983f7ceb87
2024-06-05 12:07:39 -05:00