adapryor
33924ea79e
Profiler - Fix SIMD Utilization
...
Change-Id: I6775cce9901a714d20e80c8c17e7a563edeb48a4
2025-05-07 00:56:52 -05:00
Galantsev, Dmitrii
fa8b89f4ae
CMAKE - Format with cmake-format
...
Change-Id: I08e71fc5060b1f6e0168225cc5fe66886c2044bd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-05-06 17:28:14 -05:00
Galantsev, Dmitrii
f89beb90f5
CMAKE - Add cmake-format
...
Change-Id: I4036859491934ed26303530d0dc1afb4f1b0d0cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-05-06 17:28:14 -05:00
Galantsev, Dmitrii
02c0786a2c
Profiler - Add SIMD_UTILIZATION ( #171 )
...
Change-Id: I19d5acd80dbed8c4fc4e1c85eec71ca89398d299
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-05-06 13:20:03 -07:00
Rawat, Swati
3e653b7ab3
RDC Doc formatting ( #166 )
...
* doc formatting
* Update job_stats_sample.rst
* Doc formatting
---------
Co-authored-by: srawat <120587655+SwRaw@users.noreply.github.com >
2025-05-05 13:08:33 -05:00
Rawat, Swati
4a230f0180
fix broken link ( #169 )
...
Update job_stats_sample.rst
Co-authored-by: srawat <120587655+SwRaw@users.noreply.github.com >
2025-05-01 10:49:31 -05:00
Pryor, Adam
2db6ddea69
[SWDEV-523349/SWDEV-527257] Fix Rdci Config ( #161 )
...
Change-Id: Iae21ea8061205f186086a3ed59c6259ddeb1dbe7
Signed-off-by: adapryor <Adam.pryor@amd.com >
2025-04-28 11:57:51 -05:00
Peter Park
9edacdeac4
remove comments in Doxyfile referring to wikipedia.org for IAS check
2025-04-23 17:25:21 -05:00
Peter Park
0484dbed94
Update doxyfile
2025-04-23 17:25:21 -05:00
Peter Park
712657b24e
bump rocm-docs-core to 1.18.1
2025-04-23 17:25:21 -05:00
Peter Park
c3aafd846d
update sphinx/conf.py
2025-04-23 17:25:21 -05:00
Peter Park
aac1de1e76
bump rocm-docs-core to 1.17.1
2025-04-23 17:25:21 -05:00
Hila, Nino
6a1c7d8e43
Add palamida.yml
2025-04-22 11:06:25 -05:00
Bill(Shuzhou) Liu
855d185532
Add license file
...
Add license files which are missing.
2025-04-16 11:06:31 -04:00
Galantsev, Dmitrii
5efdcc23fc
CI - Add cherrypick labels automatically
...
Change-Id: Icbd0c70c9cbee2b119e7e74d6cdfe83e93a83df9
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-15 18:44:37 -05:00
Galantsev, Dmitrii
a5cb334f8b
Add RDC_FI_GPU_BUSY_PERCENT
...
AMDSMI needs to merge first and bump the version to at least 24.4.2
Change-Id: I30149bb78c79ebc3de0dabdc8e63fcef12b2f406
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-15 17:00:56 -05:00
Galantsev, Dmitrii
ac50573e67
CMAKE - Bump version to 1.1.0
...
Change-Id: I0fbc0f6d842c034ad858f30fa6418afd01e11a4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-11 17:27:27 -05:00
Galantsev, Dmitrii
dfae9cd37f
Profiler - Remove buffer to fix memory leaks
...
Change-Id: Ia3717ccfc147221557f5469965c2abb76b3f451c
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-11 17:27:27 -05:00
Pryor, Adam
58811fecbb
[SWDEV-515192] Fix rdc topo ( #146 )
...
Change-Id: I64a8077a56e2eaf99735fafb1010d869a1fdb0c3
Signed-off-by: adapryor <Adam.pryor@amd.com >
2025-04-10 17:46:08 -05:00
Galantsev, Dmitrii
91be467cad
Profiler - Fix eval fields
...
The 'value' pointer was being written to a lot and then used for reading
within the same function. This likely caused issues all over RDC when
reading the metrics.
This commit changes it so *value is written to only once.
Change-Id: I83c158c1e46c6ce46ff87d8a2e769f26ffa8c0da
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-09 20:06:21 -05:00
Galantsev, Dmitrii
24024f0e4f
Revert "Implement CPU discovery support"
...
This reverts commit f967f8a17d15e148464393fcd145af01dc0e1525.
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii
c96f5db52c
Revert "Fix breaking changes introduced with CPU support"
...
This reverts commit e9ac9e4626e3e45ebdfafb39e251d073091429f1.
2025-04-07 20:45:19 -05:00
Galantsev, Dmitrii
0aeceefcb3
Fix breaking changes introduced with CPU support
...
Changes introduced in 3bdca8b8b6
broke RDC if it was compiled without ESMI support, or if esmi driver is
not loaded when RDC is being used.
Change-Id: Id54e1e9002d2e3cf09240081149eed84178700af
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-04-07 14:41:46 -05:00
Yuan, Perry
3bdca8b8b6
Implement CPU discovery support ( #77 )
...
* Implement CPU discovery support
SWDEV-482949:
enable the CPU model name info support to the RDC, rdci command
can detect GPU and CPU modules at the same time.
It will query the CPU info through the amdsmi interface like below:
1 GPUs found.
-----------------------------------------------------------------
GPU Index Device Information
0 AMD Radeon PRO W7800
=================================================================
1 CPUs found.
-----------------------------------------------------------------
CPU Index Device Information
0 AMD Ryzen Threadripper PRO 7995WX 96-Cores
-----------------------------------------------------------------
Change-Id: Ibc6533c9a61000cd86c45b1bae14c3eb6788c119
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
* CMAKE - Add required version for amdsmi
Change-Id: I341a89351d196ec66cce215a5d1d3953302fcc66
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
---------
Signed-off-by: Perry Yuan <perry.yuan@amd.com >
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-31 10:58:36 +08:00
Galantsev, Dmitrii
80ee980cdb
CMAKE - Fix build types
...
Addresses issue https://github.com/ROCm/rdc/issues/43
Change-Id: I456184358524a6feef4bf83eecb655678c3bc42d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-30 18:54:54 -05:00
Mallya, Ameya Keshava
4067831731
Added KWS check for amd-mainline ( #140 )
2025-03-28 08:23:38 -07:00
Galantsev, Dmitrii
bdb2367010
RVS - Add long-running tests
...
Change-Id: Iddeb7f2d4fdcd69d7ac1ae94b2fa128ee3011b1a
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-27 23:42:56 -05:00
Galantsev, Dmitrii
58350a8bb8
Profiler - Remove bootstrap link
...
Change-Id: Ieea57515d77c2d521d95568c3bc2660cc829d829
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
ea7ccd0660
Fix diagnostic example and allow building
...
Change-Id: Icc85e8018a11b66d1190fa910151acb79cd17b83
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
059d015ea4
CMAKE - Add BUILD_INTERFACE include dirs for rdc_bootstrap
...
Change-Id: I93df878b21e245277c7a8d9589102a15c2517f4f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-27 23:29:30 -05:00
Galantsev, Dmitrii
51de344be7
Profiler - Add CPC and CPF metrics
...
Change-Id: I27fd725e9e1868c9afe7624d6e4aafad2a42d47e
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-27 19:01:23 -05:00
Pryor, Adam
47692d3ed5
[SWDEV-498711] RDC Partition Implementation ( #119 )
...
* [SWDEV-498711] RDC Partition Implementation
Change-Id: Ibfc3709793770537e4c9d36458f34c6b4f461724
Signed-off-by: adapryor <Adam.pryor@amd.com >
2025-03-27 14:10:11 -05:00
Galantsev, Dmitrii
929041b556
Fix amdsmi_get_power_info API
...
This change creates a workaround for a broken C api in amdsmi.
amdsmi_get_power_info API is broken in rocm 6.4.0 (amdsmi 25.2) and is fixed
in rocm 6.4.1 (amdsmi 25.3).
Breaking AMDSMI change:
https://github.com/ROCm/amdsmi/commit/dc4a16da6fb45d581a6e23c78d340172989418a0
Change-Id: Ib45a2702aa722c7735f3ccd1081d8f62e4d34216
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-19 23:12:45 -05:00
Galantsev, Dmitrii
f5a4402ce5
RVS - Use config files and make GPU aware
...
Change-Id: I7a5c80ed4e6122d102e494d1ae38b4b7d40c42cd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-11 15:39:16 -05:00
Galantsev, Dmitrii
247c8c7d5e
RVS - Disable IET test
...
Change-Id: I015d68735316d2dc6af18d16f972d9f379b76bcf
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-11 09:51:08 -05:00
Galantsev, Dmitrii
6769f64ba0
CHANGELOG - Add 6.4.0 updates
...
Change-Id: Ia788b1b51d6ef93c5d065c70a31a029d76fdab98
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com >
2025-03-07 20:02:07 -06:00
srawat
a865793b70
Refactor RDC documentation
...
Change-Id: Ieaba84992a8cbd185f4c2d1dc36a175c0429b754
2025-03-07 19:50:08 -06:00
Galantsev, Dmitrii
3ec9d6c2d2
Fix workflow until grpc updates on github
...
Change-Id: Idf3faa9f7991e4a7ecf78dfb13aafe5c6533fa01
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-07 19:25:53 -06:00
Galantsev, Dmitrii
8b249046c0
Update gRPC to 1.67.1
...
Change-Id: I911878a3aeec8c9234b0e1ac4447364f2ed845cc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-07 18:36:34 -06:00
AL Musaffar, Yazen
cf566ebd31
RDC REST API (Sample code)
...
Please follow the README file
Update README_rdc_rest_api.txt
Update RDC_REST_API.py
Error handling updates
Updates for error handling
Updates
Updates for rdc_field_watch/rdc_field_unwatch and delete query
Updates for rdc_field_watch/rdc_field_unwatch and delete query
SWDEV-479738 [RDC] - Rest API
Delete python_binding/RDC_REST_API.py
new rdc_rest_api.py file for SWDEV-479738 [RDC] - Rest API
2025-03-07 20:48:15 +00:00
adapryor
e847f74f78
Fix Prometheus counters
...
default to gauage
Change-Id: Ia0428e61f023f10b02b3ebe103870d40c057abe3
Change values in question to gauges
Change-Id: I81c91c880246342a0ad0586f6dbe50b247a01117
fixes
Change-Id: I949438d3d3b511c22649640e082b59a3fb7696e0
Fix info handling
Change-Id: I8091fbfa55ba5a9c21c4569dd40e37fb432924f3
fix default
Change-Id: Ia449fed18730a06a858107e9218dc7b443a681fb
2025-03-07 20:48:11 +00:00
adapryor
9571dad23d
[SWDEV-517396] Align rdc_field with rdc_bootstrap
...
Signed-off-by: adapryor <Adam.pryor@amd.com >
Change-Id: I5e05e25c5980a3141665ae2d13a6ae09207ccb41
2025-03-04 08:49:28 -06:00
Galantsev, Dmitrii
d5f8ff0ab0
CMAKE - Set fallback version to 0.3.0
...
Change-Id: I2322bdb7d3a8e4f83346ca4f5d24351ad2a4eccc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
2025-03-04 08:43:32 -06:00
Li Ma
26ea06bb69
Modify the error log for MM_ENC_UTIL
...
Signed-off-by: Li Ma <li.ma@amd.com >
Change-Id: I83805fc8ad7003ecd5189c8f940b44edbf0ebd1f
2025-03-04 08:42:22 -06:00
Arif, Maisam
552f15a1fb
Fixed RDC to work with updated amdsmi_get_power_info() ( #115 )
...
Change-Id: Ic9e7a68ae58f61dbe73fc7d1b17af34152933e71
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
2025-02-11 00:51:29 -06:00
Pryor, Adam
93a8ab8915
SWDEV-512736 Fix RDC Policy callback printout ( #114 )
...
Change-Id: I6e018dcb0a6b272812c959649d913e3ba33def40
2025-02-10 08:40:03 -06:00
Williams, Justin
e1d3b6b5b8
[SWDEV-479339/SWDEV-498804] Added RDC Dockerfile ( #50 )
...
* [SWDEV-479339/SWDEV-498804] Added RDC Dockerfile
* Updated Dockerfile
2025-02-04 12:58:40 -06:00
Justin Williams
f106364fc7
Make README.md pretty
...
Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d
2025-01-31 12:22:45 -06:00
Galantsev, Dmitrii
bee9991c4a
Revert "Dgalants/add auth script location ( #108 )"
...
This reverts commit a70aa81cfd .
2025-01-31 12:22:45 -06:00
Pryor, Adam
a70aa81cfd
Dgalants/add auth script location ( #108 )
...
* DOCS: Add authentication scripts location
Change-Id: Ie285d80ea6d9bb8f710998208d0aa7c6db661d02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
* Make README.md pretty (#44 )
Change-Id: I7c3341deaf3621ebbc9e495b023b1dd4971a5f1d
---------
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com >
Co-authored-by: Williams, Justin <Justin.Williams@amd.com >
2025-01-30 12:08:11 -06:00