Galantsev, Dmitrii
1f5fa94132
Error if power metric inaccessible
...
Change-Id: I359c24f24d0200181646d5a7c13a6e0e4d4958b6
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-05-07 04:39:39 -05:00
Galantsev, Dmitrii
5525bf8c86
AMDSMI - Add ring hang event
...
Change-Id: I84696e3cc1a4eba8de48e464f1a208ed9c6e489d
Depends-On: I2e73ba08ee0004f6f30660b2fa425ea94bafceca
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-05-03 16:45:42 -05:00
Bill(Shuzhou) Liu
61a75d346b
Add new XGMI and PCIE bandwidth fields from gpu_metrics
...
For new ASICs, RDC_EVNT_XGMI, RDC_FI_PCIE_RX and RDC_FI_PCIE_TX
are not supported. The new fields RDC_FI_XGMI and RDC_FI_PCIE_BANDWIDTH
should be used instead.
Change-Id: Iff5bbef4c07994090fa7c4e9b319966215525283
2024-05-03 16:18:17 -04:00
Brandon Bagwell
de3cb36ce0
Adds the ability to modify 'rdc' options
...
Modifying the /opt/rocm/etc/rdc file modifies RDC launch options. If
the file doesn't exist, the service should still launch (though a new
file should likely be included with the next released package of 'rdc').
Change-Id: I1a1891e9c5c3e6048754eb555779a97a170754c0
2024-04-30 10:28:16 -05:00
Galantsev, Dmitrii
cb87eeeae7
Update kBlockNameMap
...
Change-Id: I096f40f2b953fad7081d4b9bc05c0291c0f8058d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-04-24 23:50:55 -05:00
Galantsev, Dmitrii
0c8827c4b7
CMAKE - Use ADDRESS_SANITIZER env var
...
Change-Id: I4727120de2f9d7bded8c24033c252ede718831fc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-04-24 23:04:25 -05:00
Galantsev, Dmitrii
9702d0f2d7
SWDEV-439576 - rocmsmi -> amdsmi
...
- Migrate to amdsmi library
- NOTE: raslib still uses rocmsmi
- Remove unused rocmsmi service
- Remove unused RDC client code
- Remove RSMI calls from protos/rdc.proto
Change-Id: Ifc34a264c506b0ec5792307ee56b34526268762d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-04-09 20:19:28 -05:00
Galantsev, Dmitrii
60467c45af
git-blame - Ignore formatting commit
...
There are several ways to ignore the formatting commit:
1. Configure the local project:
   git config --local blame.ignoreRevsFile .git-blame-ignore-revs
2. Run blame with an argument:
   --ignore-revs-file .git-blame-ignore-revs
   Example:
   git blame --ignore-revs-file .git-blame-ignore-revs rdci/src/rdci.cc
Change-Id: Ic6eaa740850d9f1462d841361480307646e46b5e
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-04-09 20:10:47 -05:00
Ranjith Ramakrishnan
0ca6d6fa59
Remove hard coded ROCm path in rdc.service
...
The executable rdcd was using an absolute path in rdc.service. Using update-alternatives gives the flexibility to invoke the binary from anywhere and no absolute path is required.
Change-Id: I2f3d6fcbf9dd854870cfc2e00532c504ce6cd6fc
2024-04-09 10:27:19 -05:00
Galantsev, Dmitrii
662cc0f8b2
Revert "Sort the ROCr gpu index based on BDF"
...
Fix 'rdcd diag' compute and system tests.
This reverts commit 61a2773875.
Change-Id: Ia092c46649c1d6338fb96ffe7e6feba4b045f027
2024-04-09 10:27:19 -05:00
Galantsev, Dmitrii
d1400df06c
GIT - Sync dependabot settings with amdsmi
...
Change-Id: I9442355fa0b4a7858c4c9232631a044789166601
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-04-04 17:02:05 -05:00
Galantsev, Dmitrii
9d55c26247
Remove -X from .hsaco files
...
Change-Id: I1f1b4f07eb854ce2e254564b83719be52b553b02
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-03-27 20:35:08 -05:00
Galantsev, Dmitrii
534d00e31f
Update CHANGELOG.md for ROCm 6.1
...
Change-Id: I50fd82a14f26f0f23f3c3931e242fddf46c5bd62
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-03-20 10:16:16 -05:00
Galantsev, Dmitrii
67578106c4
Fix links and add certificate gen guide
...
Change-Id: Ieece04baade54ee3a7cde968aa08077e0d0d8391
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-03-19 14:41:16 -05:00
Ranjith Ramakrishnan
b09eede016
Start rdc.service after installing the rdc package
...
rdc.service was being started in the preinstall scripts. It should be started only after the rdc package is installed, so the start step was moved to the postinstall scripts.
Change-Id: I9a8c733beea43f95474b990a35a431db287b9a8e
2024-03-12 13:30:27 -07:00
Galantsev, Dmitrii
ba88baef9c
Add .github/CONTRIBUTING.md
...
Change-Id: I7aa7381d973520a515d0539f4915ce67342a3a34
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-03-08 16:19:47 -06:00
David Galiffi
b34eafe45a
Add Doc team to CODEOWNERS file
...
Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Change-Id: Iad8eea0645b63bddb835ed22080facc7d25c1bc0
2024-03-06 17:58:36 -06:00
Galantsev, Dmitrii
6d5d9971c2
CMAKE - Find hsa-runtime64
...
Change-Id: Id877eb9cfcc61d81993a6a43703ef2e5f72e1e8f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-02-19 23:49:38 -05:00
Galantsev, Dmitrii
32806681ca
SWDEV-444700 - CMAKE - Fix RUNPATH
...
These RUNPATH changes make it so libraries can be found without setting
LD_LIBRARY_PATH.
Mostly tested on installed RDC binaries and libraries. The
build binaries should also work.
Change-Id: Ifd908a5b61d24dfcbb1d08d21b4ee830156d8643
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-02-13 16:56:28 -06:00
Galantsev, Dmitrii
81e3a78b1f
Remove unsupported rocprofiler metrics
...
Change-Id: If6cfbcbe018227c591733471ab203fc6675d50af
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-02-09 15:18:54 -06:00
Galantsev, Dmitrii
2c27473d6f
README - Fix URLs and add lychee config
...
Use Lychee[1] to check dead links
[1] - https://github.com/lycheeverse/lychee
Change-Id: I0e8aade7879748dbcb4700a527bcae5a2c29ecb5
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-02-08 17:06:02 -06:00
Galantsev, Dmitrii
f13a1fbea8
Upgrade gRPC v1.59.1 -> v1.61.0
...
Change-Id: I8a3f13dd8f264e28474bd65e92ac53f87ab7db3f
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
Depends-On: Icbb7b4a580894d78d8ef992befa26ce20fcf3309
2024-02-06 19:39:50 -06:00
Galantsev, Dmitrii
aa5448fc16
CMAKE: Reduce install messages size
...
Change-Id: I6fa7cfe986b1de702492a96bddbfd406501bba50
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-02-06 00:31:32 -06:00
Bill(Shuzhou) Liu
5cfe2b4169
Fallback to junction temperature and socket power
...
If the card does not have edge temperature, fall back to junction
temperature. If the card only has socket power, use socket
power instead.
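The fallback order described above can be sketched as a "first available reading" helper. This is only an illustration under assumptions: `SensorReader` and `first_available` are hypothetical names, not part of RDC or amdsmi, and the commit itself does not show how the fallback is implemented.

```cpp
#include <functional>
#include <optional>
#include <vector>

// Hypothetical reader type (an assumption, not an RDC or amdsmi API):
// returns a value if the sensor exists on this card, std::nullopt otherwise.
using SensorReader = std::function<std::optional<double>()>;

// Try each reader in priority order and return the first available reading,
// e.g. {edge temperature, junction temperature}.
std::optional<double> first_available(const std::vector<SensorReader>& readers) {
  for (const auto& read : readers) {
    if (!read) continue;  // skip readers that were never assigned
    if (auto value = read()) {
      return value;
    }
  }
  return std::nullopt;  // no sensor in the list is present on this card
}
```

A caller would pass the preferred reader first and the fallback second; the same helper could cover the socket power case by passing a different pair of readers.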
Change-Id: I053a67a89cf3b29a34e82123f522c08d7dd68916
2024-02-05 10:10:26 -06:00
Galantsev, Dmitrii
adf0d7094f
Add __pycache__ to .gitignore
...
Change-Id: I815cf3cdb644978d959b80136ac7e95da3d2ca8d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-19 09:32:35 -06:00
Galantsev, Dmitrii
70ada65079
Rebuild librdc_ras.so
...
- Make librdc_ras.so executable
Change-Id: I715ef1d828fe4d0ecf63b8272ffeccbab280f9dc
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-17 15:19:14 -06:00
Galantsev, Dmitrii
f9e80cc37a
Use templates for module population
...
Also add stddef.h workaround for old GCC.
RHEL-8 still uses GCC 8.5 and templates are not well supported.
Change-Id: Ia4dae23892ec63682ea848c46ba81de85cf6d209
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-10 00:27:09 -06:00
Galantsev, Dmitrii
eaa1862a80
RVS: Finish initial RVS integration
...
NOTE: RVS Build is disabled by default due to CI build issues.
Change-Id: I1593f0fe22075a9f86f54afa3ac151e109f1f7bd
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-10 00:27:04 -06:00
Galantsev, Dmitrii
434e40305d
LINT: Add cpplint, clang-format and pre-commit support
...
Change-Id: I3cbb787ef27d90486b212dfb1a8c77c460acc2ac
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-09 11:37:11 -06:00
Galantsev, Dmitrii
95e057c88d
Simplify ModuleMgr
...
Change-Id: I3a57876c73e50771fcedb7ca4c67d55ac406b34d
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2024-01-09 11:37:11 -06:00
Sam Wu
a5906e9363
Update rocm-doc-core to v0.30.3
...
Documentation theme updates
Change-Id: I043d34b2947b5b27e06ce6a4f4c32f4b1e8ad039
2023-12-21 16:43:17 -07:00
Galantsev, Dmitrii
82e4ea3b6f
SWDEV-436561 - Add CODEOWNERS
...
Change-Id: Ie806f1ba714a88643c0e5f9cb65bf70f8d59f1fb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-12-12 12:07:47 -06:00
Sam Wu
5890852ff1
Standardize documentation for ReadtheDocs
...
Relates to https://github.com/RadeonOpenCompute/rocm-docs-core/issues/330
Change-Id: Ic9370548bb8d919376b20f7e1800fe620369e69b
2023-12-08 16:56:59 -05:00
Galantsev, Dmitrii
ed3cfffd7e
Server - Add -a/--address option
...
Change-Id: Ia9e8d76b9a4ba0aadc567142601a87f0ad0b69e4
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-12-04 15:26:44 -06:00
AravindanC
c661bab06f
SWDEV-426649 - config file rocmpath hard coding removed
...
Change-Id: I01df16392201cc112c7533e8c092e4e336237b0b
2023-11-23 17:31:45 -05:00
Bill(Shuzhou) Liu
61a2773875
Sort the ROCr gpu index based on BDF
...
The rocm-smi index is changed to sort based on BDF. The rocr plugin
is also changed based on that.
Change-Id: I5851431db336d50266b253dec1894a7bd9f3554b
2023-11-16 09:07:22 -05:00
Bill(Shuzhou) Liu
1ab4110d46
RDC crash when exit
...
Join the signal handling thread instead of canceling it to prevent a
crash with "terminate called without an active exception".
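A minimal sketch of the join-on-exit pattern follows. It assumes a `std::thread` worker and a hypothetical `SignalWatcher` class; the commit does not show RDC's actual thread code. One well-known way to get this exact message is destroying a `std::thread` that is still joinable, which calls `std::terminate`.

```cpp
#include <atomic>
#include <chrono>
#include <thread>

// Hypothetical stand-in for a component that owns a signal-handling thread.
class SignalWatcher {
 public:
  SignalWatcher() : worker_([this] { run(); }) {}

  // Join before the std::thread member is destroyed. Destroying a joinable
  // std::thread calls std::terminate.
  ~SignalWatcher() { stop(); }

  // Ask the worker to exit its loop, then wait for it to finish.
  void stop() {
    stop_requested_.store(true);
    if (worker_.joinable()) worker_.join();
  }

  bool finished() const { return finished_.load(); }

 private:
  void run() {
    // Placeholder loop; a real handler would wait for signals here.
    while (!stop_requested_.load()) {
      std::this_thread::sleep_for(std::chrono::milliseconds(10));
    }
    finished_.store(true);
  }

  std::atomic<bool> stop_requested_{false};
  std::atomic<bool> finished_{false};
  std::thread worker_;  // declared last so the flags exist before it starts
};
```

The worker exits cooperatively through the stop flag, so every destructor on its stack runs normally and the owner never destroys a thread that is still running.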
Change-Id: I2e18eb825728fd3a94f67b1b0049516bb7b6ebbc
2023-11-03 09:10:22 -04:00
Galantsev, Dmitrii
e579cb04b2
Upgrade gRPC v1.44.0 -> v1.59.1
...
Change-Id: Ib43a41c61d4028ec029a8c179a94060315870fbb
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-10-19 17:29:36 -05:00
Galantsev, Dmitrii
8f9a6796f1
Upgrade to CXX-17 gtest-1.14
...
Change-Id: I1c7316f151128cbc9318b226dac14950e399d2c7
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-09-28 12:54:49 -05:00
Galantsev, Dmitrii
f6ace9fa14
README - Update documentation links
...
Change-Id: I2e778a766e6a4489280fe7b86f33a6c597983167
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-09-13 19:34:28 -05:00
Galantsev, Dmitrii
fc852fc915
.gitignore - Ignore more build files
...
Change-Id: I5b5207e65cc3fd6537800db388da142c0e76c3ff
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-09-06 10:21:11 -05:00
Galantsev, Dmitrii
824056b0be
.editorconfig - Remove whitespace rule
...
Change-Id: Ia928dcb49fc094889784a0afcbc4abbe35bd59c7
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-09-06 10:20:43 -05:00
Galantsev, Dmitrii
de252b21a4
SWDEV-410524 - Doxygen add WARN_AS_ERROR
...
Change-Id: I714712d61d1526cb75122a2f23e293745d41a701
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-08-11 11:57:44 -05:00
Public Profile
a3ac4bac21
fix broken links
...
Change-Id: Ibd941eb116fd9ae4ed7deeeb3a07324a2a3ca3c3
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-08-09 00:13:09 -05:00
Ranjith Ramakrishnan
2e096d9009
SWDEV-366827 - Disable file reorg backward compatibility support by default
...
Change-Id: I9c4201d7786be2e3f77bc1d4d15887741ba59ec5
2023-08-07 09:25:00 -07:00
Galantsev, Dmitrii
6e52a113a2
SYSTEMCTL: Check if running before stopping
...
When uninstalling the RDC application, the user is greeted with an
annoying "Failed to stop rdc.service..." message if RDC is not
running.
This change makes sure RDC is active before trying to stop it.
Change-Id: I6fa57bfd4b9c348514cd6c38e60ed3930d32b62c
Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>
2023-05-31 11:41:25 -05:00
Ranjith Ramakrishnan
c82fdeab8d
SWDEV-310152 - Removed the RUNPATH setting in source code
...
Use the RUNPATH provided by build scripts
Change-Id: Ib5b3f689dc20aeecf6974281625865fe650bfa72
2023-05-30 16:17:08 -04:00
Sam Wu
74dce41f4f
update documentation dependencies via requirements
...
rocm-docs-core v0.11.0
Change-Id: I2ecc8c6015b9bb186e1b3241eb84bcbda9c46152
2023-05-18 13:29:08 -06:00
Ranjith Ramakrishnan
bf49b88866
SWDEV-383221 - Set the default value of ROCM_HEADER_WRAPPER_WERROR to OFF
...
Using wrapper header files will result in a #warning message by default.
Change-Id: If5847e1b03523251238018b2cf0725b302619963
2023-05-08 20:45:08 -07:00
Sam Wu
1335d19020
add configs for read the docs
...
add handbook, user, install, and integration guides
Change-Id: I996f6909f4fdf76910981c0224f5a0266907e27a
remove old documentation steps
Change-Id: Icfad09926e67a2dfa1de0e182fc3cd534f0448f7
formatting fixes
Change-Id: I704bbbbf6ad384178f804e4a3f5e621f9c3d33b9
2023-05-05 15:44:34 -06:00