Commit Graph

1156 Commits

Author SHA1 Message Date
AidanBeltonS bf662640ee SWDEV-539805, SWDEV-553860 - Resolve GCC clang ABI mismatch and check vector alignment (#909)
* SWDEV-539805 - Add checks for vector alignment and size

* SWDEV-553860 - Alter alignment for gcc

* SWDEV-553860 - Align fallback method

* SWDEV-553860 - Alter alignment requirement
2025-09-16 17:10:14 +01:00
systems-assistant[bot] 857e5ef3ce chore: unset executable permission (#213)
Co-authored-by: Eisuke Kawashima <e-kwsm@users.noreply.github.com>
Co-authored-by: systems-assistant[bot] <systems-assistant[bot]@users.noreply.github.com>
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>
2025-09-16 11:06:54 -05:00
systems-assistant[bot] 88201d2b79 [SWDEV-544729] Updated CLI error handling (#216)
Updated: rocm_smi.py
- Remove all else: clauses from functions where rsmi_ret_ok is part of the if clause, as requested.
- rsmi_ret_ok() function already handles unsucessful return codes and gracefully handles them.
- Updated check_runtime_status() function to sweep through /sys/class/drm to find active runtime_status.
- Updated the message to' AMD GPU device(s) is/are in a low-power state. Check power control/runtime_status'
- This clarifies the status of the GPU and tells them where to check for more info.

Signed-off-by: Juan Castillo <juan.castillo@amd.com>
Co-authored-by: Juan Castillo <juan.castillo@amd.com>
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com>
Co-authored-by: gabrpham <Gabriel.Pham@amd.com>
2025-09-16 10:56:03 -05:00
itrowbri 8ba7120b63 [rocprofiler-sdk] Verify there are callbacks for every kernel dispatch when syncing (#321)
* Added check in Queue::sync to verify that there is a callback for every dispatch

* Removed new atomic, using get_balanced_signal_slots() atomic with initial value of NUM_SIGNALS to verify dispatches complete
2025-09-16 10:35:16 -05:00
systems-assistant[bot] 3b5467b746 [DOC] single pass counter collection (#95) 2025-09-16 11:00:11 -04:00
Sunday Clement db63d4c38b hsakmt: Update udmabuf.h License Identifier Header (#873)
Fix typos, and update the license header to include SPDX license
identifier.

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>
2025-09-16 10:36:02 -04:00
ywang103-amd 97f8b7b1ec change to single-kernel workload for pc_sampling tests (#955) 2025-09-16 10:17:23 -04:00
Matt Williams af2f2c1345 Update index.rst (#1014) 2025-09-16 09:59:04 -04:00
systems-assistant[bot] f1fabcfd64 rocr: Error Handling Issues (#264)
* rocr: Fix Incorrect Assertion Check

The wrong variable is used in the assertion statement, should be error
checking for the value of paramEndLoc after it is modified by the call
to find().

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>

* rocr: Fix Potential Undefined Behaviour

In the event that the SvmProfileControl destructor is called and
event == -1 is true then the call to close(event) is effectively
close(-1) which is undefined behaviour. This has been changed to only
call close() on valid file descriptors.

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>

* rocr: Add Error Check on Bytes Read

In the case that there is an incomplete read the call to copyTo() will
now return an error.

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>

* rocr: Fix Exception Error

Destructors are implicitly marked with noexcept being true by default
so if its not explicitly marked false in the destructor or the
functions it calls, any thrown exceptions will cause the program to
crash.

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>

---------

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>
Co-authored-by: Sunday Clement <Sunday.Clement@amd.com>
2025-09-16 09:43:45 -04:00
xuchen-amd a442766d26 [rocprofiler-compute] improve profile options (#999) 2025-09-15 18:21:45 -04:00
Aleksei Tumakaev 646e4d211a [rocpd] Use SQL queries instead of views in summary generator (#311)
* Use queries instead of views in summary.py

* Export queries when created

* Remove HIP and HSA from output

* Fix domain query

* Export summary queries in the main function

* Fix comments and variable names

* Change syntax for old python versions

---------

Co-authored-by: Young Hui <young.hui@amd.com>
2025-09-15 17:13:06 -04:00
harkgill-amd 902ec4d3ad Fix documentation to match function signature (#990)
Co-authored-by: ammallya <ameyakeshava.mallya@amd.com>
2025-09-15 11:19:21 -07:00
Alysa Liu 7277ecc9a3 rocminfo: Add copyright for new files (#888)
Legal Requirements:

For AMD software being released as open source, add copyright at the top of each new file.

Signed-off-by: Alysa Liu <Alysa.Liu@amd.com>
2025-09-15 11:36:49 -04:00
vstojilj f24e2ca676 SWDEV-546865 - Disable core dumps when running tests (#880)
* SWDEV-546865 - Disable core dumps when running tests

* SWDEV-546865 - Disable core dumps only for tests that require it
2025-09-15 15:58:41 +02:00
harkgill-amd d1b2b5ed44 Fix grid_group::group_dim to return grid_dim and not block_dim (#823)
* Fix grid_group::group_dim to return grid_dim and not block_dim

* Add unit test for grid_group.group_dim()

* Fix unit test errors

* Skip group_dim() assertions for base_type test
2025-09-15 09:42:55 -04:00
systems-assistant[bot] 2f7e9591be SWDEV-541096 - add hipEventWaitDefault and hipEventWaitExternal (#453)
Co-authored-by: Li, Todd tiantuo <Toddtiantuo.Li@amd.com>
2025-09-13 10:33:00 -07:00
Dmitrii 8abe24d3b0 rdc: Add CPU support and CPU metrics infrastructure (#770) 2025-09-12 16:14:38 -05:00
xuchen-amd eb46160a8f update proj toml (#974) 2025-09-12 16:24:44 -04:00
Venkateshwar Reddy Kandula 4daf25944d add gotcha to rocpd cpack component. (#904)
Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
2025-09-12 15:21:44 -05:00
Julian Jose 8157437273 [Palamida scan] SWDEV-553054 Adding missing copyrights information (#900)
* Add missing copyright headers in rocprofiler-systems
* Update python-tests
* Update causal test

---------

Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-09-12 14:17:58 -04:00
xuchen-amd 7ed6000e32 [rocprofiler-compute] Refactor to add type annotation and misc (#787) 2025-09-12 13:53:24 -04:00
marandje 3a37389f6a SWDEV-547554 - Resolve memory leaks in hiprtc tests (#967) 2025-09-12 18:12:15 +02:00
Kian Cossettini 5d582fcd37 [rocprofiler-systems] Add Fortran OpenMP CTests (#874)
* Added Fortran (amdflang) openmp tests using the openmp-vv project

---------

Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-09-12 09:52:16 -04:00
habajpai-amd 1c7293e6d0 Add bounds checks in transpose_a for both load and store so edge tiles dont read/write past MxN (#950) 2025-09-12 17:32:30 +05:30
Venkateshwar Reddy Kandula 26e7c4231e [rocprofiler-sdk] Add derived metrics for Navi4 (#238)
* add more derived metrics for navi4.

* addr comments

* addr comments, and add more derived counters.

* EOF.

* misc.

* remove duplicate counter.

* misc.

* Remove gfx12 architecture definition for ldslatency

* remove extra architectures for gfx12.

* use wgp for normalization

* move these changes to another PR.

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
2025-09-12 02:30:10 -05:00
abchoudh-amd 7d847dde3f Split tests (#952) 2025-09-12 12:29:48 +05:30
Venkateshwar Reddy Kandula aa3313aa99 add dl lib to utility_tests (#961) 2025-09-12 10:28:52 +05:30
Marius Brehler 01828d1375 Force gzip to overwrite an existing changelog (#665)
If a compressed changelog exists from a previous build, reconfiguring
the project fails with
```
[rocm-core configure] CMake Error at utils.cmake:213 (message):
[rocm-core configure]   Failed to compress: gzip:
[rocm-core configure]   /home/ben/src/TheRock/build/base/rocm-core/build/DEBIAN/changelog.Debian.gz
[rocm-core configure]   already exists; not overwritten
```

Add `-f` to force overwriting.
2025-09-11 16:34:37 -07:00
systems-assistant[bot] c85200fc42 SWDEV-541096 - add hipEventWaitDefault and hipEventWaitExternal flags (#507)
Co-authored-by: Li, Todd tiantuo <Toddtiantuo.Li@amd.com>
2025-09-11 14:50:55 -07:00
Jatin Chaudhary 3742814d82 SWDEV-553757 - add __HIP__ and __clang__ check for __shfl functions (#872) 2025-09-11 21:57:39 +01:00
Pengda Xie 05f12bf573 SWDEV-550582 - Ensure kernel launched to correct device in hip-tests (#852) 2025-09-11 12:24:35 -07:00
jamessiddeley-amd 62843ed900 [rocprof-compute] Wrap negative values in L2 - Fabric interface detailed metrics (#833)
* update gfx942 soc_config 1700

* add MAX wrapper for Write and Atomic (32B)

* removed trailing whitespace and EA* fix

* added to CHANGELOG.md

* edited changelog
2025-09-11 14:51:58 -04:00
marandje 4ebc4ff5ba SWDEV-553750 - Fix codeql errors in hip-tests (#895) 2025-09-11 18:11:11 +02:00
Alysa Liu 2b2b8329b5 rocr: Add copyright for new files (#886)
Signed-off-by: Alysa Liu <Alysa.Liu@amd.com>
2025-09-11 10:56:31 -04:00
systems-assistant[bot] 3e1e2408a9 SWDEV-541427 - Fix forked stream joining to parent stream that is not origin stream(BeginCaptureStream) (#449)
Co-authored-by: Anusha GodavarthySurya <Anusha.GodavarthySurya@amd.com>
Co-authored-by: systems-assistant[bot] <systems-assistant[bot]@users.noreply.github.com>
Co-authored-by: Godavarthy Surya, Anusha <agodavar@amd.com>
2025-09-11 16:57:33 +05:30
systems-assistant[bot] 0647cf1d28 SWDEV-542700 - Return an error if stream capture is attempted on the null stream while a stream capture is active. (#450)
Co-authored-by: Anusha GodavarthySurya <Anusha.GodavarthySurya@amd.com>
Co-authored-by: systems-assistant[bot] <systems-assistant[bot]@users.noreply.github.com>
Co-authored-by: Godavarthy Surya, Anusha <agodavar@amd.com>
2025-09-11 16:57:22 +05:30
sluzynsk-amd a141df3518 SWDEV-549708 - WarpTest memory leak fix (#838)
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com>
2025-09-11 11:22:38 +02:00
Gopesh Bhardwaj dd44ae3295 [Palamida scan] SWDEV-553053 Adding missing copyrights information (#836)
* SWDEV-553053 Adding missing copyrights information
2025-09-10 23:44:27 -07:00
Julian Jose 2d3803da89 Update using-rocprofv3 documentation (#331)
* Update using-rocprofv3 documentation

* Update using-rocprofv3.rst

* Update using-rocprofv3.rst

* Update projects/rocprofiler-sdk/source/docs/how-to/using-rocprofv3.rst

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/docs/how-to/using-rocprofv3.rst

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>

---------

Co-authored-by: Gopesh Bhardwaj <gopesh.bhardwaj@amd.com>
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>
2025-09-11 12:11:04 +05:30
Ammar ELWazir 069d5ecce2 [ROCProfiler SDK] Updating README Building & Installing Instructions (#931)
* Updating ROCProfiler SDK README

* Fixing ROCProfiler SDK License

* Fixing ROCProfiler SDK Installation Steps

---------

Co-authored-by: Joseph Macaranas <145489236+jayhawk-commits@users.noreply.github.com>
2025-09-11 12:08:49 +05:30
systems-assistant[bot] 5b9ba5bda0 Adding more unit tests to reach 80% (#177)
* Adding more unit tests to reach 80%

* Adding pmc and command builder tests

* tests for spm, sqtt and trace config

* removing non-existent struct members from test

* Adding logger tests

* aqlprofile_v2 tests

* spm builder tests

* Addressed feedback

---------

Co-authored-by: gobhardw <gopesh.bhardwaj@amd.com>
Co-authored-by: systems-assistant[bot] <systems-assistant[bot]@users.noreply.github.com>
2025-09-11 12:05:42 +05:30
Venkateshwar Reddy Kandula 66f0df465e [rocprofiler-sdk][Tests] add counter dimensions instance tests for rocprofiler_counter_info_v1_t API change (#637)
* Initial commit, counter dimensions tests, public API.

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update projects/rocprofiler-sdk/source/lib/rocprofiler-sdk/counters/tests/metrics_test.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* add comments.

* Format.

* Format.

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-09-11 01:08:33 -05:00
Julian Jose 1a39135951 add missing copyright headers in rocprofiler-register (#893) 2025-09-11 10:02:07 +05:30
systems-assistant[bot] e5c155d3f3 SWDEV-547453 - Resolve memory leaks in module tests (#535) 2025-09-11 01:16:21 +02:00
marandje ea68344f2f SWDEV-553962 - Add p2p check to multidevice virtual memory tests (#898) 2025-09-11 01:15:18 +02:00
Ioannis Assiouras 35629e433d SWDEV-546146 - Added support for hipMemLocationTypeHost in hipMemSetAccess (#682) 2025-09-10 23:06:20 +01:00
Joseph Macaranas dd1a2dbf8a Fix LICENSE path for opencl build (#939) 2025-09-10 17:54:22 -04:00
systems-assistant[bot] b60c0ceddd [rocprofv3] Unconditionally collect stream and kernel rename data in rocprofv3 for rocpd (#171)
* Remove config checks for stream and kernel rename data collection

* Updated csv generation to check if kernel rename is on before calling get_kernel_name

* Update metadata to use kernel_rename bool argument

* Formatting + unconditionally store kernel name in rocpd

* Readded kernel rename parameter after rebase

* Fixed rebase conflicts

* Updated comment in line with github comments

* Added check in rocpd csv.cpp to output kernel name if region name is empty

* Add test for kernel rename

---------

Co-authored-by: Ian Trowbridge <Ian.Trowbridge@amd.com>
2025-09-10 16:03:15 -05:00
systems-assistant[bot] bfdb3bc636 fix(python): fix comparison to None (#211)
from PEP8 (https://peps.python.org/pep-0008/#programming-recommendations):

> Comparisons to singletons like None should always be done with is or is not, never the equality operators.

Co-authored-by: Eisuke Kawashima <e-kwsm@users.noreply.github.com>
2025-09-10 14:50:32 -05:00
systems-assistant[bot] 39ea16e544 fix(E712): fix comparison to True/False (#212)
from PEP8 (https://peps.python.org/pep-0008/#programming-recommendations):

> Comparisons to singletons like None should always be done with is or is not, never the equality operators.

Co-authored-by: Eisuke Kawashima <e-kwsm@users.noreply.github.com>
2025-09-10 14:50:23 -05:00