systems-assistant[bot]
0406493ae0
Merge commit 'b5e1645a1476ea48ceeb601076ccb0714cc2948b' into develop
2025-08-06 01:30:06 +00:00
Welton, Benjamin
b5e1645a14
Fix hsa_code_object_app test deadlock with profiler serialization (#577)
...
Problem with original test:
- Created circular dependencies between queues:
  * Queue1: Kernel A → Barrier(waits for signal_2) → Kernel C
  * Queue2: Barrier(waits for signal_1) → Kernel B → sets signal_2
- With strict "one kernel at a time" serialization, this created deadlock:
  * Queue1 executed Kernel A, then blocked on barrier waiting for signal_2
  * Serializer switched to Queue2, but Queue2 was blocked waiting for signal_1
  * Neither queue could proceed: Queue1 needed Queue2's Kernel B to complete,
    but Queue2 couldn't start until Queue1 finished completely
- Test would hang indefinitely at hsa_signal_wait_relaxed() for signal_2
Solution implemented:
- Reordered packet submission to eliminate circular dependencies
- Ensured signal producers execute before consumers need them:
  * Kernel A produces signal_1 before Queue2's barrier needs it
  * Kernel B produces signal_2 before Queue1's continuation needs it
- Dependencies now flow forward without cycles, allowing serializer progress
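The stall and the fix described above can be illustrated with a toy model. This is only a sketch under a stated assumption: the serializer is modeled as retiring packets one at a time in global submission order, which is a simplification and not the profiler's actual scheduling logic. The function name `run_serialized` and the tuple layout are hypothetical, and the reordered sequence is one ordering consistent with the description, not necessarily the exact order used in the test.

```python
def run_serialized(packets):
    """Retire packets one at a time, strictly in submission order.

    Each packet is a tuple:
      ("kernel", name, signal)  -- runs, then sets `signal` (may be None)
      ("barrier", signal)       -- may retire only once `signal` is set
    Returns (kernels_run, stalled).
    """
    signals_set = set()
    executed = []
    for packet in packets:
        if packet[0] == "barrier":
            if packet[1] not in signals_set:
                # Under strict serialization nothing later may run, so the
                # awaited signal can never be produced: report a stall.
                return executed, True
        else:
            _, name, signal = packet
            executed.append(name)
            if signal is not None:
                signals_set.add(signal)
    return executed, False


# Original submission order (assumed): Queue1's packets, then Queue2's.
original = [
    ("kernel", "A", "signal_1"),  # Queue1
    ("barrier", "signal_2"),      # Queue1 waits for Kernel B
    ("kernel", "C", None),        # Queue1
    ("barrier", "signal_1"),      # Queue2 waits for Kernel A
    ("kernel", "B", "signal_2"),  # Queue2
]

# Reordered so each signal producer is submitted before its consumer.
reordered = [
    ("kernel", "A", "signal_1"),  # Queue1
    ("barrier", "signal_1"),      # Queue2
    ("kernel", "B", "signal_2"),  # Queue2
    ("barrier", "signal_2"),      # Queue1 continuation
    ("kernel", "C", None),        # Queue1 continuation
]

print(run_serialized(original))   # (['A'], True)
print(run_serialized(reordered))  # (['A', 'B', 'C'], False)
```

In this model the original order stalls after Kernel A because the barrier on signal_2 blocks everything behind it, while the reordered sequence lets every dependency resolve before it is needed.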
Refactoring changes:
- Extract common functionality into helper functions:
  * create_completion_signal() for signal creation
  * create_queue() for queue creation
  * submit_kernel_packet() for kernel dispatch packets
  * submit_barrier_packet() for barrier packets
- Add comprehensive documentation explaining expected execution pattern
- Simplify main() function making the dependency flow more readable
Co-authored-by: Benjamin Welton <bewelton@amd.com>
2025-08-05 17:29:07 -07:00
systems-assistant[bot]
cdacbdc758
Merge commit '6918af0dd60df64fa445256480dd27be33979c07' into develop
2025-08-05 23:01:31 +00:00
Joseph Macaranas
00a09fbf21
Aug 5 Migration Updates
2025-08-05 17:42:06 -04:00
Joseph Macaranas
ad660b7bd8
Source of truth updates for Aug 5
2025-08-05 17:10:23 -04:00
Joseph Macaranas
792cf209a7
Updating migration status from Aug 5th
2025-08-05 17:06:50 -04:00
Joseph Macaranas
edf4735b3e
Preserve file-path commit histories
2025-08-05 16:59:19 -04:00
Joseph Macaranas
0892fc31c7
Preserve file-path commit histories
2025-08-05 16:57:37 -04:00
Kandula, Venkateshwar reddy
6918af0dd6
[CI] fix RHEL requirements issue (#563)
...
* use gcc/g++ from /opt/rh/
2025-08-05 15:47:47 -05:00
Joseph Macaranas
7bc75a21da
Preserving file-path commit histories
2025-08-05 15:56:47 -04:00
systems-assistant[bot]
aad122c939
Merge commit '5f422c1993e4d920b388cc4fc9c52983d507c763' into develop
2025-08-05 17:05:53 +00:00
Bhardwaj, Gopesh
5f422c1993
Fix missing include file in rocJPEG test app (#525)
...
- Fixes compilation error for rocJPEG test app
- SWDEV-544094
2025-08-05 11:50:01 -05:00
Joseph Macaranas
44df81c7fd
GitHub Actions: Don't run sync jobs on forks
2025-08-05 12:12:30 -04:00
Baraldi, Giovanni
54161f54fe
Update readme for aqlprofile (#570)
...
* Update readme for aqlprofile
* Apply suggestions from code review
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>
---------
Co-authored-by: Giovanni Baraldi <gbaraldi@amd.com>
Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>
2025-08-05 11:11:40 -05:00
U, Srihari
fc3eb4211d
[SDK] Fix data race by syncing emplace and flush operations. (#529)
...
* Fix data race by syncing reader and writer
* Address review comments
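As a rough illustration of what "syncing emplace and flush" can mean, the sketch below guards both operations with one lock so a flush never observes a half-updated buffer. It is an analogy written in Python under stated assumptions: the class `RecordBuffer` and its methods are hypothetical names, and the SDK's actual fix is not shown in this log and may be structured differently.

```python
import threading


class RecordBuffer:
    """Hypothetical buffer whose writer and reader share a single lock."""

    def __init__(self):
        self._lock = threading.Lock()
        self._records = []

    def emplace(self, record):
        # Writer side: append while holding the lock.
        with self._lock:
            self._records.append(record)

    def flush(self):
        # Reader side: swap the list out under the same lock so no record
        # is lost or returned twice.
        with self._lock:
            drained, self._records = self._records, []
        return drained


buffer = RecordBuffer()
flushed = []


def writer(start):
    for value in range(start, start + 1000):
        buffer.emplace(value)


threads = [threading.Thread(target=writer, args=(n * 1000,)) for n in range(4)]
for thread in threads:
    thread.start()
# Flush concurrently while the writers are still running.
while any(thread.is_alive() for thread in threads):
    flushed.extend(buffer.flush())
for thread in threads:
    thread.join()
flushed.extend(buffer.flush())  # pick up anything written after the last flush

print(len(flushed), len(set(flushed)))  # 4000 4000
```

Every record is flushed exactly once, regardless of how the writer threads and the flushing loop interleave.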
2025-08-05 11:07:00 -05:00
systems-assistant[bot]
f2eb9e07cc
Merge commit 'e3206c8a29d286c448b533d51b318c3f80b7158c' into develop
2025-08-05 16:06:35 +00:00
systems-assistant[bot]
1c3af22585
Merge commit 'a84c455a9de4569e589dc22d254ef3162a2de12a' into develop
2025-08-05 16:06:34 +00:00
Radosavljevic, Milan
e3206c8a29
Include installation sql header in rocpd library (#576)
...
Include installation of rocpd sql header
2025-08-05 11:04:09 -05:00
Rawat, Swati
ca9905d6bb
Update CHANGELOG.md (#557)
...
* Update CHANGELOG.md
* some more updates
---------
Co-authored-by: srawat <120587655+SwRaw@users.noreply.github.com>
2025-08-05 10:29:42 -05:00
Elwazir, Ammar
cb816c1c21
Update sync-mainline.yaml
...
[ROCm/rocprofiler commit: a84c455a9d]
2025-08-05 10:28:30 -05:00
Elwazir, Ammar
a84c455a9d
Update sync-mainline.yaml
2025-08-05 10:28:30 -05:00
Elwazir, Ammar
909f551889
Update sync-staging.yaml
...
[ROCm/rocprofiler commit: 008ee0a6f5]
2025-08-05 10:27:18 -05:00
Elwazir, Ammar
008ee0a6f5
Update sync-staging.yaml
2025-08-05 10:27:18 -05:00
Elwazir, Ammar
2cf4ef5ea0
Update sync-staging.yaml
...
[ROCm/rocprofiler commit: 07a55ae99c]
2025-08-05 10:22:54 -05:00
Elwazir, Ammar
07a55ae99c
Update sync-staging.yaml
2025-08-05 10:22:54 -05:00
systems-assistant[bot]
3b008fae9c
Merge commit 'e8675243442b2fb1044235af47affad11f2ca54c' into develop
2025-08-05 15:07:38 +00:00
vedithal-amd
e867524344
Fix autogen config test (#856)
...
* Only check file content hash consistency
* Do not check file modification timestamps since that can be updated by git commands
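The idea of comparing content hashes rather than modification timestamps can be sketched as follows. This is a minimal illustration, not the test's actual code: the helper names `content_hash` and `files_match`, the file names, and the choice of SHA-256 are all assumptions made for the example.

```python
import hashlib
import os
import tempfile
from pathlib import Path


def content_hash(path):
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def files_match(path_a, path_b):
    """Compare two files by content only, ignoring their timestamps."""
    return content_hash(path_a) == content_hash(path_b)


with tempfile.TemporaryDirectory() as tmp:
    first = Path(tmp) / "first.yaml"
    second = Path(tmp) / "second.yaml"
    first.write_text("metric: 1\n")
    second.write_text("metric: 1\n")
    # Simulate a tool (such as a git checkout) rewriting the timestamp.
    os.utime(second, (0, 0))
    same_content = files_match(first, second)
    same_mtime = first.stat().st_mtime == second.stat().st_mtime

print(same_content, same_mtime)  # True False
```

The two files differ in modification time yet still compare equal, which is the property a timestamp-based check would miss.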
2025-08-05 10:50:32 -04:00
systems-assistant[bot]
14b0b2a7d7
Merge commit 'fd6f96ffb54054b405a6f05f800c64394126672d' into develop
2025-08-04 22:06:08 +00:00
Baraldi, Giovanni
fd6f96ffb5
Adding GPU index as a parameter for ATT (#547)
...
* Adding GPU index as a parameter for ATT
* Tidy fix
* Using tokenize
* Update tests/rocprofv3/advanced-thread-trace/CMakeLists.txt
Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>
* Update tests/rocprofv3/advanced-thread-trace/CMakeLists.txt
* Adding error logging. Using idx instead of id.
---------
Co-authored-by: Giovanni <gbaraldi@amd.com>
Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>
2025-08-04 23:15:50 +02:00
systems-assistant[bot]
e7f8c9e278
Merge commit '533a8329d8f38b66b6c3577f141396a99d3fabbc' into develop
2025-08-04 19:07:14 +00:00
Trowbridge, Ian
533a8329d8
Revert memory allocation CSV output file header and update tests (#532)
...
* Reverted header and field location for csv memory allocation and updated tests
* Updated example csv file and made small update
2025-08-04 13:22:27 -05:00
systems-assistant[bot]
a2a355236a
Merge commit 'ea0a3e8da4d0b7e18654f85a92b2428197683459' into develop
2025-08-03 03:38:45 +00:00
systems-assistant[bot]
0c7d3048e6
Merge commit 'c424dac2617be1af60115802b0b094a60d91646c' into develop
2025-08-03 03:38:44 +00:00
systems-assistant[bot]
38b30a4a6f
Merge commit '7251cea0bc9e90217aea53bd8870a5be728814c2' into develop
2025-08-03 03:38:43 +00:00
systems-assistant[bot]
86a5e6b8a0
Merge commit 'b349e406eddbd7c4a1663823c7318fd54a4507de' into develop
2025-08-03 03:38:41 +00:00
systems-assistant[bot]
807784b3a3
Merge commit '31dfc0fccedbd50a93dad3ff514e74ed252ec239' into develop
2025-08-03 03:38:38 +00:00
Joseph Macaranas
870dc948b1
Terminology change: monorepo -> super-repo
2025-08-02 22:23:24 -04:00
Joseph Macaranas
153f71fb0d
Append, instead of prepend, breadcrumb for patch backs to old repos
2025-08-02 22:13:43 -04:00
Joseph Macaranas
c2d62973d4
Copying updates from rocm-libraries
2025-08-02 22:08:24 -04:00
Joseph Macaranas
d5000f623e
Update CODEOWNERS
2025-08-02 19:37:03 -04:00
vedithal-amd
b349e406ed
Fix L2 cache bandwidth metrics for MI350 (#843)
...
* Fix L2 cache bandwidth metrics for MI350
* Address review comments
2025-08-01 14:00:46 -04:00
vedithal-amd
6902b12e65
Fix metrics description (#853)
...
* Fix ordering of metrics description
2025-08-01 13:56:29 -04:00
vedithal-amd
d84863b7af
Remove duplicate metric descriptions from documentation (#852)
2025-08-01 13:33:58 -04:00
Fei Zheng
0d6e172679
update license files (#849)
2025-08-01 10:14:39 -06:00
xuchen-amd
9ddee8de2e
TUI: Kernel Selection (#769)
2025-08-01 11:31:43 -04:00
Fei Zheng
81daaaecc7
Fix pc sampling unit test (#847)
2025-08-01 09:28:43 -06:00
U, Srihari
7251cea0bc
Fix rocpd kernel traces csv output (#497)
...
* Fix rocpd kernel traces csv
* Updated CHANGELOG.md
* Add guid header
* address comment.
* Fix test failures
---------
Co-authored-by: Srihari Uttanur <srihariu@amd.com>
2025-08-01 08:31:21 +05:30
Yiannis Papadopoulos
ea0a3e8da4
libhsakmt: Use numa_node_size64 with long long
2025-07-31 18:17:52 -05:00
vedithal-amd
70ebb4a299
Backport CHANGELOG changes from 7.0 release (#845)
...
* Backport CHANGELOG changes from 7.0 release
* Backport CHANGELOG changes from https://github.com/ROCm/rocprofiler-compute/pull/815
2025-07-31 19:02:50 -04:00
amd-hsivasun
6df6f3ec6e
Rocprofiler-Compute Formatting Workflow (#18)
...
* Added Formatting Workflow for Compute
* Update rocprofiler-compute-formatting.yml
* Removed use of action for black and isort in compute formatting
---------
Co-authored-by: Sivasuntharampillai, Haresh <Haresh.Sivasuntharampillai+amdeng@amd.com>
2025-07-31 17:47:18 -04:00