Commit graph

817 commits

Author SHA1 Message Date
Kandula, Venkateshwar reddy 0ff0ffffa2 [SDK] Expose counter dims in rocprofiler_counter_info_v1_t and only show counters being profiled in metadata. (#325)
* expose dimensional info in rocprofiler_counter_info_v1_t.

* add counter_id in dim info.

* address review comments

* format.

* address comments.

* use array of pointers for dimensions_instances.

* format and comments.

* address comments.

* new line.

* Update counter_defs.yaml

* Update counter_defs.yaml

* Update counter_defs.yaml

* counter_defs.

* format counter defs.

* format counter defs.

* format counter defs.

* show only counters being profiled in metadata.

* Format.

* use config for counters and fix warnings.

* add version for rocprofiler_counter_dimension_info_v1_t struct.

* rename rocprofiler_counter_record_dimension_instance_v1_info_t.

* account device id from pmc for counters metadata.

* move dim structs to counters.h.

* address comments to compare value.

* fix tests.

* Address comments. use pointer of arrays for ABI.

* rebase.

* fix build error.

* use separate metadata::init() for rocprofv3.

* also print not found counters.

* precompute all the perf counters needed to be in metadata.

* Misc.

* format

* Format.

* rocprofiler::sdk::container::c_array

* Address comments.

* source/lib/output/metadata.cpp

* lint.

* add unit test for c_array.

* add unit test and serialization support for c_array container.

* Misc.

* Clean files.

* Format.

* clang-tidy.

* add more checks to c_array.

* misc. typo

* Addr comments.

---------

Co-authored-by: Venkateshwar Reddy Kandula <vkandula@amd.com>
Co-authored-by: Jonathan R. Madsen <Jonathan.Madsen@amd.com>

[ROCm/rocprofiler-sdk commit: bf0fad1d54]
2025-07-22 14:24:25 -07:00
Gill, Harkirat b88018d24d Update output file fields docs to correctly define Grid_Size (#526)
Co-authored-by: Bhardwaj, Gopesh <Gopesh.Bhardwaj@amd.com>

[ROCm/rocprofiler-sdk commit: e948034c83]
2025-07-22 23:16:01 +05:30
Nagaraj, Sriraksha 28d2a8f5bb [rocprofv3-avail] - Add sample data (#514)
* Add sample data for avail and remove color code for non terminal output

* review comments

* review comments

* add documentation

* test fix

[ROCm/rocprofiler-sdk commit: 2447a85215]
2025-07-22 08:39:59 -07:00
Indic, Vladimir f38f6cc82e MI350 stochastic support in changelog (#530)
[ROCm/rocprofiler-sdk commit: 57bfd0f607]
2025-07-22 08:39:22 -07:00
Baraldi, Giovanni 620924b15f Adding inline callstack information to disassembly (#468)
* Adding callstack information to disassembly

* changelog

* Cleanup

* Fix snapshots.json

* Clang tidy fixes

* Fix infinite recursion

* Apply suggestions from code review

Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>

* Remove sibling traversal

* Added docstrings

* Apply suggestions from code review

* Update source/include/rocprofiler-sdk/cxx/codeobj/code_printing.hpp

* Review comments

* Format + comments

* Fmt

* Add class name

* Format

* Fix static linkage

* Making funcs inline

---------

Co-authored-by: Giovanni <gbaraldi@amd.com>
Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>

[ROCm/rocprofiler-sdk commit: be74e1b49f]
2025-07-22 15:52:42 +02:00
Indic, Vladimir f6ed502afd [Stochastic PC Sampling][GFX9] Increasing ROCr buffer to 4MBs. (#487)
Increasing ROCr buffer to 4MBs.

[ROCm/rocprofiler-sdk commit: de4708963e]
2025-07-18 12:06:17 -05:00
Madsen, Jonathan 990946e956 [SDK] Fix null handles (#474)
* Fix null handle

- use .handle=0, not .handle=numeric_limits<>::max()

* Update lib.common.hasher

* Fix ROCPROFILER_CONTEXT_NONE

* Use context operator==

* Update CHANGELOG

* Updated null handle for scratch memory and changed allocation test so that free ops account for null agent

---------

Co-authored-by: Ian Trowbridge <Ian.Trowbridge@amd.com>

[ROCm/rocprofiler-sdk commit: 4d6a61f5e5]
2025-07-18 12:05:52 -05:00
Nagaraj, Sriraksha 483a10f53a [SDK] Update UUID (rocprofiler_uuid_t) (#390)
* changing uuid abi

* fix

* review comments

* fix CI fail

* review comments

* fix

* adding static asserts

* making constructor constexpr

* fix CI fail

* update UUID length to 16 bytes

* fixing value64

Co-authored-by: Madsen, Jonathan <Jonathan.Madsen@amd.com>

* Update CHANGELOG.md

Co-authored-by: Madsen, Jonathan <Jonathan.Madsen@amd.com>

---------

Co-authored-by: Madsen, Jonathan <Jonathan.Madsen@amd.com>

[ROCm/rocprofiler-sdk commit: fcfb5bd918]
2025-07-18 11:24:48 -05:00
Bonnell, Jason 052eb24881 replace azure runners with internal (#518)
* replace azure runners with internal

* change to mi300a for debug

* revert back to mi300

* move some of the load to mi300a

* use mi300a for clang-tidy

---------

Co-authored-by: Kandula, Venkateshwar reddy <Venkateshwarreddy.Kandula@amd.com>

[ROCm/rocprofiler-sdk commit: f4cce0d2d4]
2025-07-17 18:09:17 -05:00
Indic, Vladimir d5aba741f3 [Host-Trap PC Sampling] Host-Trap PC sampling can introduce an arbitrary sampling skid of [0, 2] instructions (#515)
* Arbitrary host-trap sampling skid (doc)

Host-trap PC sampling might introduce a skid of [0, 2]
instructions. We documented this and provide some advice
to application developers on how to find hot-spots in the
profiles generated by host-trap sampling.

[ROCm/rocprofiler-sdk commit: 650d35bdaa]
2025-07-17 17:59:46 +02:00
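One way to read a profile in the presence of such a skid is to rank short windows of instructions rather than single program counters. The sketch below is not taken from the documentation this commit added. It assumes the reported PC lands on, or up to two instructions after, the instruction that was actually executing, and `window_totals`, `samples`, and the instruction indices are hypothetical names used only for illustration.

```python
from typing import Dict, List, Tuple


def window_totals(
    samples: Dict[int, int], num_instructions: int, max_skid: int = 2
) -> List[Tuple[int, int]]:
    """For each instruction index i, sum the samples reported at i..i+max_skid.

    Assumption: a sample caused by instruction i may be reported at any of
    the next `max_skid` instructions, so its window collects all of them.
    """
    totals = []
    last = num_instructions - 1
    for i in range(num_instructions):
        upper = min(i + max_skid, last)
        total = sum(samples.get(j, 0) for j in range(i, upper + 1))
        totals.append((i, total))
    return totals


# Hypothetical per-instruction sample counts for a 10-instruction kernel.
samples = {3: 5, 4: 40, 5: 10, 9: 2}
totals = window_totals(samples, num_instructions=10)
hottest = max(totals, key=lambda item: item[1])
print(hottest)  # (3, 55)
```

Here the single hottest PC is index 4, but the window starting at index 3 collects the most samples, so instructions 3 through 5 are the region worth inspecting.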
Wang, Huanran f96cafaa60 [rocprofv3] Sanitize string of command in rocpd (#467)
Sanitize string of command in rocpd

Co-authored-by: Huanran Wang <huanrwan@amd.com>

[ROCm/rocprofiler-sdk commit: 5c45c77ec7]
2025-07-15 13:01:14 -04:00
Tumakaev, Aleksei b439b4e719 [rocprofv3] Fix incorrect position of Allocation_Size header in memory_allocation csv (#510)
Fix incorrect position of Allocation_Size header in memory_allocation csv

[ROCm/rocprofiler-sdk commit: 4af73d078f]
2025-07-15 18:15:22 +02:00
Kandula, Venkateshwar reddy f2a5139a37 [CI] add hip api table version of 13 to enum string (#509)
add hip table version to 13, API_ID_LAST is not changed from version=12 since no new struct has been added.

Co-authored-by: Venkateshwar Reddy Kandula <vkandula@amd.com>

[ROCm/rocprofiler-sdk commit: d2393c97f8]
2025-07-11 16:22:12 -05:00
Welton, Benjamin 7dcf95fb57 [SWDEV-540753] Reduce memory allocations in device profiling (#507)
Cache packet creation in all cases to reduce the number of allocation/
destruction operations made down to KFD. There is a bug that we
encounter after a period of runtime in KFD where allocations fail to be
visible to the GPU (we suspect this is a FW issue, similar to other FW
issues they have had along the same lines). This sidesteps that issue in
rocprof (and likely should be done regardless).

Co-authored-by: Benjamin Welton <bewelton@amd.com>

[ROCm/rocprofiler-sdk commit: 7e3ea0c58e]
2025-07-11 12:46:19 -07:00
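The caching idea in the commit above can be illustrated with a small pool that hands back previously built packets instead of allocating new ones. This is a minimal sketch, not the SDK's implementation: `PacketCache`, its `build` callback, and the `("agent-0", "SQ_WAVES")` key are hypothetical stand-ins for whatever the real code uses to create and identify packets.

```python
from typing import Callable, Dict, Hashable, List


class PacketCache:
    """Reuses released packets so repeated requests avoid new allocations."""

    def __init__(self, build: Callable[[Hashable], object]) -> None:
        self._build = build  # stands in for an expensive driver-level allocation
        self._free: Dict[Hashable, List[object]] = {}
        self.allocations = 0  # number of times `build` was actually called

    def acquire(self, key: Hashable) -> object:
        pool = self._free.setdefault(key, [])
        if pool:
            return pool.pop()  # reuse a cached packet
        self.allocations += 1
        return self._build(key)

    def release(self, key: Hashable, packet: object) -> None:
        # Keep the packet for later reuse instead of destroying it.
        self._free.setdefault(key, []).append(packet)


cache = PacketCache(build=lambda key: {"key": key})
key = ("agent-0", "SQ_WAVES")
for _ in range(1000):
    packet = cache.acquire(key)
    cache.release(key, packet)
print(cache.allocations)  # prints 1
```

With the pool in place, a thousand acquire/release cycles for the same key trigger a single underlying allocation.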
Nagaraj, Sriraksha c8912d2bb6 [rocprofv3-avail] Documentation update and column formatting (#447)
* addressing issues

* doc fix

* test fix

* fix

* fix formatting issue and doc update

* fix column size

* fix

* fix formatting in output

* tests fix

* test fix

* add new line

* add new line

* fix new line

* fixing typo in using-rocprofv3-avail.rst

[ROCm/rocprofiler-sdk commit: 3aaffc42da]
2025-07-10 11:41:12 -05:00
Elwazir, Ammar 23e601837c Fixing Docs Workflow (#505)
[ROCm/rocprofiler-sdk commit: e5161f6e06]
2025-07-09 17:19:16 -05:00
Elwazir, Ammar f5cb757547 Fix Docs Workflow (#504)
* Update docs.yml

* Update docs.yml

* Update docs.yml

[ROCm/rocprofiler-sdk commit: d68e1754d7]
2025-07-09 12:45:49 -05:00
Elwazir, Ammar f051f37cdc GPU-less runners update (#503)
* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* Update codeql.yml

* clean up

* clean up

* clean up

* Update codeql.yml

* Update codeql.yml

---------

Co-authored-by: Ammar ELWazir <aelwazir@amd.com>

[ROCm/rocprofiler-sdk commit: a871e3534a]
2025-07-09 11:46:47 -05:00
U, Srihari 7243889d6a Add perfetto support for scratch memory (#303)
* Add perfetto support for scratch memory

* Updated tests and docs.

* Update docs data

* Added underflow check

* Record all free events to 0 bytes

* Add format

* Address review comment

* updated tests for scratch memory

* update scratch-memory tests.

[ROCm/rocprofiler-sdk commit: 6f2a5a9646]
2025-07-09 21:05:45 +05:30
Welton, Benjamin fbf17a42d4 [SWDEV-516561][1/2] Add MARKER_RANGE_EXTENT to capture ROCTX ranges (#363)
* [SWDEV-516561][1/2] Add MARKER_RANGE_EXTENT to capture ROCTX ranges

Range extent to capture all work between roctxpush/pop operations. Entry callback takes place during roctxpush and exit callback takes place in roctxpop. This is primarily to allow us to keep an ancestor id on the ancestor stack such that all operations that take place within the push/pop context can be annotated as being a part of this range. With the current setup (where push and pop are two separate operations that need to be combined externally), we cannot keep an ancestor id on the stack and thus cannot tie tracing events to particular ranges.

Correlation id information is inherited from the push operation. Ancestor id needs to be added in a future commit that also outputs this ancestor to CSV.

Output:

```
[ctest] {'size': 64, 'kind': 7, 'operation': 1, 'correlation_id': {'internal': 1525, 'external': 0, 'ancestor': 1524}, 'start_timestamp': 2932551479402642, 'end_timestamp': 2932551491178449, 'thread_id': 3254861}
[ctest] {'size': 64, 'kind': 8, 'operation': 2, 'correlation_id': {'internal': 1525, 'external': 0, 'ancestor': 1524}, 'start_timestamp': 2932551479405878, 'end_timestamp': 2932551491181214, 'thread_id': 3254861}
```

Note: Kind 8 = range extent op.

* Merge fix

Revert several changes

source/lib/rocprofiler-sdk/marker/range_marker.*

- separate out range marker implementation from standard marker implementation

Update public API with marker core range

Support marker core range in sdk (source/lib/rocprofiler-sdk)

Transition rocprofiler-sdk-tool and output lib to use marker core range

Misc fixes for tests

Fix logic in lib/output/generate{CSV,Stats}.cpp

Update tests/rocprofv3/tracing-hip-in-libraries (marker validation)

Fix test_otf2_data

* Test fixes

---------

Co-authored-by: Benjamin Welton <bewelton@amd.com>

[ROCm/rocprofiler-sdk commit: 2c4e20b951]
2025-07-08 23:41:22 -07:00
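The push/pop pairing described in the MARKER_RANGE_EXTENT commit above can be sketched as a stack that emits one record per range and exposes the innermost open range as the ancestor for nested work. This is an illustrative model only: `RangeTracker`, `RangeRecord`, and the integer timestamps are hypothetical and do not mirror the SDK's types or callbacks.

```python
import itertools
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class RangeRecord:
    range_id: int
    name: str
    start: int
    end: int


class RangeTracker:
    """Pairs push/pop calls into single records covering the whole range."""

    def __init__(self) -> None:
        self._ids = itertools.count(1)
        self._stack: List[Tuple[int, str, int]] = []
        self.records: List[RangeRecord] = []

    def push(self, name: str, timestamp: int) -> int:
        range_id = next(self._ids)
        self._stack.append((range_id, name, timestamp))
        return range_id

    def pop(self, timestamp: int) -> RangeRecord:
        range_id, name, start = self._stack.pop()
        record = RangeRecord(range_id, name, start, timestamp)
        self.records.append(record)
        return record

    def current_ancestor(self) -> Optional[int]:
        # Work issued now would be annotated with the innermost open range.
        return self._stack[-1][0] if self._stack else None


tracker = RangeTracker()
outer = tracker.push("training-step", timestamp=100)
ancestor_inside = tracker.current_ancestor()  # id of the open range
record = tracker.pop(timestamp=250)
ancestor_after = tracker.current_ancestor()  # None once the range is closed
```

Keeping the open range on a stack is what lets events inside the range be tied to it directly, rather than matching separate push and pop records after the fact.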
Welton, Benjamin 187bdeca54 [SDK] Added retry for signal wait (#494)
Added retry for signal wait

Co-authored-by: Benjamin Welton <bewelton@amd.com>

[ROCm/rocprofiler-sdk commit: 2fa98c4d14]
2025-07-08 23:28:02 -07:00
Roy, Rakesh 05b674651c Fix find_if build error for RHEL10 (#499)
overriding Jonathan's final approval (as it is a long weekend in US/NA)
adding algorithm header to recognize stdlib's find_if looks fine

[ROCm/rocprofiler-sdk commit: 30e239d1b7]
2025-07-04 09:20:23 -04:00
Bhardwaj, Gopesh d5ca98baed Adding OpenMP usage with rocprofv3 (#472)
* Adding openmp usage with rocprofv3

* minor changes

* Fixing missing line

[ROCm/rocprofiler-sdk commit: e7616c3aad]
2025-07-02 12:25:24 +05:30
Baraldi, Giovanni cd5d5f8142 [rocprofv3] Fix ATT library path (#476)
* Fix library path

* Update docs

* Review comments

* Update source/bin/rocprofv3.py

---------

Co-authored-by: Giovanni Baraldi <gbaraldi@amd.com>

[ROCm/rocprofiler-sdk commit: c0c08b2f08]
2025-07-01 22:08:29 +02:00
Nagaraj, Sriraksha a1ffe9be39 [SDK] Internal event id for AQL packet increased to 64-bit unsigned int (#380)
* event id to 64 bit size

* review comments

* review comments

* review comments

* fix

* handle exception

* fix formatting issue

* fix tests

[ROCm/rocprofiler-sdk commit: 1524ceb84d]
2025-06-30 17:54:37 -05:00
Madsen, Jonathan ce7d811719 [SDK] Fix buffer tracing stringify of stack-allocated char* buffer (#429)
* [SDK] Fix buffer tracing stringify of stack-allocated char* buffer

* Formatting

---------

Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>

[ROCm/rocprofiler-sdk commit: 3590ecc10e]
2025-06-30 17:51:26 -05:00
Madsen, Jonathan 09cd1ea073 [CI] Use rocprof-azure-gpuless-emu-runner-set for build-docs-from-source job (#493)
[ROCm/rocprofiler-sdk commit: 730b4e0a54]
2025-06-30 17:50:43 -05:00
Trowbridge, Ian e2bdebcb57 [rocprofv3] Add check for Stream Stack Pointer before use (#465)
* Added null check for stream_stack before get_stream_id is called

* Rename function and add check for stream stack before pop

* Removed empty check for stream stack and adding error log for get_stream_id in stream.cpp

[ROCm/rocprofiler-sdk commit: 0904b6e34d]
2025-06-30 17:49:48 -05:00
Madsen, Jonathan 145944dc30 [CI] Disable conversion script validation (#492)
- pytest fails when you selectively disable all the tests in the suite

[ROCm/rocprofiler-sdk commit: 99e262ca4a]
2025-06-30 17:02:53 -05:00
Kuricheti, Mythreya 0b003260fa [SDK] Filter agents that are not CPU or GPU (#374)
* Filter HSA agents

* Review comments

[ROCm/rocprofiler-sdk commit: 7842dfe839]
2025-06-30 15:54:48 -05:00
Madsen, Jonathan fb51f0e5d4 [CI] Disable other unstable tests (#491)
Disable other unstable tests

- validation test_validate_counter_collection_pmc1 for conversion script
- increase timeout for tests/rocprofv3/pc-sampling execution phase

[ROCm/rocprofiler-sdk commit: f0fe04b95e]
2025-06-30 15:53:30 -05:00
Madsen, Jonathan fff7146ab8 [CI] Disable other unstable tests (#490)
* Disable other unstable tests

* Disable validating test_total_runtime in kernel-tracing

* The disabled tests will be stabilized and re-enabled by ROCm 7.0.1 or ROCm 7.1 

[ROCm/rocprofiler-sdk commit: 69f71b8097]
2025-06-30 15:42:38 -05:00
Madsen, Jonathan 839c07c4aa [CI] Testing stability (#486)
* [CI] Testing Stability

- CMake option ROCPROFILER_DISABLE_UNSTABLE_CTESTS
  - used for tests which periodically fail around 1 out of every 10 runs
  - set to ON while instability remains, this needs to be set to OFF in ROCm 7.1 or, ideally, ROCm 7.0.1
- Use FIXTURES_SETUP and FIXTURES_REQUIRED for some tests
- replace "threw an exception" with "${ROCPROFILER_DEFAULT_FAIL_REGEX}" for misc FAIL_REGULAR_EXPRESSIONS

* Remove contents of all EXCLUDE_{TESTS,LABEL}_REGEX from CI workflow

* Disable patch git step in code-coverage run

* Tweak spin time of reproducible runtime

* Removed patch git step in code-coverage run

* Update ROCPROFILER_DEFAULT_FAIL_REGEX

* Mark test-counter-collection tests as unstable

- add fixtures setup/required

* Remove ATTACHED_FILES_ON_FAIL

- CDash doesn't store or enable downloading these properly anyway

* Relax collection-period fuzzing window

* Disable unstable collection-period test

- too unstable

* formatting

* Disable unstable device_counting_service_test.async_counters

* Suppress perfetto internal data race errors

* Switch code-coverage CI jobs to mi300 runner

* Timeout increases

* rocprofv3-test-rocpd updates

- add fixtures
- switch executable
- redefine input/output paths

* Revert code-coverage job to mi300a runner

* Update rocprofv3-test-rocpd-execute-multiproc

- reduce problem size

* disable multiproc rocpd

* Split code-coverage into separate workflow

- network issues cause this job to fail frequently
- when in a separate workflow, it can be restarted easily

* Fixtures for rocprofv3-test-trace-hip-in-libraries

* Disable unstable device_counting_service_test.sync_counters

* Potential fix for code scanning alert no. 171: Workflow does not contain permissions

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

* Switch code-coverage to run on rocprof-azure

- mi300a EMU runner set is unstable (network issues)

* tests/rocprofv3/pc-sampling SKIP_REGULAR_EXPRESSION

* Update rocprofv3-test-list-avail-trace-execute

- reduce log level and increase timeout

* rocprofv3: Prevent recursive call to rocprofv3_error_signal_handler + log chaining

* rocprofv3: Use ROCP_ERROR + std::exit instead of ROCP_FATAL

- should help with SKIP_REGULAR_EXPRESSION

---------

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

[ROCm/rocprofiler-sdk commit: 640ca55ac0]
2025-06-30 15:07:37 -05:00
Elwazir, Ammar 741ce1b8f3 Adding Manual Trigger for mainline and staging public sync (#489)
* Update sync-staging.yaml

* Update sync-mainline.yaml

[ROCm/rocprofiler-sdk commit: 5962747bf7]
2025-06-30 11:13:50 -05:00
Kandula, Venkateshwar reddy 8f5d00ca5d [SDK] Add gfx950 targets for tests and samples (#399)
* add gfx950 targets.

* add gfx950 targets to ci workflows.

* Format.

---------

Co-authored-by: Venkateshwar Reddy Kandula <vkandula@amd.com>
Co-authored-by: Vaddireddy, Sushma <Sushma.Vaddireddy@amd.com>

[ROCm/rocprofiler-sdk commit: 375866383b]
2025-06-26 14:25:08 -05:00
Trowbridge, Ian b461671093 Modified perfetto output for HIP stream display (#431)
* Modified perfetto output for HIP stream display

* Moved stream_map file location and changed perfetto output names Private_Segment_Size and Group_Segment_Size to Scratch_Size and LDS_Block_Size respectively

* Used const_cast to remove const modifier on void*

* Reverted stream_map changes, now using tool_metadata map to track mapping between stream ptrs and stream IDs

* Removed buffer tracing args in perfetto, added tool_...hip buffer record struct that stores the HIP stream ID for display purposes

* Updated rocpd perfetto.cpp to reflect stream changes. Still need to add vgpr values and stream ID for HIP API

* Changes pass-by const reference to pass-by const value

[ROCm/rocprofiler-sdk commit: 1f8b8c5e9f]
2025-06-26 14:22:50 -05:00
Kuricheti, Mythreya bde07e7baa [SDK] KFD new events API (#321)
* Remove page-migration

* Add KFD events API

* Address review comments

* Move assert checks

* Update enum-string utils

* Update codeowners

* Update KFD header

* Add perfetto category

[ROCm/rocprofiler-sdk commit: 8a461afe20]
2025-06-26 13:28:45 -05:00
Trowbridge, Ian 58532974bc Modified test for agent.cpp where GPUs are missing the UUID (#405)
* Modified test for agent.cpp where multiple GPUs share the same UUID

* Updated for review comments

[ROCm/rocprofiler-sdk commit: eb8c9af720]
2025-06-25 17:36:56 -05:00
Elwazir, Ammar 75895ce046 Fixing Docker Pull Rate Issue (#483)
Update rocm_release_compatibility.yml

[ROCm/rocprofiler-sdk commit: 73ee59244d]
2025-06-25 17:16:22 -05:00
Elwazir, Ammar 1a807c27de Python Lint Fix (#482)
Update python.yml

[ROCm/rocprofiler-sdk commit: 9ec3cc7f62]
2025-06-25 14:58:56 -05:00
Elwazir, Ammar 469137eb7d Multiple CI Fixes & Clean up (#477)
* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* updating build essentials

* updating build essentials

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

---------

Co-authored-by: gobhardw <gopesh.bhardwaj@amd.com>

[ROCm/rocprofiler-sdk commit: 25f5cbc0c7]
2025-06-25 13:38:13 -05:00
Verma, Saurabh 442da1f287 PC-Sampling doc updates - FW version (#455)
* Initial doc update

* addressed review comments

* addressed review comments - 2

* accept reviewer suggestions

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* accept reviewer suggestions-2

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* accept reviewer suggestions-3

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* accept reviewer suggestions-4

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* Update README.md

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* Update CHANGELOG.md as per viewer suggestions

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* accept review suggestion

Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>

* accept reviewer suggestion

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

* accept reviewer suggestions

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>

---------

Co-authored-by: Rawat, Swati <Swati.Rawat@amd.com>
Co-authored-by: Indic, Vladimir <Vladimir.Indic@amd.com>
Co-authored-by: Bhardwaj, Gopesh <Gopesh.Bhardwaj@amd.com>

[ROCm/rocprofiler-sdk commit: f70f369d46]
2025-06-25 13:11:18 +05:30
Trowbridge, Ian 0a9849a5cf Add copyright disclaimer for scan (#453)
[ROCm/rocprofiler-sdk commit: 0ab43420ba]
2025-06-24 16:25:26 -05:00
Elwazir, Ammar aa7a8f16fe Temp: Disable Scratch Memory Test (#473)
* Disable Scratch Memory Test

* Update continuous_integration.yml

* Update continuous_integration.yml

[ROCm/rocprofiler-sdk commit: 86e40741ea]
2025-06-24 16:02:56 -05:00
Elwazir, Ammar 3f993a0192 Using Azure instead of Github Hosted Runners (#475)
* Update formatting.yml

* Update codeql.yml

* Update docs.yml

* Update generate_afar.yml

* Update python.yml

* Update rerun.yml

* Update restrictions.yml

* Update rocm_release_compatibility.yml

* Update scan.yaml

* Update sync-mainline.yaml

* Update sync-staging.yaml

[ROCm/rocprofiler-sdk commit: a2dc4ec041]
2025-06-24 15:43:54 -05:00
Baraldi, Giovanni 0ea9dbf7a8 Adding doc links for trace decoder, aqlprofile and viewer (#464)
Adding interlinks for trace decoder, aqlprofile and viewer

Co-authored-by: Giovanni Baraldi <gbaraldi@amd.com>
Co-authored-by: Bhardwaj, Gopesh <Gopesh.Bhardwaj@amd.com>

[ROCm/rocprofiler-sdk commit: 9dadbbace5]
2025-06-18 14:10:18 +02:00
Trowbridge, Ian 8de9854a62 [SDK] [CI] Update HSA EXT Step Version (#460)
Update HSA EXT step version

[ROCm/rocprofiler-sdk commit: 883a57ece7]
2025-06-17 08:55:58 -07:00
Elwazir, Ammar 4d79e1df30 [SDK] Support CMake option for using internal RCCL tracing + (temporary) enable in CI (#457)
* Temp: disable RCCL tracing

* Update continuous_integration.yml

* Update continuous_integration.yml

* Update continuous_integration.yml

* Adding option to disable rccl tracing from CMake

* Update codeql.yml

* Misc updates

- ROCPROFILER_BUILD_RCCL -> ROCPROFILER_INTERNAL_RCCL_API_TRACE
- env.EXTRA_TEMP_CMAKE_OPTIONS -> env.GLOBAL_CMAKE_OPTIONS
- add (advanced) option ROCPROFILER_INTERNAL_RCCL_API_TRACE

* Fix rocprofiler::sdk::get_enum_label

- missing enum labels for HIP_RUNTIME_API_TABLE_STEP_VERSION > 8

* Update tests/rocprofv3/advanced-thread-trace/CMakeLists.txt

- improve various aspects of cmake -- particularly echoing where attdecoder_LIBRARY was found

* Use CMAKE_MESSAGE_INDENT

- add prefix to cmake messages to help indicate where messages are coming from
- make find_package(Python3 ...) QUIET for bindings

* Fix rocprofiler::sdk::get_enum_label

- handle HSA_AMD_EXT_API_TABLE_MAJOR_VERSION

* Fix rocprofv3 message for att library path

* Fix tests/rocprofv3/advanced-thread-trace/att_input.yml config

* Fix rocprofv3 check_att_capability + soversion/version library resolution

- Account for ROCPROF_ATT_LIBRARY_PATH in env in check_att_capability
- Add resolve_library_path
  - supports resolution of library names to SOVERSION and VERSION paths

* Fix python linting error (unused import)

---------

Co-authored-by: Ammar ELWazir <aelwazir@amd.com>
Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>

[ROCm/rocprofiler-sdk commit: aeb1621c2b]
2025-06-17 08:32:54 -05:00
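The commit above mentions resolving a library name to its SOVERSION and VERSION paths. The sketch below shows one plausible way to do that lookup on disk; it is not the code from `rocprofv3.py`. The function name `resolve_library_path_sketch`, its signature, and the preference for the shortest versioned suffix are all assumptions made for illustration.

```python
import glob
import os
import tempfile
from typing import Optional


def resolve_library_path_sketch(directory: str, name: str) -> Optional[str]:
    """Return the path for `name`, falling back to versioned file names.

    Tries `<name>` first, then any `<name>.<suffix>` such as `libfoo.so.1`
    (SOVERSION) or `libfoo.so.1.2.3` (VERSION). Lexicographic sorting puts
    `libfoo.so.1` ahead of `libfoo.so.1.2.3`.
    """
    exact = os.path.join(directory, name)
    if os.path.exists(exact):
        return exact
    candidates = sorted(glob.glob(glob.escape(exact) + ".*"))
    return candidates[0] if candidates else None


# Demonstration with empty placeholder files in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    for fname in ("libdemo.so.1", "libdemo.so.1.2.3"):
        open(os.path.join(tmp, fname), "w").close()
    resolved = resolve_library_path_sketch(tmp, "libdemo.so")
    missing = resolve_library_path_sketch(tmp, "libother.so")
```

In this demonstration only versioned files exist, so the unversioned name resolves to the SOVERSION file, and a name with no matching files resolves to `None`.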
Bhardwaj, Gopesh 1f4084c7b5 Adding rocpd documentation (#449)
* Adding rocpd documentation

* rocpd format

* CHANGELOG update and indexing

* Fixing links

* format fixes

* fixing table

* major edits

* fixed logical error

* fixing rocprofv3 avail

[ROCm/rocprofiler-sdk commit: 3e43b1f019]
2025-06-17 15:41:53 +05:30
Baraldi, Giovanni d41a31dc47 Adding LDS Bank conflict metric to MI300 (#456)
Adding LDS Bank conflict metric to MI3

Co-authored-by: Giovanni Baraldi <gbaraldi@amd.com>

[ROCm/rocprofiler-sdk commit: b1637869b3]
2025-06-13 19:28:57 +02:00