Gráfico de commits

1117 Commits

Autor SHA1 Mensaje Fecha
ywang103-amd 23b42e90c9 fix the error of output path of multi-node mode (#616)
* solve the error that makes name passed by -n not used in multi-node applications

* isort and black formatted
2025-03-18 17:19:19 -04:00
vedithal-amd 45b8937d5d Fix kernel filtering when using rocprofv3 (#615)
When using rocprof v3:
* Use --kernel-include-regex for kernel name filtering
* Use --kernel-iteration-range for kernel dispatch filtering

Update changelog
2025-03-18 11:26:45 -04:00
vedithal-amd 64ccd588de Band aid fix for MI 100 no counters collected (#614)
* rocprofv3 might not collect any counters for MI 100, handle this case gracefully to prevent test failures
2025-03-18 11:26:17 -04:00
Ben Richard 96a25e8cbc Remove dependency on en_US.UTF-8 locale (#613)
Set locale to C.utf8 instead of en_US.UTF-8

Avoid forcing the user to use en_US.UTF-8. Most Linux systems have C.utf8.
2025-03-17 16:40:36 -04:00
ywang103-amd 0c6cec5671 replace rocm-smi with amd-smi cmd (#612) 2025-03-17 16:20:41 -04:00
cfallows-amd 1c237c1382 Debug logging during intensities calculations when no flops recorded (#608)
Added debug log for when no flops are recorded (total_flops is 0), so AI points will not be plotted.
Removed commented out print statement that is not functional- contains nonexistent method call.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-03-14 12:37:19 -04:00
vedithal-amd 6827330135 Fix tests on MI 100 (#609) 2025-03-13 12:07:03 -04:00
vedithal-amd 30752d1547 selective counter bugfix (#602)
Allow block filter of the form xx.x
2025-03-11 13:34:48 -04:00
vedithal-amd 15edbf475e Standalone binary no self execute fix (#603)
* Fix nuitka command
2025-03-11 13:34:37 -04:00
cfallows-amd 097d30dc5c Add fp8 graph to standalone gui in analysis mode (#600)
Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-03-11 13:32:37 -04:00
Fei Zheng 7e8d2d2c0e Fix counter collection inconsistency with rocprofv3 2025-03-10 21:05:40 -06:00
vedithal-amd 51c9c6fad3 Update sphinx docs version in develop (#601)
* Update sphinx docs version in develop instead of amd-staging branch
* Add repo admins to pr review
2025-03-10 18:01:37 -04:00
cfallows-amd fd288e6d13 Force kaleido version to be no greater than 0.2.1 (#599)
Higher versions (eg. 0.4.1) have external dependencies that are causing errors and forcing early exits without creating roof plots

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-03-10 15:16:14 -04:00
vedithal-amd 55cf0e237e Analysis report block based filtering for profiling (#566)
* Analysis report block based filtering for profiling

* Profiling mode changes

- `-b` option now additionally accepts metric id(s), similar to `-b` option in analyze mode (e.g. 6, 6.2, 6.23)
    - Only counters mentioned in the selected analysis report blocks will be collected
        - Add parsing logic to identify hardware counters from analysis report blocks
        - Add filtering logic to only write filtered counters in perfmon files
        - Log not collected counters in one line
- `--list-metrics` option added in profile mode to list possible metric id(s) similar to analyze mode
- Write arguments provided during profiling in profiling_configuration.yaml file

* Analysis mode changes

- During analysis mode, only show report blocks selected during profiling
    - If `-b` option is provided in analysis mode, then follow provided filters
- Do not show empty tables in analysis report

* Miscellaneous changes

- Update CHANGELOG
- Add test cases
    - Instruction mix report block filter
    - Instruction mix and Memory chart report block filter
    - Instruction mix report block filter and CPC hardware block filter
    - TA hardware block filter
    - --list-metrics in profile mode should work
- Move binary handler fixtures to conftest.py to avoid importing
  fixtures
- cmake file in tests directory has been updated to compile sample/vmem.hip for testing

* Public documentation changes

- Use the term "Hardware report block" instead of "Hardware block"
- Add documentation for "--list-metrics" option in profile mode
- Add example of filtering by hardware report block such as instruction
  mix and wavefront launch statistics
- Add deprecation warning for hardware component (sq, tcc) based filtering
2025-03-10 14:42:56 -04:00
Peter Park 0aefd15b7b Fix name in package manager install docs (#593)
* Fix post analysis gui in standalone binary (#591)

* Fix post analysis gui in standalone binary

* Add post analysis gui assets and required server libraries for GUI
  server and web page

* Add port forwarding to docker test compose

* Update README me to use `docker compose up` instead of `docker compose run`
  to run containers with port forwarding and to leverage other
  functionalities of docker compose

* Fix rocprofv1 output processing. (#588)

* fix rocprof-compute binary name in package manager install docs

---------

Co-authored-by: vedithal-amd <Vignesh.Edithal@amd.com>
Co-authored-by: xuchen-amd <xuchen@amd.com>
2025-03-10 13:47:30 -04:00
cfallows-amd 848fa1dc18 FP8 roofline support (#592)
Adding FP8 datatype to roofline feature in rocprof-compute on MI300-based systems.
FP8 now shows in terminal output and roofline csv, and outputs a standalone PDF.

---------

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-03-07 11:27:01 -05:00
Fei Zheng f64c83fc5e fix specs-correction 2025-03-07 11:27:01 -05:00
xuchen-amd 1b6bc89137 Disable --kokkos-trace in rocprof-compute (#594)
.
2025-03-07 11:27:01 -05:00
xuchen-amd b81310070e Fix rocprofv1 output processing. (#588) 2025-03-07 11:27:01 -05:00
vedithal-amd 0b3114fa88 Fix post analysis gui in standalone binary (#591)
* Fix post analysis gui in standalone binary

* Add post analysis gui assets and required server libraries for GUI
  server and web page

* Add port forwarding to docker test compose

* Update README me to use `docker compose up` instead of `docker compose run`
  to run containers with port forwarding and to leverage other
  functionalities of docker compose
2025-03-07 11:27:01 -05:00
dependabot[bot] 3b78dc9177 Bump rocm-docs-core from 1.15.0 to 1.17.0 in /docs/sphinx (#572)
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.15.0 to 1.17.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.15.0...v1.17.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-28 16:34:11 -07:00
Fei Zheng 6ece52fbe8 Change normal_unit default to per_kernel 2025-02-27 12:46:47 -07:00
vedithal-amd bec537e2a1 clear yum metadata before yum install (#582) 2025-02-27 09:39:26 -05:00
Fei Zheng 58217f2c6a Add FP8 metrics for MI300 2025-02-26 18:45:19 -07:00
vedithal-amd 19a9365bb9 Do not fail if VERSION.sha not found (#579)
* Create VERSION.sha file while creating standalone binary

* Do not fail on VERSION.sha not found

* Add git to docker container for testing
2025-02-26 11:39:37 -05:00
vedithal-amd 649660d54e Check PATH to verify executable in remaining args (#578) 2025-02-24 13:33:55 -05:00
xuchen-amd 10e1d57bfb Improve kokkos trace handling. (#573)
* Improve kokkos trace is handling.

* Fix formatting.

* Syntax fix and adding comments.
2025-02-22 14:07:14 -05:00
David Galiffi f5c3e9a48e Add develop branch to rocm-ci.yml (#571)
Signed-off-by: David Galiffi <David.Galiffi@amd.com>
2025-02-21 00:20:02 -05:00
vedithal-amd 40f79c28b1 Standalone binary (#546)
* Add cmake function to create standalone binary

* Mention licenses used by dependencies in the LICENSE file

* Add test cases for standalone binary by adding --call-binary option for pytest

* Docker compose file to create standalone binary in standardized RHEL 8 environment

* Add README instructions on how to create and test standalone binary

* Move docker files from utils to docker folder; Add standalone binary testing instructions

* Add CHANGELOG statement

* Use different service names in docker compose files

* Use volume mounting in docker files
2025-02-20 17:51:57 -05:00
ywang103-amd e5803eb243 Parser: to_int() return None instead of np.nan (#569) 2025-02-20 10:20:06 -05:00
ywang103-amd b9fc44992a fix missing parameter of spatial multiplexing for create_df_pmc (#567) 2025-02-14 14:23:16 -05:00
Fei Zheng 8c3895e1d1 Update changelog 2025-02-13 18:05:49 -07:00
ywang103-amd 808a9a79ed Spatial multiplexing: part2, analysis (#542)
Co-authored-by: Yang Wang <ywang103@amd.com>
2025-02-13 15:38:59 -08:00
Peter Park 9643afa62d Fix links in docs (#564) 2025-02-13 16:24:48 -05:00
cfallows-amd c03cbf2389 Update CHANGELOG and docs 2025-02-11 10:46:33 -07:00
ywang103-amd 5ee37b3353 fix crash for running rocprofv3 on mi100 (#557)
* initial hack to fix for v3 stucking on mi100 becasue of -m parameter and missing counter csv file

* proper formating

* refactored profiler option function to take soc arch

* resolve missing step that casued error for profiler option

* fix typo of arch name

* change method of putting soc info into profiler option

* isort and black format

* add comment for the part that handles missing counter csv file

* remove unncecessary import

---------

Co-authored-by: YANG WANG <ywang@ywang-ubuntu.amd.com>
2025-02-10 17:59:03 -05:00
xuchen-amd 6391b3ca76 Change profiler mode trace options, the different trace modes (kernel, kokkos (marker), and hip) are now exclusive of each other based on rocprofv3 updates. (#561) 2025-02-10 16:11:03 -05:00
vedithal-amd bceddb0943 Add broken mclk workaround for all arch #453 (#556)
* Add the missing commit from #453

* Remove unnecessary conditional and use 'is' operator

Co-authored-by: Richard Ben <ben.richard@amd.com>
2025-02-07 14:54:17 -05:00
xuchen-amd 3396ba3906 Improve test run time by editing roofline related tests. (#554)
* Improve test run time by editing roofline related tests.

* Fix formatting.
2025-02-05 14:18:59 -05:00
Vignesh Edithal 25b6bbbb3c Use github app for develop to amd-staging sync
* Use the token for rocprofiler-compute-runner github app to perform
  auth to bypass amd-staging branch protection
2025-02-03 13:41:14 -05:00
dependabot[bot] 95b600ea58 Bump rocm-docs-core from 1.14.1 to 1.15.0 in /docs/sphinx (#547)
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.14.1 to 1.15.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.14.1...v1.15.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-01-30 17:25:46 -07:00
xuchen-amd da1bd045ab Enable kokkos trace/features from rocprofv3 in rocprof-compute. (#545)
Signed-off-by: Xuan Chen <xuchen@amd.com>
2025-01-29 13:14:57 -05:00
Cole Ramos 0a52e6f19d Updates for new default branch (#537)
* Update contributing guide for new default branch

Signed-off-by: coleramos425 <colramos@amd.com>

* Add develop target to related workflows

Signed-off-by: coleramos425 <colramos@amd.com>

* Auto sync develop and staging

Signed-off-by: coleramos425 <colramos@amd.com>

* Run sync every night at 12AM

Signed-off-by: coleramos425 <colramos@amd.com>

* Rename sync workflow

Signed-off-by: coleramos425 <colramos@amd.com>

* Update maintainers guide

Signed-off-by: coleramos425 <colramos@amd.com>

* Update readme

Signed-off-by: coleramos425 <colramos@amd.com>

* Auto accept PRs for daily branch sync

Signed-off-by: coleramos425 <colramos@amd.com>

* Direct merge. No PR for merge required

Signed-off-by: coleramos425 <colramos@amd.com>

---------

Signed-off-by: coleramos425 <colramos@amd.com>
2025-01-28 16:40:41 -06:00
Sam Wu f58070aff1 Update documentation requirements (#540) 2025-01-24 16:34:03 -07:00
cfallows-amd 7602341b70 Add ubuntu 24.04 detection for roofline bin loading. (#541)
Also updated docker script for dev usage.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-01-23 19:53:50 -05:00
ywang103-amd 713570908a enable rocprofv3 for all old soc architectures (#539)
* enable rocprofv3 for all old soc archetectures

* remove gfx906 from rocprofv3
2025-01-23 15:37:00 -05:00
Cole Ramos 34760ff814 Yearly license update (#538)
* Add license template designation

Signed-off-by: coleramos425 <colramos@amd.com>

* Update license year on *.py

Signed-off-by: coleramos425 <colramos@amd.com>

* Update license year on *.sh

Signed-off-by: coleramos425 <colramos@amd.com>

* Update license year on misc files

Signed-off-by: coleramos425 <colramos@amd.com>

---------

Signed-off-by: coleramos425 <colramos@amd.com>
2025-01-23 13:09:32 -06:00
Peter Park 5063e14309 Specify path to sphinx configuration in .readthedocs.yaml (#536) 2025-01-22 15:22:26 -05:00
Peter Park bb388b03f1 update LICENSE year to 2025 (#530) 2025-01-22 07:03:28 -08:00
dependabot[bot] c99d4525ef Bump rocm-docs-core from 1.12.0 to 1.13.0 in /docs/sphinx (#526)
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.12.0 to 1.13.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.12.0...v1.13.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-01-22 07:03:19 -08:00