cfallows-amd 3df0d1dded Update roofline bins for rocm7 changes (#755)
* Update current bins to have rocm6 suffix. Add new rocm7 bins, built on rocm7.0 latest due to hip updates.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Re-add rocm version check for roof bins.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Rebuild roofline binaries on top of latest rocm7 (#16379) after recent mainline promotions.
Adjusting version and distro combinations of bins following rocm6 vs rocm 7 supported OS.
*rhel8 not supported on rocm7, also not built anymore
*sles15 not supported on rocm7 but is still being built
*ubuntu stays as 22.04 and above for rocm7

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Minor fixes after testing.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Fixed bad copy after finding it in testing ctest.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Remove runpath from new bin

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Rework target_binary map return in detect_roofline- we should not be returning maps of different sizes or with different keys for the same method. Expected output should be consistent in case we run into bad position, or for testing purposes. Manually tested all possible roofline bin expected cases to comfirm functionality and expected user output.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Update changelog with new roofline distro minimums

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

---------

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
2025-07-08 16:51:50 -04:00
2025-01-23 13:09:32 -06:00
2025-01-23 13:09:32 -06:00
2025-07-08 08:51:20 -04:00
2025-02-20 17:51:57 -05:00
2022-11-04 14:49:36 -05:00
2025-01-02 15:29:47 -06:00
2025-01-02 15:29:47 -06:00
2025-07-08 11:18:26 -04:00
2025-06-06 12:43:52 -06:00
2025-06-06 12:43:52 -06:00
2025-04-16 15:23:27 -06:00

Ubuntu 22.04 RHEL 8 Instinct Docs DOI

ROCm Compute Profiler

General

ROCm Compute Profiler is a system performance profiling tool for machine learning/HPC workloads running on AMD MI GPUs. The tool presently targets usage on MI100, MI200, and MI300 accelerators.

  • For more information on available features, installation steps, and workload profiling and analysis, please refer to the online documentation.

  • ROCm Compute Profiler is an AMD open source research project and is not supported as part of the ROCm software stack. We welcome contributions and feedback from the community. Please see the CONTRIBUTING.md file for additional details on our contribution process.

  • Licensing information can be found in the LICENSE file.

Development

ROCm Compute Profiler follows a main-dev branching model. As a result, our latest stable release is shipped from the amd-mainline branch, while new features are developed in our develop branch.

Users may checkout amd-staging to preview upcoming features.

Testing

To quickly get the environment (bash shell) for building and testing, run the following commands:

  • cd docker
  • docker compose -f docker-compose.test.yml up --force-recreate -d && docker attach docker-test-1

Inside the docker container, clean, build and install the project with tests enabled:

rm -rf build install && cmake -B build -D CMAKE_INSTALL_PREFIX=install -D ENABLE_TESTS=ON -D INSTALL_TESTS=ON -DENABLE_COVERAGE=ON -S . && cmake --build build --target install --parallel 8

Note that per the above command, build assets will be stored under build directory and installed assets will be stored under install directory.

Then, to run the automated test suite, run the following command:

ctest

For manual testing, you can find the executable at install/bin/rocprof-compute

NOTE: This Dockerfile uses rocm/dev-ubuntu-22.04 as the base image

Standalone binary

To create a standalone binary, run the following commands:

  • cd docker
  • docker compose -f docker-compose.standalone.yml up --force-recreate -d && docker attach docker-standalone-1

You should find the rocprof-compute.bin standalone binary inside the build folder in the root directory of the project.

To build the binary we follow these steps:

  • Use RHEL 8 image used to build ROCm as the base image
  • Install python3.8
  • Install dependencies for runtime and for making standalone binary
  • Call the make target which uses Nuitka to build the standalone binary

NOTE: Since RHEL 8 ships with glibc version 2.28, this standalone binary can only be run on environment with glibc version greater than 2.28. glibc version can be checked using ldd --version command.

NOTE: libnss3.so shared library is required when using --roof-only option which generates roofline data in PDF format

To test the standalone binary provide the --call-binary option to pytest.

How to Cite

This software can be cited using a Zenodo DOI reference. A BibTex style reference is provided below for convenience:

@software{xiaomin_lu_2022_7314631
  author       = {Xiaomin Lu and
                  Cole Ramos and
                  Fei Zheng and
                  Karl W. Schulz and
                  Jose Santos and
                  Keith Lowery and
                  Nicholas Curtis and
                  Cristian Di Pietrantonio},
  title        = {ROCm/rocprofiler-compute: v3.1.0 (12 February 2025)},
  month        = February,
  year         = 2025,
  publisher    = {Zenodo},
  version      = {v3.1.0},
  doi          = {10.5281/zenodo.7314631},
  url          = {https://doi.org/10.5281/zenodo.7314631}
}
S
Descrizione
No description provided
Readme 282 MiB
Languages
C++ 67.5%
C 20.6%
Python 6.6%
CMake 3.4%
Shell 0.6%
Altro 1.1%