커밋 그래프

163 커밋

작성자 SHA1 메시지 날짜
David Galiffi 1e501dd89a Free runner disk space (#1693)
Motivation:
Basic runners are frequently running out of space

Technical Details:
Running autoclean after package installations.
Use the jlumbroso/free-disk-space action.
2025-11-04 17:31:23 -05:00
Joseph Macaranas b19cf0aadf Revert "Disable therock summary check, make it always positive (#1675)" (#1686)
This reverts commit 0c32b90130.
2025-11-04 14:17:37 -05:00
Danylo Lytovchenko 0c32b90130 Disable therock summary check, make it always positive (#1675) 2025-11-04 12:58:53 +01:00
MachineTom fb006546d0 SWDEV-1 - Fix a typo (#1615)
* SWDEV-1 - Fix a typo

Fix a typo.
Remove unnecessary log.

* Removing patch

---------

Co-authored-by: geomin12 <geomin12@amd.com>
Co-authored-by: Scott Todd <scott.todd0@gmail.com>
2025-11-03 12:59:00 -08:00
Ammar ELWazir da297d46e8 [ROCProfiler-sdk] [Docs CI] Refactor Git setup and CMake commands in workflow (#1662) 2025-11-03 12:12:35 -06:00
Ammar ELWazir 9fa1d1b97e [ROCProfiler-SDK] Remove 'gfx900' and 'gfx940' from GPU targets (#1661)
* [ROCProfiler-SDK] Remove 'gfx900' and 'gfx940' from GPU targets

* Remove unsupported GPU targets from workflow

* Remove gfx900 and gfx940 from GPU targets
2025-11-03 11:09:29 -05:00
sluzynsk-amd 9f940c7265 Add missing API calls to rocprofiler (#1599)
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com>
2025-11-03 09:40:16 -06:00
Ammar ELWazir fee5bd9a4e Fixing ROCProfiler Register CI & ROCProfiler-SDK Docs CI (#1570)
---------

Co-authored-by: bgopesh <gopesh.bhardwaj@amd.com>
2025-11-03 09:24:32 -06:00
Geo Min 8e98b80deb [TheRock CI] Fixing patches for rocm-systems (#1460)
* Fixing patches for rocm-systems

* Adding all

* Adding remaining projects

* Submodule bump

* adding compiler

* adding test commit hash

* Adding artifact group

* adding update for artifact group

* Adding new commit hash
2025-10-28 19:47:17 -07:00
Venkateshwar Reddy Kandula c5bd693478 [rocprofiler-sdk] Disable HIP/CLR build in rocprofiler-sdk CI jobs (#1574)
* disable HIP/CLR build

* misc. fix
2025-10-28 11:42:11 -05:00
Milan Radosavljevic 8806be162c Change how cache manager handles child process trace cache for rocpd (#1033)
* Change how cache manager handles child process trace cache

* Sampling and backtrace metrics to cache

* Apply cmake formatting

* Fix parsing of metadata json

* Code clean up

* Fix build nlohmann json from source

* Fix storage parsed finished callback

* Revert sampling for child process

* Change cache file name generating

* Fix thread start stop

* Fix process start end timestamp

* Applied suggestions from code review

* Try with late start of flushing task thread

* Change dockerfiles for ci

* Revert changes on github workflows

* Remove json_fwd.hpp include

* fix dump

* Build nlohmann/json by default

Signed-off-by: David Galiffi <David.Galiffi@amd.com>

* Update location of build artifacts for nlohmann/json

Signed-off-by: David Galiffi <David.Galiffi@amd.com>

* Revert use_output_suffix

* Remove unused logs

* Fix cache store inside counter due to structure change

* Remove decode tests from debian ci

* Fix issue where all databases have the same UUID (#1499)

Co-authored-by: Aleksandar Djordjevic <adjordje@amd.com>

* Removing the cpack and install steps to save space

* Revert "Remove decode tests from debian ci"

This reverts commit ddabf6dd142dcf438e6b8997b8abe86f2c868468.

* Revert "Removing the cpack and install steps to save space"

This reverts commit 973da3a1ba99d99d529af5269d30e177092f9bfa.

* Add prepare-runner job as dependency to clean up the space

* Fix formatting

* Free up even more space

* Remove verbose for workflows

* remove hw_counters from ext_data

* move space clean up inside container

* try to remove external folder to free up space

* Check space

* Refactor Cleanup to it's own step

---------

Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: Aleksandar Djordjevic <aleksandar.djordjevic@amd.com>
Co-authored-by: Aleksandar Djordjevic <adjordje@amd.com>
2025-10-24 11:47:15 -04:00
amd-hsivasun 43687b24f8 [Github Actions] Added monorepo_source_of_truth flag (#1525) 2025-10-23 16:37:12 -04:00
Venkateshwar Reddy Kandula 8c89ed8ab1 [rocprofiler-sdk][CI] Use rock infra for rocprofiler-sdk build docs jobs (#1518)
* Initial changes to move build docs job to rock infra

* misc. fix

* clean up code.
2025-10-23 11:17:13 -05:00
Venkateshwar Reddy Kandula 4f590499c6 [rocprofiler-sdk] Fix rocm-release compatibility latest (#1479)
* Update rocprofiler-sdk-rocm_release_compatibility.yml

* apply Copilot

* addr comments

* remove 6.2 requirements. 6.2 now can use normal Install requirements step
2025-10-21 21:45:18 -05:00
Mythreya Kuricheti 65d4ff9d04 [CI][rocprofiler-compute] Fix rhel python deps (#1370)
Install `python39-devel` dependency for pandas. Fixes build on RHEL 8.10.
2025-10-21 08:28:57 -07:00
Fei Zheng 2c59a82fe1 Fix rocprof-compute TUI build err with python 39 (#303)
* Upgrade min python version from 3.8 to 3.9

* Set min version for textual-fspicker for TUI support

* Update workflows to use python 3.9 instead of 3.8

* fix formatting

* fix bug

---------

Co-authored-by: Vignesh Edithal <Vignesh.Edithal@amd.com>
2025-10-21 00:27:35 -04:00
David Galiffi 32f9fa6ca5 Enable some simple ROCpd testing (#834)
* Add for rocpd testing and output validation

Add for transpose, video-decode, jpeg-decode, roctx, and openmp-target
Add JSON check to pre-commit-config

Co-authored-by: Marjan Antic <Marjan.Antic@amd.com>

* Remove redundant environment variable

* Fix spelling typo

* Fix typo in error message

* Fix memory_allocation query

* Incorperate feedback from review. Handle case where there are multiple matching "name_prefix" tables.

* Fix environment settings in `rocprof-sys-testing.cmake`

Accidently removed in previous refactoring.

* Formatting python file

---------

Co-authored-by: Marjan Antic <Marjan.Antic@amd.com>
2025-10-20 17:40:10 -04:00
alexxu-amd 55baf27627 [CI] Copy over docs update workflow from rocm-libraries (#1400)
* migrate docs update workflow from rocm-libraries

* add test branch to the trigger condition

* modify docs to test workflow

* temporarily rename project folder name to match the test project

* add more content for testing

* test successful, restore test modifications
2025-10-17 13:47:28 -04:00
Jason Bonnell 9664f1dc91 [rocprofiler-systems] Add retries to RHEL install steps (#1384)
* Add GHCR retry logic

* Add retries to Install ROCm Packages step in rocprofiler-systems-redhat.yml

* Update containers-ci.yml file to use latest RHEL9/10 releases

* Use build-docker-ci script in rocprofiler-systems-containers

* Remove working-directory from step in rocprofiler-systems-redhat.yml

* Remove shell bash from Install ROCm Packages step

* Revert RHEL version change in rocprofiler-systems-redhat.yml
2025-10-17 10:20:54 -04:00
Venkateshwar Reddy Kandula 9404178ea5 [rocprofiler-sdk][CI] rhel sles workflow fix (#1373)
* bug fix.

* add backslash

* add export for path, bug
2025-10-15 11:48:59 -05:00
Mythreya Kuricheti ac8adbacff [CI][rocprofiler-sdk] Fix codeql jobs (#1366) 2025-10-15 10:34:29 -05:00
Mythreya Kuricheti 765d9026c7 [CI][rocprofiler-sdk] Workflow improvements (#1341) 2025-10-14 15:21:55 -05:00
Mythreya Kuricheti fd82a185c2 [CI][rocprofiler-sdk] Add HIP build to CI (#1311) 2025-10-08 21:37:42 -05:00
Geo Min 388edb1b57 [TheRock CI] Adding profiler builds (#1301)
* Adding profiler for TheRock CI

* adding temp test for rocproiler

* Removing subtrees

* PR comment
2025-10-08 14:38:38 -07:00
Jason Bonnell cccc350dc6 [rocprofiler-systems] Add different test coverage for CI/Nightly, add better logging for failures (#1272)
* Try outputting LastTest.log

* Update if condition for outputting log

* Another attempt

* Only run Ubuntu Noble on MI355 in push/PR

* Try exclude matrix

* Move conditional statement in matrix exclusion

* Create ci-matrix.yml file

* Add needs parameter to ubuntu job

* Fix typo in matrix output variable

* Add back pull_request_template.md

* Add back pull_request_template.md
2025-10-08 15:18:56 -04:00
David Galiffi d6bdc53f1a Update rocprofiler-systems-continuous-integration.yml (#1271)
Disabling network test from CI while we investigate it's instability.
2025-10-07 18:55:30 -04:00
Venkateshwar Reddy Kandula 952d1dabe2 [ROCProfiler-SDK][ROCR] HSA New API changes for HSA_AMD_EXT_API_TABLE_STEP_VERSION 8 (#1182)
* add new hsa ext api for version 8.

* use fmt instead of ostream.

* override rccl from therock

* Update rocprofiler-sdk-continuous_integration.yml

* Update rocprofiler-sdk-continuous_integration.yml

* Update rocprofiler-sdk-continuous_integration.yml

* enable rocr-build

* format

* disable att consecutive-kernels tests.

* Enable ROCR build in code coverage workflow

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
2025-10-06 13:09:39 -05:00
Jason Bonnell ad78611674 rocprofiler-systems Nightly and CI on Ubuntu Jammy/Noble on MI355 and MI325 (#997)
* Initial steps added for rocprofiler-systems-continuous-integration.yml

* Add new line to end of rocprofiler-systems-continuous-integration.yml

* Fix matrix issue in rocprofiler-systems CI workflow

* Update runner to use mi355

* Remove sudo from ROCm download step

* Add Python venv

* Try to install python venv

* Add -y to pip venv install commands

* Add shell: bash to download ROCm step

* Fix issue in if statement

* Fix typo in mv command

* Fix mv command

* Update paths

* add directory in install step

* Use default runner for now while debugging setup

* Add set -e to steps

* debug build step

* Add amdgpu install step

* remove working-directory from amdgpu install step

* add path/ld lib path, add -S argument to run-ci.py

* Fix typo in DCMAKE_PREFIX_PATH

* Add DGPU_TARGETS to run-ci.py command

* add Docker options, remove GPU_TARGETS

* Install amd-smi-lib

* Add DCMAKE_BUILD_TYPE, update path

* Remove mkdir

* Add build dynist cmake arguments

* Update cmake arguments again

* Add missing \ to run-ci.py command

* add libdw dependency

* Add later install step

* Increase timeout of configure/build/test step

* use 16 jobs to try and speed up pipeline time

* Add GHCR image, remove TheRock tarball download step, minor changes for debugging

* Add credentials to container portion of step

* Add package read permissions to ubuntu step

* Update tarball name

* Increase jobs to 16, disable some tests for now due to timeouts

* Modify to only include gpu tests

* Fix configuration

* Enable MPI on run-ci.py run

* Add install MPI step, changed tests to be run

* Enable OMPI flags, enable network counter access

* Use new Docker image names, add privileged option to Docker

* Change cmake build type

* Add fail-fast false option for CI

* Update ROCM_VERSION variable to reflect docker changes

* Specify TARBALL_ROCM_VERSION as separate

* Add MI325 to debug pipeline errors

* Move location of env variables

* Only test on jammy for now, run all tests to assess other issues

* test with branch that contains fix for openmp

* Exclude "ompvv"

We will re-add one ticket is fixed.

* Test: Disable USE_MPI

* Replace TheRock ROCm install with rocm-dev for now

* Try out MI355 noble and MI325 for jammy/noble

* Update amdgpu step to support different ROCm versions

* Remove unused env variables

---------

Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-10-06 11:40:58 -04:00
Jason Bonnell f0fd2797b6 Add rocm-version 7.0 to rocprofiler-systems workflows (#1139)
* Adding rocm 7.0 to Ubuntu, Red Hat, and Debian workflows

---------

Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-10-03 13:16:21 -04:00
Geo Min 36a1fd87af Removing landed patch (#1184) 2025-09-30 16:51:41 -07:00
Geo Min b0a9a2386f [ci] Adding TheRock CI coverage for rocm-core (#868)
* TheRock CI points to rocm systems

* Fixing depth

* Fixing cache path

* Adding core components

* Adding more packages

* try this for windows building

* Add math libs

* Adding core only

* Attempt with no ccache

* adding patching

* Adding ls test

* adding this

* removing ls test

* changing dir name

* Adding cleanup for patch

* Adding ref

* adding correct no include

* Adding new temp branch for testing

* empty commit

* empty commit

* Adding commit hash bump

* Adding new hash for removed patches

* Adding TheRock submodule bump

* trying with compiler removed test

* Try dvc pull windows

* Update .github/workflows/therock-ci-linux.yml

Co-authored-by: Marius Brehler <marius.brehler@gmail.com>

* Adding correct env

* revert to ../

* Adding path

* try new var

* Adding new branch

* Adding correct hash

* Update .github/workflows/therock-ci-linux.yml

Co-authored-by: Marius Brehler <marius.brehler@gmail.com>

* Update .github/workflows/therock-ci-windows.yml

Co-authored-by: Marius Brehler <marius.brehler@gmail.com>

---------

Co-authored-by: Marius Brehler <marius.brehler@gmail.com>
2025-09-30 16:08:50 -07:00
Jason Bonnell 953fd60e9b rocprofiler GHCR Rename (#1112)
- Rename the GHCR packages for rocprofiler Docker images to reduce the number of packages that will be released on the repository
- Changed package name to only include the OS instead of OS+Version - version moved to the tag instead.
- Updated Dockerfile.*.ci files to specify target ROCm version from tarball in name.
2025-09-30 15:15:12 -04:00
Jason Bonnell cec7ce77d6 Add sudo apt-get update command to workflow (#1177)
- 404 Not Found errors when trying to download dependencies in the Get the latest therock build step. Adding `sudo apt-get update` command first to avoid this.
- Added `sudo apt-get update` to the rocprofiler-sdk-build-ci-docker-images.yml workflow.
2025-09-30 14:09:36 -04:00
Laura Promberger fb3677cad6 fetch_sources: replace flags that the newer version recognizes (#1148)
* fetch_sources: replace flags that the newer version recognizes

* fetch_sources: remove --no-include-rocm-libraries
2025-09-26 11:36:28 -07:00
amd-hsivasun c16b06a7d7 [Ex CI] Enable aqlprofile (#1002)
* [Ex CI] Enable aqlprofile

* [Ex CI] Added PipelineID for aqlprofile
2025-09-26 14:00:41 -04:00
amd-hsivasun b4f4d24dda [Ex CI] Enable rdc (#941)
* [Ex CI] Enable rdc

* Reorder pipelineID list
2025-09-26 11:53:03 -04:00
Laura Promberger 3fd56bfefc Update TheRock CI to use health status script (#1130)
This will update the CI to match TheRocks CI by introducing to use a
python script to report on health status. Commands that were in the
section that modified the status were moved to separate sections
(ccache, git config, ..).

Related PR:
https://github.com/ROCm/TheRock/pull/1516
2025-09-26 10:09:31 +02:00
Jason Bonnell 8b52d71cc7 rocprofiler-systems - add gfx containers to ghcr (#883)
* Initial skeleton code for rocprofiler-systems-continuous-integration.yml

* Add python3-devel to opensuse and rhel ci images

* Update rocprofiler-systems-containers.yml to include TheRock tarballs

* Update pip install command for Dockerfile.ubuntu.ci

* Fix pip install again for Dockerfile.ubuntu.ci

* Remove skeleton workflow for CI

* Add new ci-gfx containers for TheRock installs

* Add set -e and pipefail to ci Dockerfiles to detect errors

* Upgrade pip in Dockerfile.ubuntu.ci

* revert pipefail set -e change

* Replace build-docker-ci.sh script with Docker step for ci-base

* Add support for gfx950, add containers-ci-gfx.yml

* Add working-directory to matrix setup steps

* Try changing containers-ci-gfx.yml

* make more changes to containers-ci-gfx.yml

* Remove build-docker-ci.sh script from gfx step, fix typo in Dockerfile

* Remove gfx110X and gfx120X for now

* Update ci-gfx docker workflow to use ghcr.io

* Temporary change to test one image

* Enable push to test out ghcr package

* Add labels to debug oauth issue

* add pacakages permissions to step

* add rocprofiler-systems-ghcr.yml workflow

* Remove cache from Docker push action step

* Add prefix to tag

* Add back gfx94X and gfx950 support, add back no push on PR

* Remove gfx container creation from rocprofiler-systems-containers.yml

* Add a gfx950 image for now

* Revert change
2025-09-22 16:58:55 -04:00
Jason Bonnell 9d90286371 rocprofiler-sdk CI workflow improvements (#956)
Update rocprofiler-sdk and aqlprofile CI workflows to improve readability
2025-09-22 16:47:16 -04:00
Venkateshwar Reddy Kandula a4effb81a9 [rocprofiler-sdk][CI] install libva-amdgpu-dev in requirements CodeQL Job (#1038)
* install libva-amdgpu.

* Add rocprofiler-sdk-codeql.yml to paths

* Update rocprofiler-sdk-codeql.yml

* update requirements for rocm_release_compatibility job.

* address comments.

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
Co-authored-by: jbonnell-amd <jason.bonnell@amd.com>
2025-09-22 12:17:03 -05:00
Venkateshwar Reddy Kandula d16e7adf13 [rocprofiler-sdk][CI] Nightly build testing for rocprofiler-sdk (#949)
* Implement nightly tests mode

* Update run-ci.py
2025-09-19 14:32:11 -05:00
Jason Bonnell eebf5ead8c Replace cmake-format with gersemi in rocprofiler-compute-formatting.yml (#1053)
* Replace cmake-format with gersemi in rocprofiler-compute-formatting.yml

* Run gersemi formatting on CMakeLists.txt files

* Remove .cmake-format.yaml, add .gersemirc file

* Add more options to .gersemirc

* Add new line to .gersemirc

* Add new line to CMakeLists.txt

* Run gersemi again with new options
2025-09-19 08:42:40 -04:00
JC 0cf70e85cf Always run post-job cleanup step if test fail (#1049)
coincides with https://github.com/ROCm/TheRock/pull/1521
2025-09-18 12:53:14 -04:00
JC b2e611a874 [CI] Add pre/post cleanup for windows GPU test jobs (TheRock PR#1361) (#1008) 2025-09-16 12:50:14 -04:00
JC 89f9ab1270 [CI] Add 30 minute timeout to Fetch sources and use 12 jobs for Windows (#1001) 2025-09-16 12:48:52 -04:00
Danylo Lytovchenko c0e7091b9f Fix syntax error in azure subtree script (#988) 2025-09-15 11:22:03 -04:00
amd-hsivasun 2b68ac750e Add rocprofiler-systems project dependency (#915) 2025-09-12 12:16:54 -04:00
Kian Cossettini 5d582fcd37 [rocprofiler-systems] Add Fortran OpenMP CTests (#874)
* Added Fortran (amdflang) openmp tests using the openmp-vv project

---------

Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-09-12 09:52:16 -04:00
amd-hsivasun 892a56cb54 [Ex CI] Enable hip-tests (#957)
* [Ex CI] Enable hip-tests

* Add Pipeline Id

* Fixed typo

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-09-11 16:40:07 -04:00
amd-hsivasun 6b923ee1ac [Ex CI] Enable rocr-runtime (#925) 2025-09-11 16:25:09 -04:00