7 Коммитов

Автор SHA1 Сообщение Дата
dsclear-amd d5f490fa2f Sets heavy GitHub CI workflows to not trigger on text documentation-only changes. (#2417)
Sets heavy GitHub CI workflows to not trigger on docs-only changes.

Specifically, sets azure-ci-dispatcher.yml and therock-ci.yml, as well as many rocprofiler workflows, to not trigger when the change consists entirely of docs-only files.
2026-01-12 18:31:30 -05:00
Jason Bonnell 1d5a6e9bfe Update rocprofiler workflows to use new mi325 runner names (#2467)
* Update rocprofiler workflows to use new runner naming for mi325

* Add input options to workflow_dispatch for rocprofiler-systems CI workflow

* Update runner name on therock-ci-linux.yml as well
2026-01-05 15:41:01 -05:00
David Galiffi fbaeb74107 [rocprof-sys] Update nightly CI workflow (#2263)
Update ROCm version to 7.1.0

Signed-off-by: David Galiffi <David.Galiffi@amd.com>
2025-12-11 12:23:26 -05:00
Milan Radosavljevic d9b00da102 Add clean up of buffered_storage files (#1738)
* Add clean up of buffered_storage files

* Add step to workflows to test for remaining temp files after tests

* Applied suggestions from code review

* add deletion of all cache files

---------

Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-11-07 11:51:09 -05:00
Jason Bonnell cccc350dc6 [rocprofiler-systems] Add different test coverage for CI/Nightly, add better logging for failures (#1272)
* Try outputting LastTest.log

* Update if condition for outputting log

* Another attempt

* Only run Ubuntu Noble on MI355 in push/PR

* Try exclude matrix

* Move conditional statement in matrix exclusion

* Create ci-matrix.yml file

* Add needs parameter to ubuntu job

* Fix typo in matrix output variable

* Add back pull_request_template.md

* Add back pull_request_template.md
2025-10-08 15:18:56 -04:00
David Galiffi d6bdc53f1a Update rocprofiler-systems-continuous-integration.yml (#1271)
Disabling network test from CI while we investigate it's instability.
2025-10-07 18:55:30 -04:00
Jason Bonnell ad78611674 rocprofiler-systems Nightly and CI on Ubuntu Jammy/Noble on MI355 and MI325 (#997)
* Initial steps added for rocprofiler-systems-continuous-integration.yml

* Add new line to end of rocprofiler-systems-continuous-integration.yml

* Fix matrix issue in rocprofiler-systems CI workflow

* Update runner to use mi355

* Remove sudo from ROCm download step

* Add Python venv

* Try to install python venv

* Add -y to pip venv install commands

* Add shell: bash to download ROCm step

* Fix issue in if statement

* Fix typo in mv command

* Fix mv command

* Update paths

* add directory in install step

* Use default runner for now while debugging setup

* Add set -e to steps

* debug build step

* Add amdgpu install step

* remove working-directory from amdgpu install step

* add path/ld lib path, add -S argument to run-ci.py

* Fix typo in DCMAKE_PREFIX_PATH

* Add DGPU_TARGETS to run-ci.py command

* add Docker options, remove GPU_TARGETS

* Install amd-smi-lib

* Add DCMAKE_BUILD_TYPE, update path

* Remove mkdir

* Add build dynist cmake arguments

* Update cmake arguments again

* Add missing \ to run-ci.py command

* add libdw dependency

* Add later install step

* Increase timeout of configure/build/test step

* use 16 jobs to try and speed up pipeline time

* Add GHCR image, remove TheRock tarball download step, minor changes for debugging

* Add credentials to container portion of step

* Add package read permissions to ubuntu step

* Update tarball name

* Increase jobs to 16, disable some tests for now due to timeouts

* Modify to only include gpu tests

* Fix configuration

* Enable MPI on run-ci.py run

* Add install MPI step, changed tests to be run

* Enable OMPI flags, enable network counter access

* Use new Docker image names, add privileged option to Docker

* Change cmake build type

* Add fail-fast false option for CI

* Update ROCM_VERSION variable to reflect docker changes

* Specify TARBALL_ROCM_VERSION as separate

* Add MI325 to debug pipeline errors

* Move location of env variables

* Only test on jammy for now, run all tests to assess other issues

* test with branch that contains fix for openmp

* Exclude "ompvv"

We will re-add one ticket is fixed.

* Test: Disable USE_MPI

* Replace TheRock ROCm install with rocm-dev for now

* Try out MI355 noble and MI325 for jammy/noble

* Update amdgpu step to support different ROCm versions

* Remove unused env variables

---------

Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-10-06 11:40:58 -04:00