fa772be675
## Overview and rationale This reverts https://github.com/ROCm/rocm-systems/pull/1886, which... * Re-applies https://github.com/ROCm/rocm-systems/pull/1866 * Reverts https://github.com/ROCm/rocm-systems/pull/1728 (So it restores the [`amdgpu-windows-interop/`](https://github.com/ROCm/rocm-systems/tree/develop/shared/amdgpu-windows-interop) folder back to the state from a few weeks ago) The rationale for this change is at https://github.com/ROCm/rocm-systems/pull/1866: > Last PAL update broke applications on gfx12 Windows. ## Cross-repository change details That PR failed to build but was merged with this explanation: > TheRock CI Windows build fails as expected with this revert. > > References to these PAL members need to be stripped out in a patch on TheRock. > > ``` > 11.3 C:\home\runner\_work\rocm-systems\rocm-systems\projects\clr\rocclr\device\pal\palubercapturemgr.cpp(152): error C2039: 'RegisterTraceStateChangeCallback': is not a member of 'GpuUtil::TraceSession' > 11.4 C:\home\runner\_work\rocm-systems\rocm-systems\shared\amdgpu-windows-interop\pal\inc\gpuUtil\palTraceSession.h(372): note: see declaration of 'GpuUtil::TraceSession' > 11.4 C:\home\runner\_work\rocm-systems\rocm-systems\projects\clr\rocclr\device\pal\palubercapturemgr.cpp(195): error C2039: 'UnregisterTraceStateChangeCallback': is not a member of 'GpuUtil::TraceSession' > 11.4 C:\home\runner\_work\rocm-systems\rocm-systems\shared\amdgpu-windows-interop\pal\inc\gpuUtil\palTraceSession.h(372): note: see declaration of 'GpuUtil::TraceSession' > ``` The patch in TheRock was updated in https://github.com/ROCm/TheRock/pull/2154. This rolls forward by updating the ref for TheRock. That original PR could have been sequenced differently to avoid a build break - perhaps by * Pointing to a branch in TheRock with the patch rebased * Deleting the patch in the workflows here but holding a local copy of the path to be applied in workflows * Landing the patch as a normal commit instead of carrying it at all ## Test plan 1. Watch TheRock CI here (https://github.com/ROCm/rocm-systems/actions/runs/19447202693/job/55644411119?pr=1893) 2. Build locally: ```bash # In rocm-systems git am --whitespace=nowarn D:\projects\TheRock\patches\amd-mainline\rocm-systems\0001-Revert-SWDEV-543498-Some-compute-Ubertrace-profiles-.patch git am --whitespace=nowarn D:\projects\TheRock\patches\amd-mainline\rocm-systems\0003-Use-is_versioned-true-consistently-in-both-Comgr-Loa.patch git am --whitespace=nowarn D:\projects\TheRock\patches\amd-mainline\rocm-systems\0006-Explicitly-load-libamdhip64.so.7.patch # Note: the build fails with the observed errors if patch 0001 is not applied! # In TheRock cmake -DCMAKE_BUILD_TYPE=Release \ -DCMAKE_C_COMPILER=cl.exe -DCMAKE_CXX_COMPILER=cl.exe \ -DCMAKE_C_COMPILER_LAUNCHER=ccache -DCMAKE_CXX_COMPILER_LAUNCHER=ccache \ -DPython3_EXECUTABLE=d:/projects/TheRock/.venv/Scripts/python \ -DTHEROCK_ROCM_SYSTEMS_SOURCE_DIR=d:/projects/TheRock/../rocm-systems \ # IMPORTANT -DTHEROCK_AMDGPU_FAMILIES=gfx110X-all \ -DBUILD_TESTING=ON \ -DTHEROCK_ENABLE_ALL=ON \ -Damd-llvm_BUILD_TYPE=RelWithDebInfo \ -S D:/projects/TheRock \ -B D:/projects/TheRock/build \ -G Ninja cmake --build D:/projects/TheRock/build --target hip-clr # [build] Build finished with exit code 0 cmake --build D:/projects/TheRock/build --target ocl-clr+dist # [build] Build finished with exit code 0 ```
132 regels
4.4 KiB
YAML
132 regels
4.4 KiB
YAML
name: TheRock CI Linux
|
|
|
|
on:
|
|
workflow_call:
|
|
inputs:
|
|
cmake_options:
|
|
type: string
|
|
project_to_test:
|
|
type: string
|
|
|
|
|
|
permissions:
|
|
contents: read
|
|
|
|
jobs:
|
|
therock-build-linux:
|
|
name: Build Linux Packages
|
|
runs-on: azure-linux-scale-rocm
|
|
permissions:
|
|
id-token: write
|
|
container:
|
|
image: ghcr.io/rocm/therock_build_manylinux_x86_64@sha256:044b113562629f4bd2ec5d2e64b32eee11562d48fb1a75d7493daec9dd8d8292
|
|
options: -v /runner/config:/home/awsconfig/
|
|
strategy:
|
|
fail-fast: true
|
|
env:
|
|
CACHE_DIR: ${{ github.workspace }}/.container-cache
|
|
CCACHE_CONFIGPATH: ${{ github.workspace }}/.ccache/ccache.conf
|
|
AMDGPU_FAMILIES: "gfx94X-dcgpu"
|
|
TEATIME_FORCE_INTERACTIVE: 0
|
|
AWS_SHARED_CREDENTIALS_FILE: /home/awsconfig/credentials.ini
|
|
steps:
|
|
- name: "Checking out repository for rocm-systems"
|
|
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
|
|
|
|
- name: Checkout TheRock repository
|
|
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
|
|
with:
|
|
repository: "ROCm/TheRock"
|
|
path: "TheRock"
|
|
ref: eb8f187ff47eb6af9cd5aaa0b8d9a04b06b12796 # 2025-11-15 commit
|
|
|
|
- name: Install python deps
|
|
run: |
|
|
pip install -r TheRock/requirements.txt
|
|
|
|
# safe.directory must be set before Runner health status
|
|
- name: Adjust git config
|
|
run: |
|
|
git config --global --add safe.directory $PWD
|
|
git config fetch.parallel 10
|
|
|
|
- name: Setup ccache
|
|
run: |
|
|
./TheRock/build_tools/setup_ccache.py \
|
|
--config-preset "github-oss-presubmit" \
|
|
--dir "$(dirname $CCACHE_CONFIGPATH)" \
|
|
--local-path "$CACHE_DIR/ccache"
|
|
|
|
- name: Runner health status
|
|
run: |
|
|
./TheRock/build_tools/health_status.py
|
|
|
|
- name: Fetch sources
|
|
timeout-minutes: 30
|
|
run: |
|
|
# Remove patches here if they cannot be applied cleanly, and they have not been deleted from TheRock repo
|
|
# rm ./TheRock/patches/amd-mainline/rocm-systems/*.patch
|
|
./TheRock/build_tools/fetch_sources.py --jobs 12 --no-include-rocm-systems --no-include-rocm-libraries --no-include-ml-frameworks
|
|
|
|
|
|
- name: Patch rocm-systems
|
|
run: |
|
|
git -c user.name="therockbot" -c "user.email=therockbot@amd.com" am --whitespace=nowarn ./TheRock/patches/amd-mainline/rocm-systems/*.patch
|
|
|
|
- name: Configure Projects
|
|
env:
|
|
amdgpu_families: ${{ env.AMDGPU_FAMILIES }}
|
|
package_version: ADHOCBUILD
|
|
extra_cmake_options: "-DTHEROCK_ROCM_SYSTEMS_SOURCE_DIR=../ ${{ inputs.cmake_options }}"
|
|
BUILD_DIR: build
|
|
run: |
|
|
python TheRock/build_tools/github_actions/build_configure.py
|
|
|
|
- name: Build therock-dist
|
|
run: cmake --build TheRock/build --target therock-dist
|
|
|
|
- name: Build therock-archives
|
|
run: cmake --build TheRock/build --target therock-archives
|
|
|
|
- name: Report
|
|
if: ${{ !cancelled() }}
|
|
run: |
|
|
echo "Full SDK du:"
|
|
echo "------------"
|
|
du -h -d 1 TheRock/build/dist/rocm
|
|
echo "Artifact Archives:"
|
|
echo "------------------"
|
|
ls -lh TheRock/build/artifacts/*.tar.xz
|
|
echo "Artifacts:"
|
|
echo "----------"
|
|
du -h -d 1 TheRock/build/artifacts
|
|
echo "CCache Stats:"
|
|
echo "-------------"
|
|
ccache -s
|
|
|
|
- name: Configure AWS Credentials
|
|
if: ${{ always() && !github.event.pull_request.head.repo.fork }}
|
|
uses: aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722 # v4.1.0
|
|
with:
|
|
aws-region: us-east-2
|
|
role-to-assume: arn:aws:iam::692859939525:role/therock-ci-external
|
|
|
|
- name: Post Build Upload
|
|
if: always()
|
|
run: |
|
|
python TheRock/build_tools/github_actions/post_build_upload.py \
|
|
--run-id ${{ github.run_id }} \
|
|
--artifact-group ${{ env.AMDGPU_FAMILIES }} \
|
|
--build-dir TheRock/build \
|
|
--upload
|
|
|
|
therock-test-linux:
|
|
name: "Test"
|
|
needs: [therock-build-linux]
|
|
uses: ./.github/workflows/therock-test-packages.yml
|
|
with:
|
|
project_to_test: ${{ inputs.project_to_test }}
|
|
amdgpu_families: "gfx94X-dcgpu"
|
|
test_runs_on: "linux-mi325-1gpu-ossci-rocm"
|
|
platform: "linux"
|