SWDEV-79445 - OCL generic changes and code clean-up
- Fall back to system memory without device access if USWC allocation fails
- Destroy the backing store after all kernels are loaded
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#65 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.hpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#69 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#24 edit
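The USWC fallback described in this change can be sketched as below. This is a minimal illustration only, not the code in palprogram.cpp: HeapKind, Allocation, TryAlloc and allocateBackingStore are hypothetical names, and the real allocator is replaced by a caller-supplied callback.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>

// Hypothetical memory kinds; the names are illustrative, not PAL identifiers.
enum class HeapKind { Uswc, SystemNoDeviceAccess };

struct Allocation {
  HeapKind kind;
  std::size_t size;
};

// Stand-in for the real allocator: returns true if the request succeeded.
using TryAlloc = std::function<bool(HeapKind, std::size_t)>;

// Try USWC first; if that fails, fall back to system memory without
// device access. Returns false only if both attempts fail.
bool allocateBackingStore(std::size_t size, const TryAlloc& tryAlloc,
                          Allocation& out) {
  assert(size > 0);
  const HeapKind order[] = {HeapKind::Uswc, HeapKind::SystemNoDeviceAccess};
  for (HeapKind kind : order) {
    if (tryAlloc(kind, size)) {
      out = Allocation{kind, size};
      return true;
    }
  }
  return false;
}
```

A caller would inspect out.kind afterwards to learn which path was taken, for example to decide whether the device can read the memory directly.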
SWDEV-2 - Change OpenCL version number from 2732 to 2733.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2480 edit
SWDEV-2 - Change OpenCL version number from 2731 to 2732.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2479 edit
SWDEV-79445 - Remove use of irif.
This is now directly linked in the library build and will be
removed as a standalone library.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#67 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#83 edit
SWDEV-2 - Change OpenCL version number from 2730 to 2731.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2478 edit
SWDEV-126897 - Use the new __amd_scheduler_rocm.
It's only a name change.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocschedcl.cpp#2 edit
SWDEV-2 - Change OpenCL version number from 2729 to 2730.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2477 edit
SWDEV-2 - Change OpenCL version number from 2728 to 2729.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2476 edit
SWDEV-143465 - HSAIL - Compiler Lib - Stop supporting SPIR binary on CI+ as well
This change plugs a hole where compiling from a precompiled SPIR binary was still possible on CI+ devices without the "-x spir" option specified.
SPIR text has already been deprecated.
[Reviewers] Brian Sumner, Stanislav Mekhanoshin
[Testing] http://ocltc.amd.com:8111/viewModification.html?modId=107222&personal=true&tab=vcsModificationBuilds
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#101 edit
SWDEV-2 - Change OpenCL version number from 2727 to 2728.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2475 edit
SWDEV-79445 - OCL generic changes and code clean-up
- Use SDMA staging transfers for data upload if pinning fails. Fixes a HIP failure in a test that uses the code segment data for upload.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#26 edit
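The pin-or-stage decision in this change might look roughly like the sketch below. It is an assumption-laden illustration, not the palblit.cpp code: upload, pinOk and stagingSize are hypothetical, and std::memcpy stands in for the SDMA engine.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Copies size bytes from src to dst. If pinning succeeded (pinOk), the
// whole range is transferred in one step. Otherwise the data goes through
// a fixed-size staging buffer, one chunk at a time.
// Returns the number of staged chunks used (0 on the pinned path).
std::size_t upload(const unsigned char* src, unsigned char* dst,
                   std::size_t size, bool pinOk, std::size_t stagingSize) {
  assert(stagingSize > 0);
  if (pinOk) {
    std::memcpy(dst, src, size);  // stand-in for a direct transfer
    return 0;
  }
  std::vector<unsigned char> staging(stagingSize);
  std::size_t chunks = 0;
  for (std::size_t offset = 0; offset < size; offset += stagingSize) {
    const std::size_t n = std::min(stagingSize, size - offset);
    std::memcpy(staging.data(), src + offset, n);  // host -> staging
    std::memcpy(dst + offset, staging.data(), n);  // staging -> device
    ++chunks;
  }
  return chunks;
}
```

The staged path costs an extra copy per chunk, but it works for source memory that cannot be pinned, such as the code segment data mentioned above.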
SWDEV-2 - Change OpenCL version number from 2726 to 2727.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2474 edit
SWDEV-2 - Change OpenCL version number from 2725 to 2726.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2473 edit
SWDEV-2 - Change OpenCL version number from 2724 to 2725.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2472 edit
SWDEV-160377 - [OCL][ROCM] Add OpenCL Runtime Support for GFX10 in ROCm - disable XNACK for Navi10 and Ariel
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdefs.hpp#19 edit
SWDEV-2 - Change OpenCL version number from 2723 to 2724.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2471 edit
SWDEV-2 - Change OpenCL version number from 2722 to 2723.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2470 edit
SWDEV-159881 - [OCL][ROCm] Add SVM coarse-grain buffer support with device memory
1. Use the system memory pool for coarse grain allocations on APUs
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15671/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#64 edit
SWDEV-2 - Change OpenCL version number from 2721 to 2722.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2469 edit
SWDEV-161959 - [ROCm QA][RAVEN] QCDGPU-S test is having ERROR -61: (clCreateKernel failed) on RAVEN
SWDEV-161983 - [ROCm QA][RAVEN] Cachebench test is failing with CL_INVALID_BUFFER_SIZE issue
SWDEV-161978 - [ROCm QA][RAVEN] PCIeBW is failing on -with error : 61, OpenCL error creating buffer !
SWDEV-161962 - [ROCm QA][RAVEN] rodinia->nw test has ERROR: clCreateBuffer input_item_set (size:67125249) => -61
- Make the global memory size at least 1 GB. This avoids issues/regressions if the sysconf API misbehaves.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15660/
Tests:
1. ocltst -m oclruntime.so -A oclruntime.exclude - PASS except SVM test (non regression)
2. TeamCity presubmission test (OpenCL) - PASS
3. Run test qcdgpu-s.sh : PASS
4. Run test cachebench-ocl : PASS
5. Run test PCIeBandwidth -c 0 -g 0 : PASS
6. Run test Rodinia/opencl/nw/run : PASS
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#98 edit
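The 1 GB floor described in this change could be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the floor is taken to be 1 GiB (2^30 bytes), and _SC_PHYS_PAGES is assumed to be available, as it is on Linux with glibc.

```cpp
#include <cassert>
#include <cstdint>
#include <unistd.h>

// Assumed floor of 1 GiB; the log entry only says "1G byte".
constexpr std::uint64_t kMinGlobalMemSize = 1ull << 30;

// Computes the memory size from sysconf-style results and applies the
// floor. sysconf returns -1 on error, so non-positive inputs count as 0.
std::uint64_t globalMemSizeFromSysconf(long pages, long pageSize) {
  std::uint64_t reported = 0;
  if (pages > 0 && pageSize > 0) {
    reported = static_cast<std::uint64_t>(pages) *
               static_cast<std::uint64_t>(pageSize);
  }
  const std::uint64_t result =
      reported < kMinGlobalMemSize ? kMinGlobalMemSize : reported;
  assert(result >= kMinGlobalMemSize);
  return result;
}

// Queries the running system and applies the same floor.
std::uint64_t queryGlobalMemSize() {
  return globalMemSizeFromSysconf(sysconf(_SC_PHYS_PAGES),
                                  sysconf(_SC_PAGESIZE));
}
```

Separating the arithmetic from the sysconf call keeps the clamping logic testable with failing or implausible inputs.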
SWDEV-161959 - [ROCm QA][RAVEN] QCDGPU-S test is having ERROR -61: (clCreateKernel failed) on RAVEN
SWDEV-161983 - [ROCm QA][RAVEN] Cachebench test is failing with CL_INVALID_BUFFER_SIZE issue
SWDEV-161978 - [ROCm QA][RAVEN] PCIeBW is failing on -with error : 61, OpenCL error creating buffer !
SWDEV-161962 - [ROCm QA][RAVEN] rodinia->nw test has ERROR: clCreateBuffer input_item_set (size:67125249) => -61
- Set the global memory size to half of the system physical memory size on APUs for ROCm
- As with the current dGPU calculation, the environment variable GPU_SINGLE_ALLOC_PERCENT can be used to adjust the maximum memory allocation size.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15659/
Tests:
1. ocltst -m oclruntime.so -A oclruntime.exclude - PASS except SVM test (non regression)
2. TeamCity presubmission test (OpenCL) - PASS
http://ocltc.amd.com:8111/viewModification.html?modId=106628&personal=true&init=1&tab=vcsModificationBuilds
3. Run test qcdgpu-s.sh : PASS
4. Run test cachebench-ocl : PASS
5. Run test PCIeBandwidth -c 0 -g 0 : PASS
6. Run test Rodinia/opencl/nw/run : PASS
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#97 edit
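The sizing rule in this change can be illustrated with the sketch below. The helper names are hypothetical, and the parsing and range checks on GPU_SINGLE_ALLOC_PERCENT are assumptions for illustration; the runtime's actual handling of the variable may differ.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdlib>

// Global memory on an APU: half of the system physical memory.
std::uint64_t apuGlobalMemSize(std::uint64_t physicalBytes) {
  return physicalBytes / 2;
}

// Parses a percentage in [1, 100] from text, such as the value of
// GPU_SINGLE_ALLOC_PERCENT. Returns fallback if text is missing or invalid.
unsigned allocPercent(const char* text, unsigned fallback) {
  if (text == nullptr || *text == '\0') return fallback;
  char* end = nullptr;
  const unsigned long value = std::strtoul(text, &end, 10);
  if (*end != '\0' || value == 0 || value > 100) return fallback;
  return static_cast<unsigned>(value);
}

// Maximum single allocation: the given percentage of global memory.
// Dividing first avoids overflow for very large sizes.
std::uint64_t maxAllocSize(std::uint64_t globalBytes, unsigned percent) {
  assert(percent >= 1 && percent <= 100);
  return globalBytes / 100 * percent;
}
```

A caller might combine these as maxAllocSize(apuGlobalMemSize(physical), allocPercent(std::getenv("GPU_SINGLE_ALLOC_PERCENT"), 75)), where the default of 75 is a placeholder and not a value taken from the runtime.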
SWDEV-2 - Change OpenCL version number from 2720 to 2721.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2468 edit
SWDEV-2 - Change OpenCL version number from 2719 to 2720.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2467 edit
SWDEV-160930 - Back out CL1594730 as it is failing OCL/LC and HIP tests
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#59 edit
SWDEV-2 - Change OpenCL version number from 2718 to 2719.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2466 edit
SWDEV-160930 - SPECworkstation 3 benchmark GPU Compute tests fail
Root cause: The Caffe compute benchmark fails within the SPECworkstation app because one of Caffe's OCL kernels is launched with a local_work_size of 1024, which causes clEnqueueNDRangeKernel to return CL_INVALID_WORK_GROUP_SIZE (the maximum allowed size is 256).
Proposed workaround: To run a kernel with a local_work_size of 1024, check the number of VGPRs the kernel uses; if the kernel does not use all the available VGPRs, allow it to use 1024 as the local_work_size.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15638/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#58 edit
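The proposed workaround can be sketched as below. This is a simplified reading of the description, not the palkernel.cpp code: the function names are hypothetical, and the VGPR counts are passed in rather than read from kernel metadata.

```cpp
#include <cassert>
#include <cstdint>

constexpr std::uint32_t kDefaultMaxWorkGroupSize = 256;
constexpr std::uint32_t kExtendedMaxWorkGroupSize = 1024;

// A kernel that leaves some VGPRs unused is allowed the extended limit;
// a kernel that uses every available VGPR keeps the default limit.
std::uint32_t maxWorkGroupSize(std::uint32_t usedVgprs,
                               std::uint32_t availableVgprs) {
  assert(usedVgprs <= availableVgprs);
  return usedVgprs < availableVgprs ? kExtendedMaxWorkGroupSize
                                    : kDefaultMaxWorkGroupSize;
}

// Mirrors the check that would otherwise yield
// CL_INVALID_WORK_GROUP_SIZE for an oversized local_work_size.
bool isLocalSizeValid(std::uint32_t localWorkSize, std::uint32_t usedVgprs,
                      std::uint32_t availableVgprs) {
  return localWorkSize > 0 &&
         localWorkSize <= maxWorkGroupSize(usedVgprs, availableVgprs);
}
```

With these assumptions, a local_work_size of 1024 is accepted for a kernel using 64 of 256 VGPRs and rejected for a kernel using all 256.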
SWDEV-2 - Change OpenCL version number from 2717 to 2718.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2465 edit
SWDEV-2 - Change OpenCL version number from 2716 to 2717.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2464 edit
SWDEV-2 - Change OpenCL version number from 2715 to 2716.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2463 edit
SWDEV-145570 - [HIP] Output Kernel name and mem arguments passed with LOG_LEVEL=3 for PAL and ROCm backends
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15617/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#120 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#62 edit