SWDEV-2 - Change OpenCL version number from 2617 to 2618.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2365 edit
[ROCm/clr commit: 0caa7e4cc1]
SWDEV-79445 - OCL generic changes and code clean-up
- Add explicit type specialization for AQL copy types to avoid memcpy() calls in the most of cases
- Increase size of sysmem copy for constants buffers to account the internal arguments
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#46 edit
[ROCm/clr commit: 62a00d336e]
SWDEV-134107 - Add support for respecting target's xnack setting
- Port Konstantin's change of "Bring naming o par with the spec" in git (Change 139283)
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#85 edit
[ROCm/clr commit: 6002477091]
SWDEV-2 - Change OpenCL version number from 2616 to 2617.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2364 edit
[ROCm/clr commit: 4a60726fc4]
SWDEV-79445 - OCL generic changes and code clean-up
- Prepare the changes to remove resource rename feature. Resource rename can substitute a low level memory object in the current resource with a different one in order to avoid GPU stalls on frequent CPU updates, like constant buffers or staging buffers. Renaming was necessary due to CAL and old HW limitations to manage suballocations and has some inefficiency with tiny updates and staging buffers.
- This change removes renames usage from the constant buffer management
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#45 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#78 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#43 edit
[ROCm/clr commit: db04b3b295]
SWDEV-2 - Change OpenCL version number from 2615 to 2616.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2363 edit
[ROCm/clr commit: 41404377b1]
SWDEV-2 - Change OpenCL version number from 2614 to 2615.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2362 edit
[ROCm/clr commit: 2c0282ae12]
SWDEV-79445 - OCL generic changes and code clean-up
- Fix a regression in the AMF test and reenable the suballoc optimization. Rearrange the locks around cache field access only to avoid calling memory release under the cache lock.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#57 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#287 edit
[ROCm/clr commit: 1f53841421]
SWDEV-2 - Change OpenCL version number from 2613 to 2614.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2361 edit
[ROCm/clr commit: 312057dbe2]
SWDEV-133818 - PAL support for Linux Pro: Coarse Grain SVM for OpenCL 2.0
1. This change enables OCL 2.0 on Linux for devices using PAL backend.
2. Set the alignment for Coarse Grain SVM allocations to be the gpu fragment size (2MB on Linux).
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14437/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#52 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#128 edit
[ROCm/clr commit: 3683b3d03c]
SWDEV-2 - Change OpenCL version number from 2612 to 2613.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2360 edit
[ROCm/clr commit: 320062f843]
SWDEV-2 - Change OpenCL version number from 2611 to 2612.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2359 edit
[ROCm/clr commit: e74383ba6e]
SWDEV-2 - Change OpenCL version number from 2610 to 2611.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2358 edit
[ROCm/clr commit: c4a81872f3]
SWDEV-2 - Change OpenCL version number from 2609 to 2610.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2357 edit
[ROCm/clr commit: 4b9a657cb6]
SWDEV-2 - Change OpenCL version number from 2608 to 2609.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2356 edit
[ROCm/clr commit: 1376dc8503]
SWDEV-2 - Change OpenCL version number from 2607 to 2608.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2355 edit
[ROCm/clr commit: 9e5f869e1c]
SWDEV-132899 - [gfx10][OCL]- Adding support for forcing WaveSize32 from runtime for testing on gfx10 HW emulator
Motivation: During testing ocltst on Windows on PAL/HSAIL/SC path on gfx10 HW emulator, it was found that SC uses WaveSize64 by default for compute kernels.
SC also has an interface that can be used for forcing the WaveSize to 32 or 64.
- Adding the "-force-wave-size-32" into compiler to be passed down to Finalizer/SC
- Adding environment variable "GPU_FORCE_WAVE_SIZE_32" that can be used from runtime to force WaveSize32 compilation in HSAIL/SC path
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14364/
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/hsail_be.cpp#69 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#138 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#284 edit
[ROCm/clr commit: 3c0e80074b]
SWDEV-2 - Change OpenCL version number from 2606 to 2607.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2354 edit
[ROCm/clr commit: 5bec004355]
SWDEV-147487 - DX9/DX11 texture and OpenCL interop for YUY2
- Enable YUY2 support for DX11 and DX9. YUY2 contains just one plane of interleaved Y0UY1V components and can be mapped to (CL_RGBA, CL_UNSIGNED_INT8) with image width reduced by 2. YUY2 provides better quality due to 16bit data per pixel
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_d3d11.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_d3d9.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#241 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#49 edit
[ROCm/clr commit: fd3644ee58]
SWDEV-2 - Change OpenCL version number from 2605 to 2606.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2353 edit
[ROCm/clr commit: 85ae79b3e4]
SWDEV-2 - Change OpenCL version number from 2604 to 2605.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2352 edit
[ROCm/clr commit: 936cfbb4b9]
SWDEV-143465 - AMDIL - Compiler Lib - Mark as unsupported all CI+ targets
1. Mark all CI+ targets as unsupported for AMDIL target mappings.
2. Let aclGetTargetInfo, aclGetTargetInfoFromChipID, aclGetDeviceInfo to report unsupported chips.
3. Fix aclReadFromMem in order to early determination of the target of binary in memory and reporting ACL_INVALID_TARGET if target is invalid, or ACL_UNSUPPORTED if chip is unsupported.
4. Add utility inline functions IsValidTarget, IsChipSupported, make getTargetMapping inline too, as it is used in the first two.
5. Fix affected tests.
[Testing] http://ocltc.amd.com:8111/viewModification.html?modId=98724&personal=true&init=1&tab=vcsModificationBuilds (clean)
[Reviewed] by Stas (http://ocltc.amd.com/reviews/r/14277)
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/api/v0_8/acl.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/libUtils.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/libUtils.h#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings.h#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil.h#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil64.h#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#49 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#45 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIF2xTo3x.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFBase.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFSymbol.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/aclAPI.cpp#22 edit
[ROCm/clr commit: 354d0df8bd]
SWDEV-2 - Change OpenCL version number from 2603 to 2604.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2351 edit
[ROCm/clr commit: c9b33609c5]
SWDEV-2 - Change OpenCL version number from 2602 to 2603.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2350 edit
[ROCm/clr commit: 78450aa252]
SWDEV-2 - Change OpenCL version number from 2601 to 2602.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2349 edit
[ROCm/clr commit: a4b65e6c2c]
SWDEV-145716 - dGMA via OpenCL not working properly on AMD EPYC Ethanol platform with two WX9100/W9100s
- Add a stall and cache flushes before a barrier write in GSL and PAL
- Place temporary workarounds for CP prefetch of WRITE_DATA and the lack of SDMA marker in PAL
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#414 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#76 edit
[ROCm/clr commit: 30eacbe243]
SWDEV-2 - Change OpenCL version number from 2600 to 2601.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2348 edit
[ROCm/clr commit: 8e57fbbf10]
SWDEV-129762 - Remove following macros that are passed through the command line to the lightning compiler:
-D__AMD__=1
-D__devicename=1
-D__devicename__=1
-DFP_FAST_FMA=1
-DFP_FAST_FMAF=1
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#54 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#79 edit
[ROCm/clr commit: 87f42bdb73]
SWDEV-2 - Change OpenCL version number from 2599 to 2600.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2347 edit
[ROCm/clr commit: cb3754fe12]
SWDEV-2 - Change OpenCL version number from 2598 to 2599.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2346 edit
[ROCm/clr commit: 900ba35d50]
SWDEV-2 - Change OpenCL version number from 2597 to 2598.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2345 edit
[ROCm/clr commit: f4ac96ebce]
SWDEV-79445 - OCL generic changes and code clean-up
- Remove debugger checks from the dispatch path. There are no plans to enable debugger in the near future.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#74 edit
[ROCm/clr commit: b561617774]
SWDEV-2 - Change OpenCL version number from 2596 to 2597.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2344 edit
[ROCm/clr commit: 674711913a]
SWDEV-145750 - SSG Player drop in performance observed when using the OCL Api in 18.10
- Keep persistent memory mapped all time for Linux and Win10
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#73 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#48 edit
[ROCm/clr commit: 9018bc85cc]
SWDEV-102733 - Update some CMakeLists.txt files:
- when setting CMAKE_CXX_FLAGS, append to it rather than overwriting it.
- some linker flags, -Wl,--unresolved-symbols=report-all being one of them, requires -shared-libasan if using -fsanitize=address.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/CMakeLists.txt#13 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/elf/utils/libelf/CMakeLists.txt#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/CMakeLists.txt#9 edit
... //depot/stg/opencl/drivers/opencl/tools/clinfo/CMakeLists.txt#5 edit
[ROCm/clr commit: a31d6b2e6d]