SWDEV-2 - Change OpenCL version number from 2377 to 2378.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2124 edit
SWDEV-2 - Change OpenCL version number from 2376 to 2377.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2123 edit
SWDEV-2 - Change OpenCL version number from 2375 to 2376.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2122 edit
SWDEV-2 - Change OpenCL version number from 2374 to 2375.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2121 edit
SWDEV-2 - Change OpenCL version number from 2373 to 2374.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2120 edit
SWDEV-2 - Change OpenCL version number from 2372 to 2373.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2119 edit
SWDEV-2 - Change OpenCL version number from 2371 to 2372.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2118 edit
SWDEV-2 - Change OpenCL version number from 2370 to 2371.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2117 edit
SWDEV-107271 - For APU system, when allocating USWC with enough USWC reported and the allocation fails, we should return false instead of looping forever.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#12 edit
SWDEV-2 - Change OpenCL version number from 2369 to 2370.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2116 edit
SWDEV-2 - Change OpenCL version number from 2368 to 2369.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2115 edit
SWDEV-99150 - Convert PerfCounter indexing scheme from ORCA to PAL.
PAL is using a different PerfCounter indexing scheme, so need to make the conversion to make CodeXL behave the same.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.hpp#8 edit
SWDEV-2 - Change OpenCL version number from 2367 to 2368.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2114 edit
SWDEV-2 - Change OpenCL version number from 2366 to 2367.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2113 edit
SWDEV-2 - Change OpenCL version number from 2365 to 2366.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2112 edit
SWDEV-2 - Change OpenCL version number from 2364 to 2365.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2111 edit
SWDEV-112401 - [OCLonROCm vs LinuxPro][Fiji][Perf] Around ~ 70% performance drop observed with OCL SDK - BlackScholes test
- Switch persistent emulation to device memory instead of system memory, since ROCr doesn't support visible heap without large bar. It's a potential tradeoff of persistent usage in the applications between an upload buffer or directly in a kernel.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#11 edit
SWDEV-2 - Change OpenCL version number from 2363 to 2364.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2110 edit
SWDEV-2 - Change OpenCL version number from 2362 to 2363.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2109 edit
SWDEV-99150 - Update ORCA PerfCounter enum conversion to PAL.
PAL changes its PerfCounter enum in palPerfExperiment.h since version 236.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#8 edit
SWDEV-2 - Change OpenCL version number from 2361 to 2362.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2108 edit
SWDEV-107075 - [OCL][VG10] Conformance RQ Mipmaps tests are failing. Adding the below changes to fix the failures
- For 1Darray images the Height should be 1 and we should use the correct value for "numSlices" in copyRegion
- For 2Darray images the Depth should be 1 and we should use the correct value for "numSlices" in copyRegion
- The right Height should be used when gpuMemoryDepthPitch is calculated
Also, PAL was not obeying the "numSlices" parameter for SDMA-based memory <--> image or image <--> image copies and this issue is fixed in CL#1369626.
The current changes in OCL runtime will be submitted into stg once the above CL from PAL gets promoted into OCL stg.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/12260/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#17 edit
SWDEV-2 - Change OpenCL version number from 2360 to 2361.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2107 edit
SWDEV-2 - Change OpenCL version number from 2359 to 2360.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2106 edit
SWDEV-103424 - [ROCm CQE][OCL] OCLRuntime - OCLCreateBuffer tests are failing. The failure is due to AQL cannot support global size > 32bit range. Adding dispatch split support for ROCm, similar to that of GSL (CL#1159349), to resolve the issue.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLCreateBuffer.cpp#6 edit
SWDEV-2 - Change OpenCL version number from 2358 to 2359.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2105 edit
SWDEV-112661 - [ROCm CQE][OCLonLC][QR][G] 80 failures observed with Images(Samplerless and Samplerless pitch) tests
- Make sure the pitch for image buffer is properly aligned to the reported device info
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#10 edit
SWDEV-112171 - [ROCm CQE][OCLonLC][QR][G] System hangs/Failures observed with few WF conf tests, due to CL#1364923
- reenable HSA copy in ROCr, since OCL copy doesn't really fix the hang issue in integer_ops, but just delays it.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#30 edit
SWDEV-2 - Change OpenCL version number from 2357 to 2358.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2104 edit
SWDEV-2 - Change OpenCL version number from 2356 to 2357.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2103 edit
SWDEV-2 - Change OpenCL version number from 2355 to 2356.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2102 edit
SWDEV-107271 - Fix crash that happens when Resource::create() fails and "owner" gets cleared. This is only triggered by Conformance test Basic progvar_prog_scope_misc on Raven with 16GB memory (8GB is fine), but should be a general bug.
http://ocltc.amd.com/reviews/r/12253
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#11 edit
SWDEV-2 - Change OpenCL version number from 2354 to 2355.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2101 edit
SWDEV-2 - Change OpenCL version number from 2353 to 2354.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2100 edit
SWDEV-112352 - [ROCm CQE][OCLonLC][QR][G] Aborted/Segmentation fault is observed with "sub_buffers_read_write_dual_devices" of Buffers, due to CL#1364923
- Keep system memory for multiple devices in case of possible initialization on the second device
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#41 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#9 edit
SWDEV-112171 - [ROCm CQE][OCLonLC][QR][G] System hangs/Failures observed with few WF conf tests, due to CL#1364923
- Disable ROCr copy buffer and use runtime kernel to avoid HW hangs in integer_ops.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#29 edit
SWDEV-112171 - [ROCm CQE][OCLonLC][QR][G] System hangs/Failures observed with few WF conf tests, due to CL#1364923
- Back out changelist 1367810. A hang was delayed, but still occured much later in the test.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#40 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#16 edit
SWDEV-2 - Change OpenCL version number from 2352 to 2353.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2099 edit
SWDEV-105835 - ROCm OpenCL: add -amdgpu-internalize-symbols to BE
The option -amdgpu-internalize-symbols allows to drop unused symbols from program,
functions and global variables. This saves compile time and object size, a lot in
case of a big program.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#55 edit