ECR #333753 - Partial fix for Bug 10478 "Fix -fno-bin-llvmir/-fno-bin-hsail options"
If option -fno-bin-llvmir is set, the .llvmir section is deleted from the BIF in the CG phase instead of the FE phase. Both HSA and AMDIL are affected.
[Fixed] The -fno-bin-llvmir option caused clBuildProgram to fail with error -11 (CL_BUILD_PROGRAM_FAILURE).
        This happened only when compiling from OpenCL.
[TODO] If possible, -fno-bin-hsail should avoid putting the HSAIL binary (BRIG) into the BIF.
[Tests] pre check-in, make smoke, complib
[Reviewers] Brian Sumner, Nikolay Haustov
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/frontend.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/frontend_clang.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/src/complib/options/-fbin-llvmir/HelloWorld_Kernel_cl.cl#1 add
... //depot/stg/opencl/drivers/opencl/tests/hsa/src/complib/options/-fno-bin-llvmir/HelloWorld_Kernel_cl.cl#1 add
... //depot/stg/opencl/drivers/opencl/tests/hsa/tlst/complib.tlst#3 edit
EPR #010002 - Change OpenCL version number from 1718 to 1719.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1465 edit
EPR #410560 - Forum: [169727] Bug? CL_MEM_COPY_HOST_PTR is not thread safe
- Ignore the transfer queue on memory release, since it releases resources after every operation
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#201 edit
EPR #010002 - Change OpenCL version number from 1717 to 1718.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1464 edit
EPR #010002 - Change OpenCL version number from 1716 to 1717.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1463 edit
EPR #412821 - Default AMD_THREAD_TRACE_ENABLE to true for stg.
This way OpenCL sanity will run the OCLThreadTrace test.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#224 edit
EPR #010002 - Change OpenCL version number from 1715 to 1716.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1462 edit
EPR #010002 - Change OpenCL version number from 1714 to 1715.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1461 edit
EPR #010002 - Change OpenCL version number from 1713 to 1714.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1460 edit
EPR #010002 - Change OpenCL version number from 1712 to 1713.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1459 edit
EPR #010002 - Change OpenCL version number from 1711 to 1712.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1458 edit
EPR #010002 - Change OpenCL version number from 1710 to 1711.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1457 edit
EPR #010002 - Change OpenCL version number from 1709 to 1710.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1456 edit
EPR #010002 - Change OpenCL version number from 1708 to 1709.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1455 edit
EPR #010002 - Change OpenCL version number from 1707 to 1708.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1454 edit
EPR #010002 - Change OpenCL version number from 1706 to 1707.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1453 edit
EPR #010002 - Change OpenCL version number from 1705 to 1706.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1452 edit
EPR #010002 - Change OpenCL version number from 1704 to 1705.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1451 edit
EPR #010002 - Change OpenCL version number from 1703 to 1704.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1450 edit
EPR #010002 - Change OpenCL version number from 1702 to 1703.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1449 edit
ECR #304775 - Don't disable second SDMA if configuration has just 1 compute ring.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#343 edit
EPR #410797 - Specific OCL kernel is 5x slower on Hawaii than on Nvidia K40 GPU when tested under Linux.
- The local workgroup size search was prioritizing ALU utilization, but with multidimensional launches the X dimension can affect address calculation and cacheline utilization more than the other dimensions. Take the cacheline size into consideration as well.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#270 edit
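Note: the idea of weighing the X dimension against the cacheline size can be illustrated with a minimal sketch. This is not the code from gpukernel.cpp; the function name pickLocalSize2D, its parameters, and the power-of-two assumption are all hypothetical and chosen only for illustration.

```cpp
#include <cstddef>
#include <utility>

// Hypothetical helper, not the runtime's actual heuristic.
// Splits a fixed work-group size (assumed to be a power of two) into (x, y).
// It picks the smallest power-of-two x for which one row of work-items,
// each touching bytesPerItem contiguous bytes, covers a full cache line.
// If no such x exists, x grows to the whole group and y becomes 1.
inline std::pair<std::size_t, std::size_t> pickLocalSize2D(
    std::size_t groupSize, std::size_t bytesPerItem, std::size_t cachelineBytes) {
    std::size_t x = 1;
    while (x < groupSize && x * bytesPerItem < cachelineBytes) {
        x *= 2;
    }
    return {x, groupSize / x};
}
```

With a group of 256 work-items, 4 bytes per item, and a 64-byte cache line, this sketch settles on a 16x16 split; wider elements shrink X, narrower ones grow it.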
EPR #411189 - Fix Windows handling of SIGFPE. We cannot check the worker thread during creation, because the constructor has not yet been called.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/os/os_win32.cpp#43 edit
EPR #411058 - [CQE OCL][Lnx][QR][CZ]MultiDevice_Context fails in 2.0 conformance wimpyfull due to CL# 1101352
- Detecting different map types is overcomplicated given the possibility of multiple maps and a multithreaded environment. Keep the USWC indirect map optimization based on the allocation flags instead.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#114 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#342 edit
EPR #010002 - Change OpenCL version number from 1701 to 1702.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1448 edit
EPR #010002 - Change OpenCL version number from 1700 to 1701.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1447 edit
EPR #405889 - Added an option to set VGPR/SGPR/LDS usage in the ISA to a given value greater than the actual usage, for debugging purposes. If the given value is smaller than the actual value, this option has no effect.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#52 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scHWShaderInfo.h#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#121 edit
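Note: the "no effect if smaller" rule above amounts to taking the larger of the actual and requested values. A minimal sketch follows, assuming a hypothetical ResourceUsage struct and applyUsageOverride helper; the real option names and shader-info fields are not shown in this changelist.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical container for the three resources named in the entry.
struct ResourceUsage {
    std::uint32_t vgprs;
    std::uint32_t sgprs;
    std::uint32_t ldsBytes;
};

// Returns the usage to report. A requested value only takes effect when it
// exceeds the actual usage; smaller requests leave the actual value intact.
inline ResourceUsage applyUsageOverride(const ResourceUsage& actual,
                                        const ResourceUsage& requested) {
    return ResourceUsage{
        std::max(actual.vgprs, requested.vgprs),
        std::max(actual.sgprs, requested.sgprs),
        std::max(actual.ldsBytes, requested.ldsBytes)};
}
```

Leaving a requested field at 0 keeps the compiler-reported value for that resource.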
EPR #010002 - Change OpenCL version number from 1699 to 1700.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1446 edit
EPR #010002 - Change OpenCL version number from 1698 to 1699.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1445 edit
EPR #410824 - [CQE OCL][CZ][S/G][QR] Two Bolt sample failing on CPU; Faulty CL: 1101352
- The test performs two maps with different map flags. The optimization could choose a different map scheme for each call, which could break memory coherency. Add extra conditions to detect multiple maps and use the same path as the first map.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#45 edit
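Note: "use the same path as the first map" can be sketched as a small piece of bookkeeping. The MapPath enum and MapState class below are hypothetical and are not the types in gpumemory.hpp; the sketch is single-threaded and ignores locking.

```cpp
#include <cstdint>

// Hypothetical labels for the two map schemes.
enum class MapPath { Direct, Indirect };

// Hypothetical per-allocation state: the first outstanding map decides the
// path, and any further map issued before the matching unmaps reuses it.
class MapState {
public:
    // 'preferred' is the path the optimization would pick for this call alone.
    MapPath map(MapPath preferred) {
        if (mapCount_ == 0) {
            path_ = preferred;  // first map decides
        }
        ++mapCount_;
        return path_;           // later maps follow the first one
    }

    void unmap() {
        if (mapCount_ > 0) {
            --mapCount_;
        }
    }

private:
    std::uint32_t mapCount_ = 0;
    MapPath path_ = MapPath::Direct;
};
```

Once every map has been released, the next map is free to choose a path again.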
EPR #010002 - Change OpenCL version number from 1697 to 1698.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1444 edit
EPR #010002 - Change OpenCL version number from 1696 to 1697.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1443 edit
EPR #410736 - [CQE OCL][ISV][QR][G] FFMPEG app generating corrupted video output; Faulty CL:1101352
- Add detection for AHP allocation.
FFmpeg uses AHP allocations with the CL_MAP_READ flag, but actually performs CPU writes into the buffer. With an indirect map, the runtime executes a useless transfer on map and doesn't write the updated memory back on unmap, because the app passes the wrong flag.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#113 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#341 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/perf/TestList.cpp#40 edit
EPR #010002 - Change OpenCL version number from 1695 to 1696.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1442 edit
EPR #010002 - Change OpenCL version number from 1694 to 1695.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1441 edit