ECR #304775 - OpenCL version should be 2.0 for only CI+
ReviewBoardURL = http://ocltc.amd.com/reviews/r/5311/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#448 edit
EPR #010002 - Change OpenCL version number from 1592 to 1593.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1339 edit
EPR #010002 - Change OpenCL version number from 1591 to 1592.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1338 edit
EPR #010002 - Change OpenCL version number from 1590 to 1591.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1337 edit
ECR #333753 - Compiler Lib: Error if wrong combination of march & OpenCL version
HSAIL doesn't support OpenCL version < 2.0
AMDIL doesn't support OpenCL version >= 2.0
Affects only ORCA builds (both 1.2 & 2.0), doesn't affect .hsa build.
Testing: smoke_clang, pre check-in
Reviewer: Stanislav Mekhanoshin
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#38 edit
EPR #010002 - Change OpenCL version number from 1589 to 1590.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1336 edit
EPR #010002 - Change OpenCL version number from 1588 to 1589.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1335 edit
EPR #389586 - Add workaround for VI SPI SGPR initialization hardware bug for HSAIL path.
There is a hardware bug in VI (UBTS502672) which requires a workaround. Compute shaders need to tell shader compiler the available sGPR is 78 and set sGPUR usage in the compiled ISA to be 94. It has been done in AMDIL path but not done in HSAIL path. This change will apply the workaround to HSAIL path.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/devStateSI.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/devStateSI.h#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#41 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#109 edit
EPR #010002 - Change OpenCL version number from 1587 to 1588.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1334 edit
ECR #304775 - Device enqueuing
- Switch to the single thread scheduler for now(the current version isn't friendly for single thread). Hopefully it's a temporary solution until synchronization issue with multithreaded scheduler will be identified.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#104 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#20 edit
EPR #394069 - [CQE OCL][ISV][QR][G] Debugger CAL path no longer works due to OCL CL # 1003498. Fix ACL path to support debugger
ReviewBoardURL=http://ocltc.amd.com/reviews/r/5245/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpubinary.cpp#57 edit
EPR #010002 - Change OpenCL version number from 1586 to 1587.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1333 edit
EPR #010002 - Change OpenCL version number from 1585 to 1586.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1332 edit
ECR #304775 - Fix valgrind use of uninitialized errors, modernize by using unique_ptr, and use existing function to extract filename from path.
Should probably also fix the static constructors this is called from.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/os/os_posix.cpp#37 edit
ECR #377625 - Allow function call for function with internal linkage.
Internal linkage correponds to static function in C. The function could be large and should be allowed to be not inlined.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#106 edit
EPR #010002 - Change OpenCL version number from 1584 to 1585.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1331 edit
EPR #010002 - Change OpenCL version number from 1583 to 1584.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1330 edit
EPR #397491 - enabled the svm fine grain buffer for stg, disabled for mainline
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#446 edit
EPR #010002 - Change OpenCL version number from 1582 to 1583.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1329 edit
EPR #304775 - temporarily disable the SVM fine_grained_buffer support for OpenCL 2.0 on discrete GPUs, because the feature is supposed to release in 14.50. After the 14.40 is branched, we will enable it again on stg.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#445 edit
ECR #304775 - Device enqueuing
- Add printing of the waiting events
- Add early exit in the scheduler if nothing to launch
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#321 edit
EPR #010002 - Change OpenCL version number from 1581 to 1582.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1328 edit
EPR #397491 - Disable platform atomics temporarily until AFE which will be done on July 8.
Modify the flag of GPU_ENABLE_HIGH_PERFORMANCE_STATE to use it for platform atomics because GPU_ENABLE_HIGH_PERFORMANCE_STATE is not necessary for high clock anymore.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#271 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#206 edit
EPR #010002 - Change OpenCL version number from 1580 to 1581.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1327 edit
EPR #010002 - Change OpenCL version number from 1579 to 1580.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1326 edit
ECR #304775 - Device enqueuing
- Match the printed value width with the argument size
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#319 edit
ECR #304775 - Device enqueuing
- Added debug print for the generated child kernels. GPU_PRINT_CHILD_KERNEL=N, where N is the number of child kernels for dump.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#318 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#205 edit