EPR #397491 - changed the CPU SVM capability availablility only for OpenCL 2.0, not for 1.2.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#267 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpusettings.cpp#28 edit
[ROCm/clr commit: fe61054635]
EPR #010002 - Change OpenCL version number from 1670 to 1671.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1417 edit
[ROCm/clr commit: b3a41138c9]
EPR #010002 - Change OpenCL version number from 1669 to 1670.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1416 edit
[ROCm/clr commit: eeb3b78b29]
EPR #010002 - Change OpenCL version number from 1668 to 1669.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1415 edit
[ROCm/clr commit: b5addb1038]
EPR #010002 - Change OpenCL version number from 1667 to 1668.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1414 edit
[ROCm/clr commit: df8347472e]
EPR #010002 - Change OpenCL version number from 1666 to 1667.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1413 edit
[ROCm/clr commit: 35b929b3e3]
EPR #010002 - Change OpenCL version number from 1665 to 1666.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1412 edit
[ROCm/clr commit: 5abff30311]
EPR #407469 - disabled the SVM fine grain buffer support for CZ on mainline
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#472 edit
[ROCm/clr commit: 206bd8113d]
EPR #010002 - Change OpenCL version number from 1664 to 1665.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1411 edit
[ROCm/clr commit: 9745d6a49d]
EPR #010002 - Change OpenCL version number from 1663 to 1664.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1410 edit
[ROCm/clr commit: f885bbc25f]
EPR #010002 - Change OpenCL version number from 1662 to 1663.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1409 edit
[ROCm/clr commit: e5e20c0b67]
ECR #333753 - clc2: disallow implicit function declarations (bug 10328)
In addition, the change fixes the following side-effects:
1. Fix a typo in runtime/.../gpuschedcl.cpp, which fails due to the stricter check in Clang.
2. Unconditionally add sub_group builtins for pipes, without checking if the extension is enabled. See bug 10366.
3. Also added a test in ocl_features_clang to check for the sub_group builtins.
Passes smoke, smoke_clang, precheckin.
Additionally passes new tests added in ocl_features.
Reviewed by Brian Sumner.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/clang/lib/Sema/SemaLookup.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/clc2/wrapper/ClangWrapper.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#33 edit
[ROCm/clr commit: e25e8f4561]
EPR #010002 - Change OpenCL version number from 1661 to 1662.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1408 edit
[ROCm/clr commit: 69de913e4b]
EPR #010002 - Change OpenCL version number from 1660 to 1661.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1407 edit
[ROCm/clr commit: a4865b25bd]
EPR #405824 - On apus, if we run out of local memory to allocate cl_mem objects, ocl runtime will use remote (system) memory. Update maxMemAllocSize_ to include that.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#469 edit
[ROCm/clr commit: 1dfb720281]
EPR #010002 - Change OpenCL version number from 1659 to 1660.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1406 edit
[ROCm/clr commit: b124361187]
EPR #407358 - Enable remoteAlloc_ by default if the frame buffer is less than 128 MB on an APU system
Reviewed by: German
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#290 edit
[ROCm/clr commit: e00c0d6992]
EPR #010002 - Change OpenCL version number from 1658 to 1659.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1405 edit
[ROCm/clr commit: f4582fd447]
EPR #403341 - [Regression][OCL] P2P SDI to GPU (RGBA) and P2P SDI to GPU (RGB) show corruption on sdi output
- Enable GL Acquire/Release calls for more resource types. GL can enable color compression, but compute doesn't support it. Thus decompression can be required.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#194 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#41 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#18 edit
[ROCm/clr commit: b4f7a4274b]
EPR #010002 - Change OpenCL version number from 1657 to 1658.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1404 edit
[ROCm/clr commit: 41d7aee04a]
EPR #010002 - Change OpenCL version number from 1656 to 1657.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1403 edit
[ROCm/clr commit: 971db82561]
ECR #333753 - Performance: Stop obligatory BRIG disassembling to HSAIL
[Important]: HSAIL is not being disassembled from BRIG and not being inserted into BIF anymore by default.
Testing: pre check-in, smoke_clang
Reviewers: Stanislav Mekhanoshin, Brian Sumner, Artem Tamazov
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/hsail_be.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/hsail_be.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#60 edit
[ROCm/clr commit: 7f8d708ca9]
ECR #333753 - HSA RT/Performance: Elimination of HSAIL text usage in RT (part 2)
(as it's already done for gpuprogram.cpp)
Extracting HSAIL from the binary and parsing it for the kernel names in RT were replaced with aclQueryInfo call for RT_KERNEL_NAMES. Kernel names are obtained now from the corresponding metadata symbols names, which are already presented in BIF at kernel finalization stage.
Side effect: performance improvement
Next Step: Performance: Stop obligatory BRIG disassembling to HSAIL and insertion into BIF (previously was needed only by RT).
testing: pre check-in
Reviewers: Stanislav Mekhanoshin, German Andryeyev, Brian Sumner
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsaprogram.cpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsaprogram.cpp#6 edit
[ROCm/clr commit: 3e74ee1f88]
EPR #010002 - Change OpenCL version number from 1655 to 1656.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1402 edit
[ROCm/clr commit: 94be44eaae]
EPR #010002 - Change OpenCL version number from 1654 to 1655.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1401 edit
[ROCm/clr commit: ea12251885]
EPR #407056, #407061, #406980 - Back out changelist 1083545 since it causes a bunch of perf degradations. Will add a heurstics for -scras=2 for memory bound kernels only.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#118 edit
[ROCm/clr commit: d853eb5ebb]
EPR #010002 - Change OpenCL version number from 1653 to 1654.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1400 edit
[ROCm/clr commit: f59973500f]
ECR #333753 - new tests are added to ocltst -m oclcomplib -t OCLRTGetInfo
tests on aclQueryInfo for:
RT_KERNEL_NAMES, RT_CONTAINS_LLVMIR, RT_CONTAINS_OPTIONS, RT_CONTAINS_BRIG, RT_CONTAINS_HSAIL, RT_CONTAINS_ISA
+ query for RT_CONTAINS_HSAIL is fixed in Compiler Lib: looking for symbol symHSAILText instead of section aclCODEGEN, because aclCODEGEN section may contain also symOpenclMeta, symOpenclKernel, symOpenclStub besides symHSAILText.
Testing: pre check-in, ocltst -m oclcomplib
Reviewer: Brian Sumner
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/oclrtGetInfo.cpp#16 edit
[ROCm/clr commit: 99fdc9adbb]
EPR #010002 - Change OpenCL version number from 1652 to 1653.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1399 edit
[ROCm/clr commit: 4bdac403d2]
EPR #365147 - OGL - VI Bringup Support
Change include of si_ci_merged header to si_ci_vi_merged header since the si_ci ones are out of date and will be removed shortly.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/5980/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#25 edit
[ROCm/clr commit: c4338bb327]
EPR #406328 - made more changes so that the OpenCL can run 2.0 devices on the first 2.0 capable device and OpenCL 1.2 on the rest of devices if they can suppport. Also uncommented an assert accidently commted by the previous check in.
this is the OpenCL part of checkin.
Prechecking:
http://ocltc.amd.com:8111/viewModification.html?modId=40944&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true
code review:
http://ocltc.amd.com/reviews/r/5987/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_svm.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#288 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/context.cpp#31 edit
[ROCm/clr commit: 4390727621]
ECR #333753 - Compiler Lib/RT/Performance: Replace aclExtractSymbol/Section with aclQueryInfo for symbol/section detection.
The replaced calls in RT previously performed actual extraction of the sections from the BIF with memory allocation and copying. But what is needed in fact is only to determine whether the section exists in BIF or not to make a further decision on needed recompilations. With aclQueryInfo and new added enums RT_CONTAINS_LLVMIR, RT_CONTAINS_OPTIONS, RT_CONTAINS_BRIG, RT_CONTAINS_HSAIL, RT_CONTAINS_ISA Runtime starts querying not the whole sections but the bool flag which indicates the existance of the corresponding section(s) without any memory allocations. Every compilation on RT starting from LLVMIR is affected by the change including compilation of blit kernels.
Side Effects: performance improvement, memory consumption reduction
Testing: pre check-in, ocl conformance (api, basic, compiler), ocltst complib
Reviewers: Brian Sumner, German Andryeyev, Artem Tamazov
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#52 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclEnums.h#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#180 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLEnumCheck.cpp#38 edit
[ROCm/clr commit: 936e8f603d]
EPR #010002 - Change OpenCL version number from 1651 to 1652.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1398 edit
[ROCm/clr commit: 91f9b36376]
EPR #406328 - changed the device open algorithm so that the we only open the first OpenCL device. This is the OPENCL runtime changes, but this will be removed once we implemented multiple device support for SVM.
the code review and precheckin test:
http://ocltc.amd.com/reviews/r/5942/http://ocltc.amd.com:8111/viewModification.html?modId=40902&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#466 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#287 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#87 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/include/cal/cal.h#29 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#100 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#40 edit
[ROCm/clr commit: c248f04970]
EPR #397491 - fixed the temporary directory under windows system directory issue for windows, because apps couldn't write into C:\windows directory.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/os/os_win32.cpp#41 edit
[ROCm/clr commit: 794dc568be]