EPR #010002 - Change OpenCL version number from 1770 to 1771.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1517 edit
[ROCm/clr commit: fd97e52530]
EPR #010002 - Change OpenCL version number from 1769 to 1770.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1516 edit
[ROCm/clr commit: 3f6c269c74]
EPR #010002 - Change OpenCL version number from 1768 to 1769.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1515 edit
[ROCm/clr commit: 9bff526ef0]
ECR #333753 - TargetMachine is created not only for codegen, but for optimizer as well
This is to provide target specific optimizations in the intermediate optimizer.
LLVM 3.6 provides TargetTransformInfo for this purpose which requires TargetMachine.
No correctness issues will occur if TargetInfo is not created, but optmizations will target generic machine.
Testing: smoke, precheckin
Reviewed by Evgeny Mankov
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/codegen.cpp#60 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/opt_level.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/opt_level.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/optimizer.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#16 edit
[ROCm/clr commit: 5f67b5bb68]
EPR #010002 - Change OpenCL version number from 1767 to 1768.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1514 edit
[ROCm/clr commit: e8b97ef23f]
EPR #010002 - Change OpenCL version number from 1766 to 1767.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1513 edit
[ROCm/clr commit: 401506bc9a]
EPR #010002 - Change OpenCL version number from 1765 to 1766.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1512 edit
[ROCm/clr commit: 7ec0c127ec]
EPR #010002 - Change OpenCL version number from 1764 to 1765.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1511 edit
[ROCm/clr commit: 7dc723e95e]
EPR #010002 - Change OpenCL version number from 1763 to 1764.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1510 edit
[ROCm/clr commit: 2873d2b27d]
EPR #410989 - [Project Brahma] - Report GPU devices as 1.2 until 2.0 is supported on Brahma stack.
1. Remove BRAHMA related definition from opencldefs since they are not needed.
2. Add the definition BRAHMA to runtimedefs since brahma stack is only for runtime/gpu
3. Report GPU devices as 1.2 until 2.0 is supported on Brahma stack.
Affected files ...
... //depot/stg/opencl/drivers/opencl/opencldefs#157 edit
... //depot/stg/opencl/drivers/opencl/runtime/runtimedefs#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#227 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/macros.hpp#6 edit
[ROCm/clr commit: 51f3eeb8f2]
EPR #010002 - Change OpenCL version number from 1762 to 1763.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1509 edit
[ROCm/clr commit: 58b20f1bc4]
EPR #394115 - [OpenCL][Arctic Islands] adding aiPlus_ variable for AI and post AI features
ReviewBoardURL = http://ocltc.amd.com/reviews/r/7167/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#307 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#90 edit
[ROCm/clr commit: 88afeee3e3]
EPR #010002 - Change OpenCL version number from 1761 to 1762.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1508 edit
[ROCm/clr commit: d8b3117f8d]
EPR #010002 - Change OpenCL version number from 1760 to 1761.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1507 edit
[ROCm/clr commit: 964ddc0a84]
EPR #010002 - Change OpenCL version number from 1759 to 1760.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1506 edit
[ROCm/clr commit: 7634061989]
ECR #399840 - OpenCL Runtime HW Debug support development - add support to the VI asics & support the use case of debug registeration in a pre-dispatch callback function
** Cross branch check-in with CL1131894
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_debugger_amd.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudebugmanager.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#501 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#139 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputrap.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#355 edit
[ROCm/clr commit: 95596795fc]
EPR #010002 - Change OpenCL version number from 1758 to 1759.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1505 edit
[ROCm/clr commit: 3da2bd1161]
ECR #304775 - Mipmaps support
- Create views for the specified mip level in the transfer operations
- OCL requires just one mip level transfer. Thus we could keep the original blit kernels and just create a view for the specified mip level.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#75 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#500 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#119 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.hpp#90 edit
[ROCm/clr commit: f20ddcf1ce]
EPR #010002 - Change OpenCL version number from 1757 to 1758.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1504 edit
[ROCm/clr commit: 65421eb202]
ECR #377625 - Refactor amdil function support. Move the code from linker.cpp to llvm32 to facilitate porting linker.cpp to llvm36.
Also made changes to respect user specified noinline attribute and added options to allow a list of functions to be inlined or not inlined to facilitate debugging.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#119 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/include/llvm/AMDILFuncSupport.h#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Linker/AMDILFuncSupport.cpp#1 add
[ROCm/clr commit: cc17b01cee]
EPR #010002 - Change OpenCL version number from 1756 to 1757.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1503 edit
[ROCm/clr commit: 6b505035c0]
EPR #010002 - Change OpenCL version number from 1755 to 1756.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1502 edit
[ROCm/clr commit: f24223953a]
EPR #010002 - Change OpenCL version number from 1754 to 1755.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1501 edit
[ROCm/clr commit: 848605153d]
EPR #394115 - Adding the environment variable "GPU_SELECT_COMPUTE_RINGS_ID" to select a specific compute queue for OCL submission. This EV was requested from KMD team for testing the CWSR demo on CZ.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/7082/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#354 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#111 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#226 edit
[ROCm/clr commit: 0494cd6ace]
EPR #010002 - Change OpenCL version number from 1753 to 1754.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1500 edit
[ROCm/clr commit: 034de4c607]
EPR #415603 - [CQE OCL][QR][CPU] 29 of 34 SPIR tests failed in all CPUs due to CL#1127589.
Two changes:
1. OCL builtin needs to be demangled.
2. setupLoaderObject() needs to be moved into dll and called through comp lib interface since it needs to reset function pointers of several comp lib interface.
Otherwise it will use function pointers of the static comp lib and causes strange errors.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/aclLoaders.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.h#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclFunctors.h#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclStructs.h#15 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclTypes.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Linker/AMDFixupKernelModule.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLAssumptionCheck.cpp#40 edit
[ROCm/clr commit: eca7138cd1]
EPR #010002 - Change OpenCL version number from 1752 to 1753.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1499 edit
[ROCm/clr commit: 36782c8b2a]
EPR #414133 - HSA HLC: fixed device enqueue code for no-capture case
A dead store to a context structure may be generated by clang, while our code was only
expecting a bitcast.
Affected files ...
... //depot/stg/opencl/drivers/opencl/compiler/llvm36/lib/Transforms/Scalar/AMDLowerEnqueueKernel.cpp#6 edit
[ROCm/clr commit: 51f603a4cc]
EPR #010002 - Change OpenCL version number from 1751 to 1752.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1498 edit
[ROCm/clr commit: 0428e6eb52]
EPR #010002 - Change OpenCL version number from 1750 to 1751.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1497 edit
[ROCm/clr commit: e0a8c76219]
EPR #010002 - Change OpenCL version number from 1749 to 1750.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1496 edit
[ROCm/clr commit: 3207f16588]
EPR #415638 - Improve APU performance
- Force remote allocation of local and persistent memory to Remote from RemoteUSWC:
- Use gpu copy for remote/pinned image/buffer.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#114 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#211 edit
[ROCm/clr commit: 637492a7dd]
EPR #010002 - Change OpenCL version number from 1748 to 1749.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1495 edit
[ROCm/clr commit: 1976010ccd]