Commit graph

2106 Commits

Author SHA1 Message Date
foreman 083efcd692 P4 to Git Change 1464512 by gandryey@gera-w8 on 2017/09/28 12:57:40
SWDEV-133851 - [CQE OCL][1.2][LNX-PRO] A subtest from OCLcompiler is failing due to faulty cl#1458879
	- If pinning failed and allocation was forced to system memory, then copy the original data

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#577 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#64 edit


[ROCm/clr commit: af98be0351]
2017-09-28 13:03:15 -04:00
foreman e8ec5c989c P4 to Git Change 1464352 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/28 03:00:05
SWDEV-2 - Change OpenCL version number from 2507 to 2508.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2255 edit


[ROCm/clr commit: 2276473e9d]
2017-09-28 03:08:07 -04:00
foreman 2ebbbd9c3a P4 to Git Change 1463727 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/27 03:00:05
SWDEV-2 - Change OpenCL version number from 2506 to 2507.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2254 edit


[ROCm/clr commit: 75653253bb]
2017-09-27 03:07:56 -04:00
foreman b7c4cf0b6b P4 to Git Change 1463205 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/26 03:00:05
SWDEV-2 - Change OpenCL version number from 2505 to 2506.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2253 edit


[ROCm/clr commit: c7ea67c7c6]
2017-09-26 03:06:41 -04:00
foreman 5c13941648 P4 to Git Change 1463068 by lmoriche@lmoriche_opencl_dev2 on 2017/09/25 17:58:14
SWDEV-86035 - Fix OCL/PAL build on Linux

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#38 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#60 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#33 edit


[ROCm/clr commit: 98366d2b59]
2017-09-25 18:16:57 -04:00
foreman 4b8a49106f P4 to Git Change 1462860 by skudchad@skudchad_test2_win_opencl on 2017/09/25 14:13:13
SWDEV-95919 - Expose counters by instances and not number of counters. Also expose EA and RMI instances.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13502/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLPerfCounters.cpp#41 edit


[ROCm/clr commit: db63366bcb]
2017-09-25 14:25:11 -04:00
foreman 54eb06541d P4 to Git Change 1462490 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/23 03:00:04
SWDEV-2 - Change OpenCL version number from 2504 to 2505.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2252 edit


[ROCm/clr commit: 4460abcd9c]
2017-09-23 03:13:36 -04:00
foreman 083b8ee0b4 P4 to Git Change 1462147 by asalmanp@asalmanp-ocl-stg on 2017/09/22 10:49:57
SWDEV-132899 - [OCL][GFX10] Add support for GFX10

	Adjusting WaveFrontSize for Null Devices based on the gfxip (the WaveFrontSize is 32 for gfxip10)

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13486/
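
	The wavefront-size rule above can be sketched as follows. This is a minimal illustration only: the function name is hypothetical, and the fallback of 64 for other gfxip versions is an assumption that the commit message does not state.

```python
def null_device_wavefront_size(gfxip_major):
    """Return the wavefront size for a null (offline) device.

    The commit message states 32 for gfxip10. The value 64 for every
    other gfxip version is an assumption made for this sketch.
    """
    return 32 if gfxip_major == 10 else 64
```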

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#37 edit


[ROCm/clr commit: cb4585939d]
2017-09-22 11:05:41 -04:00
foreman 7e076a69ae P4 to Git Change 1461998 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/22 03:00:06
SWDEV-2 - Change OpenCL version number from 2503 to 2504.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2251 edit


[ROCm/clr commit: b236524590]
2017-09-22 03:15:45 -04:00
foreman e592f3ce2d P4 to Git Change 1461559 by akasauro@akasauro_fiji1_opencl_rocm on 2017/09/21 10:00:24
SWDEV-118564 - [OCL-LC-ROCm] Revert deletion of "-x cl" related comments

	Review: http://ocltc.amd.com/reviews/r/13454/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcompiler.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#37 edit


[ROCm/clr commit: 92570dd4cd]
2017-09-21 10:11:44 -04:00
foreman 75c42c1ed6 P4 to Git Change 1461455 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/21 03:00:48
SWDEV-2 - Change OpenCL version number from 2502 to 2503.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2250 edit


[ROCm/clr commit: c7dbd490b0]
2017-09-21 03:14:06 -04:00
foreman acb23488de P4 to Git Change 1461118 by gandryey@gera-w8 on 2017/09/20 12:04:57
SWDEV-86035 - Switch back to 8 CBs due to HW hangs with HWSC on VI.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#32 edit


[ROCm/clr commit: e53571d05b]
2017-09-20 12:18:30 -04:00
foreman 6576f84861 P4 to Git Change 1461094 by akasauro@akasauro_fiji1_opencl_rocm on 2017/09/20 11:23:36
SWDEV-118564 - [OCL-LC-ROCm] Remove options, which have already been moved to AMDGPUToolChain by https://reviews.llvm.org/rL312524

	In order to have a similar set of options for online and offline compilation, a mechanism for setting default options in AMDGPUToolChain was implemented by https://reviews.llvm.org/rL312524. That commit also sets two default options in AMDGPUToolChain: -m64 and -O3 (the latter is set only if there is no -O{N} option in the args). The commit has already reached amd-common.

	The current change relates to LC only and removes the setting of -m64 from compileImpl_LC(), as it is set later in TranslateArgs(). For online compilation, -O{N} is set as before by RT and stays unchanged in AMDGPUToolChain; for offline compilation, it is set to -O3 by TranslateArgs() if no -O{N} is passed through args.

	Also remove comments regarding "-x cl" as it is now correctly set in OpenCL driver.

	Review: http://ocltc.amd.com/reviews/r/13454/
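
	The default-option rule described above (add -O3 only when no -O{N} is already present) can be sketched roughly as below. This is not the AMDGPUToolChain or TranslateArgs() code; the helper name and the exact set of recognized -O spellings are assumptions for illustration.

```python
import re

# Hypothetical helper: mirrors the rule from the commit message only.
# The accepted -O spellings below are an assumption of this sketch.
_OPT_FLAG = re.compile(r"^-O(\d|s|z|fast|g)?$")

def apply_default_opt_level(args, default="-O3"):
    """Return a new argument list, appending `default` if no -O{N} flag is present."""
    if any(_OPT_FLAG.match(arg) for arg in args):
        return list(args)  # the caller already chose an optimization level
    return list(args) + [default]
```

	For example, apply_default_opt_level(["-m64"]) appends -O3, while an argument list that already contains -O0 is returned unchanged.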

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcompiler.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#36 edit


[ROCm/clr commit: a348a08391]
2017-09-20 11:38:12 -04:00
foreman 8f5426ddef P4 to Git Change 1460945 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/20 03:00:04
SWDEV-2 - Change OpenCL version number from 2501 to 2502.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2249 edit


[ROCm/clr commit: 5b85bc981c]
2017-09-20 03:13:12 -04:00
foreman d08042a472 P4 to Git Change 1460659 by skudchad@skudchad_test2_win_opencl on 2017/09/19 14:21:48
SWDEV-95919 - Expose all MCVML2 counters for gfx9 on PAL OCL backend.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLPerfCounters.cpp#40 edit


[ROCm/clr commit: f42b464126]
2017-09-19 15:15:04 -04:00
foreman e36ef13c43 P4 to Git Change 1460349 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/19 03:00:07
SWDEV-2 - Change OpenCL version number from 2500 to 2501.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2248 edit


[ROCm/clr commit: 61de12fb47]
2017-09-19 03:15:02 -04:00
foreman f398248c2a P4 to Git Change 1460106 by wchau@wchau_OCL_boltzmann on 2017/09/18 15:29:29
SWDEV-120036 - Supporting the cl_amd_device_attribute_query on the ROC device - Back out changelist 1459984
	- not all device attributes are supported, will re-submit the changes when every attribute is supported.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#64 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#26 edit


[ROCm/clr commit: fb18f128c9]
2017-09-18 17:12:05 -04:00
foreman cfa750bda5 P4 to Git Change 1460028 by wchau@wchau_OCL_boltzmann on 2017/09/18 14:03:40
SWDEV-111439 - Add query for preferred constant size
	- fixed a mistake of using 64KiB for the size, which should be 16KiB.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#576 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#63 edit


[ROCm/clr commit: a851cc152c]
2017-09-18 14:15:44 -04:00
foreman 8ddf97e634 P4 to Git Change 1459984 by wchau@wchau_OCL_boltzmann on 2017/09/18 12:13:12
SWDEV-120036 - Supporting the cl_amd_device_attribute_query on the ROC device

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#25 edit


[ROCm/clr commit: d540aab99d]
2017-09-18 12:28:52 -04:00
foreman 610b9840e8 P4 to Git Change 1459702 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/16 03:00:05
SWDEV-2 - Change OpenCL version number from 2499 to 2500.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2247 edit


[ROCm/clr commit: f8bc731619]
2017-09-16 03:14:05 -04:00
foreman da71d0e518 P4 to Git Change 1459235 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/15 03:00:04
SWDEV-2 - Change OpenCL version number from 2498 to 2499.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2246 edit


[ROCm/clr commit: 040f46d584]
2017-09-15 03:13:25 -04:00
foreman 9021c30aad P4 to Git Change 1459027 by wchau@wchau_OCL_boltzmann on 2017/09/14 15:39:04
SWDEV-111439 - Add query for preferred constant size

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#67 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl_ext.h#18 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl_ext.h#34 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl_ext.h#11 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl_ext.h#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#291 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#575 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#61 edit


[ROCm/clr commit: b44fd81c76]
2017-09-14 16:26:32 -04:00
foreman a12776ba65 P4 to Git Change 1458879 by gandryey@gera-w8 on 2017/09/14 11:41:44
SWDEV-129129 - [CQE OCL][Vega vs Fiji] Up to 12% performance drop observed on VEGA10 compared to FIJI while running BlackMagic Davinci Resolve
	More benchmark tuning:
	- Keep system memory locked in the resource cache. That removes a huge number of lock/unlock calls to the OS due to resource creation and destruction
	- Reduce the command buffer size to 256 commands and increase the number of CBs to 16
	- Increase the amount of resident resources to 2048

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#574 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#31 edit


[ROCm/clr commit: 4066449a8b]
2017-09-14 11:58:52 -04:00
foreman 80971280eb P4 to Git Change 1458724 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/14 03:00:06
SWDEV-2 - Change OpenCL version number from 2497 to 2498.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2245 edit


[ROCm/clr commit: 44b7cfefaf]
2017-09-14 03:14:53 -04:00
foreman fe289ba112 P4 to Git Change 1458387 by rili@rili-new-stg-pal-opencl on 2017/09/13 13:59:02
SWDEV-131925 - Initialize virtual queue
	Code Review: http://ocltc.amd.com/reviews/r/13436/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#57 edit


[ROCm/clr commit: 8b2c819023]
2017-09-13 14:08:50 -04:00
foreman 8ef3235c5a P4 to Git Change 1458325 by gandryey@gera-w8 on 2017/09/13 11:33:32
SWDEV-132238 - [CQE OCL][Vega10][DTB-Blocker][QR] 'Allocation (Single)' test of WF Conformance is failing; Faulty CL# 1451444
	- Disable reporting extra HBCC memory by default. Reporting extra memory can be reenabled with GPU_ADD_HBCC_SIZE=1
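
	A minimal sketch of the opt-in behavior described above, assuming the flag is read from an environment variable named GPU_ADD_HBCC_SIZE. The function and parameter names are hypothetical and do not come from the runtime.

```python
import os

def reported_memory_bytes(local_bytes, hbcc_extra_bytes, env=os.environ):
    """Report extra HBCC memory only when GPU_ADD_HBCC_SIZE=1 is set.

    Assumption of this sketch: the flag is an environment variable and
    any value other than "1" leaves the extra memory unreported.
    """
    add_hbcc = env.get("GPU_ADD_HBCC_SIZE", "0") == "1"
    return local_bytes + (hbcc_extra_bytes if add_hbcc else 0)
```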

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#60 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#280 edit


[ROCm/clr commit: 068bf554fb]
2017-09-13 11:47:55 -04:00
foreman b8032a52ef P4 to Git Change 1458204 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/13 03:00:04
SWDEV-2 - Change OpenCL version number from 2496 to 2497.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2244 edit


[ROCm/clr commit: d0cd65755a]
2017-09-13 03:12:50 -04:00
foreman 1e875380b6 P4 to Git Change 1458035 by wchau@wchau_OCL_boltzmann on 2017/09/12 16:31:24
SWDEV-130808 - set the local sizes to preferredWorkGroupSize_ when clEnqueueNDRange is not given and the kernel does not have required workgroup sizes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#320 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#411 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#43 edit


[ROCm/clr commit: 8aef16e13c]
2017-09-12 17:13:01 -04:00
foreman 84be665e67 P4 to Git Change 1458033 by asalmanp@asalmanp-ocl-stg on 2017/09/12 16:29:49
SWDEV-130722 - Channel order in an interop buffer from OpenCL to OpenGL is flipped on Vega

	Follow up for CL#1456230. Adding a new table that maps the OGL surface formats (hData.format) returned by wglResourceAttachAMD function into the OCL image format. The hData.format is the internal image surface format created for an interop by OGL and should be used by OCL for cl_gl interop.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13421/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevicegl.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#36 edit


[ROCm/clr commit: 23f12d5ea4]
2017-09-12 17:07:18 -04:00
foreman 3831725584 P4 to Git Change 1458019 by wchau@wchau_WIN_OCL_HSA on 2017/09/12 16:12:22
SWDEV-122517 - DVR toolbar and timer are corrupted when recording in fullscreen with portrait oriented monitors using Eyefinity.
	Fixed by obtaining the rotation information from the OGL driver and setting the displayable attribute accordingly. (For OCL RT changes)
	- fix the type casting issue that causes build error

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#31 edit


[ROCm/clr commit: a88ad35556]
2017-09-12 16:56:44 -04:00
foreman 0a0a0dc374 P4 to Git Change 1457982 by wchau@wchau_WIN_OCL_HSA on 2017/09/12 14:56:38
SWDEV-122517 - DVR toolbar and timer are corrupted when recording in fullscreen with portrait oriented monitors using Eyefinity.
	Fixed by obtaining the rotation information from the OGL driver and setting the displayable attribute accordingly. (For OCL RT changes)

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#30 edit


[ROCm/clr commit: 3373a1ef2f]
2017-09-12 15:39:29 -04:00
foreman 39d9b2c81f P4 to Git Change 1457957 by asalmanp@asalmanp-ocl-stg on 2017/09/12 14:27:56
SWDEV-79278 - [OCL] Don't add gfx804 device into the offline device list twice.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13432/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#573 edit


[ROCm/clr commit: 7efd5f7f98]
2017-09-12 14:42:44 -04:00
foreman 77dd50cb45 P4 to Git Change 1457669 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/12 03:00:05
SWDEV-2 - Change OpenCL version number from 2495 to 2496.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2243 edit


[ROCm/clr commit: 1ff1a9a9c7]
2017-09-12 03:16:03 -04:00
foreman 906abbac5d P4 to Git Change 1457559 by lmoriche@lmoriche_opencl_dev2 on 2017/09/11 18:05:30
SWDEV-130808 - Add support of two new queries: CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_AMD, CL_DEVICE_MAX_WORK_GROUP_SIZE_AMD.
	- Restore the original behavior when setting GPU_MAX_WORKGROUP_SIZE.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#356 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#279 edit


[ROCm/clr commit: c870a30523]
2017-09-11 18:19:56 -04:00
foreman 8f2a3ebf44 P4 to Git Change 1457530 by wchau@wchau_WIN_OCL_HSA on 2017/09/11 17:14:57
SWDEV-126884 - OCL 2.1 Platform APIs

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.def.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.map#4 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.map.in#19 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_command.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_context.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd_amd.h#20 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_svm.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl.h#5 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl.h#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#86 edit


[ROCm/clr commit: 5199f1bc48]
2017-09-11 17:58:22 -04:00
foreman 4a95a92c0c P4 to Git Change 1457038 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/09 03:00:05
SWDEV-2 - Change OpenCL version number from 2494 to 2495.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2242 edit


[ROCm/clr commit: 5f7750e9b1]
2017-09-09 03:13:29 -04:00
foreman 37d6bf11f4 P4 to Git Change 1456988 by gandryey@gera-w8 on 2017/09/08 17:48:25
SWDEV-129129 - [CQE OCL][Vega vs Fiji] Up to 12% performance drop observed on VEGA10 compared to FIJI while running BlackMagic Davinci Resolve
	- Force tiny read_only buffers into USWC memory. That will avoid expensive tiny data uploads, which occur every frame.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#278 edit


[ROCm/clr commit: caa25fc792]
2017-09-08 18:03:23 -04:00
foreman 6cf8497f45 P4 to Git Change 1456841 by wchau@wchau_OCL_boltzmann on 2017/09/08 13:45:27
SWDEV-130808 - Add support of two new queries: CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_AMD, CL_DEVICE_MAX_WORK_GROUP_SIZE_AMD.
	- Initialize the "preferredWorkGroupSize_" for CPU device so that CL_MAX_WORK_GROUP_SIZE correctly reports CPU_MAX_WORKGROUP_SIZE.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#281 edit


[ROCm/clr commit: 00e913da6d]
2017-09-08 13:58:47 -04:00
foreman 7ebb63648b P4 to Git Change 1456735 by wchau@wchau_OCL_boltzmann on 2017/09/08 11:04:08
SWDEV-130808 - Resubmit the CL#1456363 after finding the AMD SDK 2.9.1 Apps test failure is an app issue.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#66 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl_ext.h#17 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl_ext.h#33 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl_ext.h#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl_ext.h#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#290 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#572 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#355 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#101 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#60 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#277 edit


[ROCm/clr commit: 703f2f6900]
2017-09-08 11:17:38 -04:00
foreman e63f8c4885 P4 to Git Change 1456589 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/08 03:00:04
SWDEV-2 - Change OpenCL version number from 2493 to 2494.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2241 edit


[ROCm/clr commit: cd4c8a168b]
2017-09-08 03:20:14 -04:00
foreman d2804df5c4 P4 to Git Change 1456420 by wchau@wchau_OCL_boltzmann on 2017/09/07 16:56:26
SWDEV-130808 - Back out changelist 1456363 as it causes AMD SDK 2.9.1 Apps test failure when running OpenCL Sanity tests with Brahma build.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#65 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl_ext.h#16 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl_ext.h#32 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl_ext.h#9 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl_ext.h#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#289 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#571 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#354 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#100 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#57 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#276 edit


[ROCm/clr commit: 946984cba0]
2017-09-07 17:12:29 -04:00
foreman a1d2208873 P4 to Git Change 1456409 by skudchad@skudchad_test_win_opencl2 on 2017/09/07 16:42:55
SWDEV-130305 - For Vega CF configuration on specific chipset (AMD Ryzen 7 1800X) slave ASIC comes out of BACO when ReLive is enabled
	- Finalize() in PAL shouldn't be called during enumeration. This creates a paging queue in WDDM, which causes the second GPU to come out of BACO. Move Finalize to initializeHeapResources.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13410/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#56 edit


[ROCm/clr commit: 881613438a]
2017-09-07 17:02:22 -04:00
foreman 2d8634d5c0 P4 to Git Change 1456363 by wchau@wchau_OCL_boltzmann on 2017/09/07 15:47:03
SWDEV-130808 - Add support of two new queries: CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_AMD, CL_DEVICE_MAX_WORK_GROUP_SIZE_AMD.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#64 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl1.2/CL/cl_ext.h#15 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl_ext.h#31 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.1/CL/cl_ext.h#8 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.2/CL/cl_ext.h#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#288 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#570 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#353 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#99 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#275 edit


[ROCm/clr commit: 0b90f3418f]
2017-09-07 16:00:01 -04:00
foreman 85beaf8888 P4 to Git Change 1456230 by asalmanp@asalmanp-ocl-stg on 2017/09/07 12:00:21
SWDEV-130722 - Channel order in an interop buffer from OpenCL to OpenGL is flipped on Vega

	OCL calls the glGetTexLevelParameteriv_ function to get the internal GL format, but this format is the one chosen by the app in an OGL API such as glTexImage2D.

	The issue is that OGL sometimes selects a different format than the one defined in glTexImage2D, and this causes some issues in cl_gl interop. One example is shown below:

	glTexImage2D(GL_TEXTURE_2D, 0, GL_RGBA/**internal format**/, width, height, 0, GL_BGRA/**external format**/, GL_UNSIGNED_BYTE, NULL);

	In this case GL_RGBA is selected by the app as the internal format, but OGL switches to BGRA8 internally and causes an issue later in cl_gl interop (i.e., R and B channels are swapped) because OCL gets GL_RGBA as the internal format in the glGetTexLevelParameteriv_ call.

	To avoid this issue, OCL needs to query the real internal GL format in wglResourceAttachAMD and adjust the CL format accordingly.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/13408/
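
	The fix described above amounts to preferring the format the GL driver actually allocated over the one the app requested. A minimal sketch of that selection follows; the format names, the lookup table, and the function name are all hypothetical and are not the runtime's real tables.

```python
# Hypothetical format names and mapping table, for illustration only.
CL_ORDER_BY_GL_INTERNAL = {
    "RGBA8": "CL_RGBA",
    "BGRA8": "CL_BGRA",
}

def cl_channel_order(requested_format, driver_format=None):
    """Pick the CL channel order for a GL interop texture.

    Prefer the format reported by the driver (the real internal format);
    fall back to the app-requested format when the driver reports none.
    """
    actual = driver_format if driver_format is not None else requested_format
    return CL_ORDER_BY_GL_INTERNAL[actual]
```

	With the app requesting RGBA8 and the driver reporting BGRA8, the sketch returns CL_BGRA, which avoids the swapped R and B channels.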

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevicegl.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#35 edit


[ROCm/clr commit: e8395888c5]
2017-09-07 12:15:41 -04:00
foreman 25cf7df1be P4 to Git Change 1456058 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/07 03:00:04
SWDEV-2 - Change OpenCL version number from 2492 to 2493.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2240 edit


[ROCm/clr commit: a90c0025a4]
2017-09-07 03:15:00 -04:00
foreman deab032014 P4 to Git Change 1455765 by gandryey@gera-w8 on 2017/09/06 11:57:19
SWDEV-131497 - [CQE OCL][Vega10][OclTst][QR][DTB-Blocker] 'Spir' test of OCLTST is crashing randomly 3/10 times; Faulty CL# 1451293
	- The test doesn't release command queues, which may cause a crash on the device destruction. Force the app's queue destruction if the app didn't release them.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#54 edit


[ROCm/clr commit: d3d97c5010]
2017-09-06 12:08:13 -04:00
foreman 8a1db4ea7c P4 to Git Change 1455573 by johtaylo@johtaylo-jtincrementor-increment on 2017/09/06 03:00:05
SWDEV-2 - Change OpenCL version number from 2491 to 2492.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2239 edit


[ROCm/clr commit: b71ee777ff]
2017-09-06 03:18:19 -04:00
foreman 4394baef5b P4 to Git Change 1455464 by asalmanp@asalmanp-ocl-stg on 2017/09/05 18:07:43
SWDEV-79278 - [OCL][PAL] refactoring PAL Null device create function to account for creating all the gfx9+ subtarget devices such as gfx901/gfx902/etc

	ReviewboardURL = http://ocltc.amd.com/reviews/r/13378/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#18 edit


[ROCm/clr commit: 9772217dcb]
2017-09-05 18:22:55 -04:00
foreman c3bff7dc35 P4 to Git Change 1455453 by gandryey@gera-w8 on 2017/09/05 17:49:48
SWDEV-131493 - [CQE OCL][Vega10][QR][DTB-Blocker] Soft Hang is observed while running 'Mipmaps-clCopyImage' tests of WF Conformance due to Faulty CL# 1451293
	Multiple runtime locks could conflict with each other:
	- Remove PAL lock from the resource creation/destruction. PAL should be thread safe for those operations.
	- Avoid queue execution lock for a mipmap view destruction in submitUnmapMemory

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#55 edit


[ROCm/clr commit: 6b103f1bf6]
2017-09-05 18:03:48 -04:00
foreman f2f2f2c457 P4 to Git Change 1455369 by asalmanp@asalmanp-ocl-stg on 2017/09/05 16:02:34
SWDEV-79278 - [OCL] Don't add gfx9+ devices into the offline devices list in the orca path, as they will be added in pal.
	ReviewboardURL = http://ocltc.amd.com/reviews/r/13396/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#569 edit


[ROCm/clr commit: 121ffcc6ec]
2017-09-05 16:19:02 -04:00