Граф коммитов

56 Коммитов

Автор SHA1 Сообщение Дата
foreman bd3af873ec P4 to Git Change 1732253 by asalmanp@asalmanp-ocl-stg on 2019/01/17 21:45:23
SWDEV-132899 - [OCL][GFX10] 70 subtests of Conformance Mipmaps (clCopyImage) test failed for image type 1Darray
	This is the follow up for CL#1517501

	copyImage1DA blit kernel uses image2d_array_t type for src/dst images. On gx10, num of arrays/layers is expected in Z component for a 2Darray image so a swap is required for 1Darray images when we use 2Darray image for the image copy. The copyImage1DA has code for swapping z and y components as follows:

	if (srcOrigin.w != 0) {
	coordsSrc.z = coordsSrc.y;
	coordsSrc.y = 0;
	}
	if (dstOrigin.w != 0) {
	coordsDst.z = coordsDst.y;
	coordsDst.y = 0;
	}

	So to use this path force the w component to 1 for src and dst images on gfx10 if image type is 1Darray.

	ReviewRequestURL = http://ocltc.amd.com/reviews/r/16538/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#28 edit
2019-01-17 22:11:38 -05:00
foreman eed47ec8b3 P4 to Git Change 1706391 by gandryey@gera-w8 on 2018/11/13 15:31:20
SWDEV-79445 - OCL generic changes and code clean-up
	- Remove mapping of some internal CL formats in PAL backend, since it shouldn't need them anymore.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#43 edit
2018-11-13 15:39:48 -05:00
foreman 45d50bb738 P4 to Git Change 1599701 by gandryey@gera-w8 on 2018/08/29 18:48:36
SWDEV-79445 - OCL generic changes and code clean-up
	- Use SDMA staging transfers for data upload if pinning fails. Fixes HIP failure in a test that uses the code segment data for uppload.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#26 edit
2018-08-29 18:57:40 -04:00
foreman f6d8feabeb P4 to Git Change 1568521 by gandryey@gera-w8 on 2018/06/14 17:43:52
SWDEV-79445 - OCL generic changes and code clean-up
	- Change LDS setup to account the size, since LC forces 4 bytes for LDS offsets always

	http://ocltc.amd.com/reviews/r/15197/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#49 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#130 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#327 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#109 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#38 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#88 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#25 edit
2018-06-14 17:51:35 -04:00
foreman 1be400ff01 P4 to Git Change 1567428 by gandryey@gera-w8 on 2018/06/12 18:39:23
SWDEV-79445 - OCL generic changes and code clean-up
	- Optimize setup of kernel arguments. Stage 2.
	- Add HW ABI support in the abstraction layer
	- Remove arguments parsing loop from the kernel launch. Memory processing will be responsible for dependency tracking and  patching of arguments.

	http://ocltc.amd.com/reviews/r/15122/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#221 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#307 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#325 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#107 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#53 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#95 edit
2018-06-12 18:57:20 -04:00
foreman 2176dc3b19 P4 to Git Change 1558704 by gandryey@gera-w8 on 2018/05/23 17:20:01
SWDEV-79445 - OCL generic changes and code clean-up
	- ABI clean-up. Stage 1: Separate kernel arguments and OCL objects. OCL objects will be passed in the new arrays of mem objects, samplers and device queue objects. The kernel arguments will contain GPU virtual addresses.

	http://ocltc.amd.com/reviews/r/14881/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#48 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_svm.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#302 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#129 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#323 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#417 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#50 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#97 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#20 edit
2018-05-23 17:24:32 -04:00
foreman c11b2d52b7 P4 to Git Change 1552704 by gandryey@gera-w8 on 2018/05/09 15:11:23
SWDEV-79445 - OCL generic changes and code clean-up
	- Following CL#1552596. Make sure virtual GPU is set for the internal allocations before the create() call, since the deferred alloc is disabled.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#416 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#144 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#96 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#21 edit
2018-05-09 15:16:40 -04:00
foreman 79ba5904dc P4 to Git Change 1546657 by gandryey@gera-w8 on 2018/04/26 10:59:34
SWDEV-151739 - [CQE OCL][DTB][Perf][QR][DTB-BLOCKER][VEGA10] Upto 18% performance drop observed while running Video Composition test sub test of Compubench due to faulty CL#1544622
	- Implement customized TS tracking for managed buffers. The common TS tracking mechanism saves the event of the last command, assuming SDMA and compute operations occur in order, but for managed buffers it's not the case. Also managed buffer doesn't have to validate TS for the parent resource.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#22 edit
2018-04-26 11:13:29 -04:00
foreman 392724cc3f P4 to Git Change 1544622 by gandryey@gera-w8 on 2018/04/20 17:02:52
SWDEV-79445 - OCL generic changes and code clean-up
	- Add managed buffer support and replace all uploads with the managed buffer allocations
	- Add staging copy for small image writes
	- Replace constant buffer in FillBuffer with a managed buffer also

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#84 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#92 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#48 edit
2018-04-20 17:08:29 -04:00
foreman e7fb98dd94 P4 to Git Change 1535454 by todli@todli-win-opencl-kv1 on 2018/04/02 11:12:16
SWDEV-133452 - 1.Add HostMemoryDirectAccess flag to SDI ExternalPhysical memory
	2.Disable host writeBuffer for SDI ExternalPhysical memory
	3.Support marker offset for SDI WriteMarker in Runtime

	http://ocltc.amd.com/reviews/r/14518/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#81 edit
2018-04-02 11:28:59 -04:00
foreman ace31f6a11 P4 to Git Change 1530988 by gandryey@gera-w8 on 2018/03/22 17:50:10
SWDEV-79445 - OCL generic changes and code clean-up
	- Remove renames support from the Resource object.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#54 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#47 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#17 edit
2018-03-22 17:58:21 -04:00
foreman e0f4740fc4 P4 to Git Change 1517501 by asalmanp@asalmanp-ocl-stg on 2018/02/20 13:54:28
SWDEV-132899 - [OCL][GFX10] OCLCreateImage[3] fails for the image type 1Darray
	Issue: FillImage blit kernel is not working properly on gfx10 if the image type is 1Darray (i.e., it only fills the first slice/layer and ignores the rest of the layers when number of layers >1)
	Root cause: gfx10 HW expects the number of layers in Z component
	Fix: To fix this issue we swap the Y and Z components if the image type is 1Darray for gfx10+ in image blit kernels.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/14281/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#14 edit
2018-02-20 14:15:36 -05:00
foreman 8e17a1e6b4 P4 to Git Change 1501728 by gandryey@gera-w8 on 2018/01/09 15:35:03
SWDEV-79445 - OCL generic changes and code clean-up
	- Code style clean-up. No functional changes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#71 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#43 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#70 edit
2018-01-09 15:43:43 -05:00
foreman e5ee74a086 P4 to Git Change 1501660 by gandryey@gera-w8 on 2018/01/09 14:04:56
SWDEV-79445 - OCL generic changes and code clean-up
	- Code style clean-up. No functional changes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#164 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#321 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#236 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#412 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#142 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcompiler.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#70 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevicegl.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#45 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#69 edit
2018-01-09 14:09:12 -05:00
foreman b82be1113f P4 to Git Change 1451293 by gandryey@gera-w8 on 2017/08/24 13:37:00
SWDEV-129129 - [[CQE OCL][Vega vs Fiji] Upto 12% Performance drop observed on VEGA10 compared to FIJI while running BlackMagic Davinci Resolve
	The app creates/destroys hundred resources each frame. PAL path was removing the destroyed resources from the resident list, although the resource was kept in the cache. This change does the follwoing:
	- Switch TS tracking from a map in VirtualGPU to resource
	- Don't remove references until the actual memory destruction
	- Add a residency threshold to avoid OS resident/eviction calls

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#50 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#52 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#28 edit
2017-08-24 13:51:55 -04:00
foreman 465c1c0287 P4 to Git Change 1398097 by lmoriche@lmoriche_opencl_dev2 on 2017/04/13 13:01:56
SWDEV-102733 - [OCL-LC-ROCm] Cmake build Write CMakeLists.txt to enable building with and without the DK environment
	- Change the coding convention of the runtime files. Use Google's Style (https://google.github.io/styleguide/cppguide.html).

Affected files ...

... //depot/stg/opencl/drivers/opencl/.clang-format#1 add
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_agent_amd.h#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_command.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_context.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_counter.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_d3d10.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_d3d11.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_d3d9.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_debugger_amd.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_debugger_amd.h#7 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_event.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_gl.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd_amd.h#18 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_kernel.h#24 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_kernel_info_amd.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_kernel_info_amd.h#4 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_lqdflash_amd.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_lqdflash_amd.h#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#81 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_object.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_pipe.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_platform_amd.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_platform_amd.h#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_profile_amd.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_profile_amd.h#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#41 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_sampler.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_sdi_amd.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_sdi_amd.h#2 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_semaphore_amd.h#3 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_svm.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_thread_trace_amd.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_thread_trace_amd.h#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/appprofile.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/appprofile.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blitcl.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpubinary.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpubinary.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpubuiltins.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpubuiltins.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpucommand.cpp#66 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpucommand.hpp#40 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#280 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#96 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpufeat.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpukernel.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpumapping.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpumapping.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuprogram.cpp#70 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuprogram.hpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpusettings.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpusettings.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cputables.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuvirtual.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuvirtual.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#209 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#284 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpubinary.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpubinary.hpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#126 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#41 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#156 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuconstbuf.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuconstbuf.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucounters.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucounters.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudebugger.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudebugmanager.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudebugmanager.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#147 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#567 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#163 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#318 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#126 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#131 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#50 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#232 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#69 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#238 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.hpp#87 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#350 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#98 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputhreadtrace.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputhreadtrace.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputimestamp.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputimestamp.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gputrap.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#410 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#140 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hwdebug.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hwdebug.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palappprofile.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palappprofile.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palbinary.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palbinary.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcompiler.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palconstbuf.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldebugger.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldebugmanager.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldebugmanager.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#45 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldeviced3d10.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldeviced3d11.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldeviced3d9.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevicegl.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsched.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palschedcl.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palthreadtrace.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palthreadtrace.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paltimestamp.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paltimestamp.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paltrap.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palwavelimiter.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palwavelimiter.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/mesa_glinterop.h#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocappprofile.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocappprofile.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocbinary.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompilerlib.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompilerlib.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdefs.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocglinterop.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocglinterop.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprintf.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprintf.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#64 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocregisters.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocsettings.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/alloc.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/alloc.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os.hpp#30 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os_posix.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os_win32.cpp#47 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/agent.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/agent.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#78 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#83 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.hpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/context.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/context.hpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/counter.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/interop.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.hpp#100 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/ndrange.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/ndrange.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/object.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/object.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/perfctr.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.hpp#41 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/runtime.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/runtime.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/sampler.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/threadtrace.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/atomic.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/monitor.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/monitor.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/semaphore.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/semaphore.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/thread.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/thread.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/top.hpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/concurrent.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/debug.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/debug.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#271 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/macros.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/util.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2150 edit
2017-04-13 13:56:38 -04:00
foreman 4d659dfd37 P4 to Git Change 1344768 by skudchad@skudchad_test_win_opencl2 on 2016/11/22 14:31:30
SWDEV-107226 - [SDI] SDISpeedTest Corruption for OCL GPU to SDI RGBA
	- Single step copy using SDMA to remote SDI buffer seems to be causing corruption. This fix is a workaround to do transfer via a staging buffer and seems to be fixing corruption. The issue is under investigation

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11882/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#125 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#12 edit
2016-11-22 14:43:24 -05:00
foreman 43f8188d59 P4 to Git Change 1329979 by gandryey@gera-w8 on 2016/10/21 13:26:53
SWDEV-86035 - Add PAL backend to OpenCL
	- Allow device memory usage for blit kernels

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palkernel.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#32 edit
2016-10-21 13:31:50 -04:00
foreman b750057405 P4 to Git Change 1311385 by gandryey@gera-w8 on 2016/09/06 16:51:05
SWDEV-101448 - [CQE OCL][Brahma][PERF][QR] ~21% perf drop is observed with lulesh-cl subtest of ComputeApps tests : Faulty CL # 1306133
	- Use the logic for transfer size before CL#1306133

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#124 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#10 edit
2016-09-06 17:00:06 -04:00
foreman 45eedbec7c P4 to Git Change 1310046 by gandryey@gera-w8 on 2016/09/01 18:43:53
SWDEV-101621 - [CQE OCL][OpenCL on PAL] 6 WF Conformance tests are failing
	- Fix fillimage with DEPTH16 format

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#9 edit
2016-09-01 18:50:31 -04:00
foreman 57043d662d P4 to Git Change 1309866 by gandryey@gera-w8 on 2016/09/01 13:50:12
SWDEV-79445 - OCL generic changes and code clean-up
	- Improve image fill performance with multiple writes in a single thread. The current split has 3 regions

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#123 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#40 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#4 edit
2016-09-01 14:01:08 -04:00
foreman cd7727d007 P4 to Git Change 1308294 by gandryey@gera-w8 on 2016/08/29 18:22:03
SWDEV-101206 - [CQE OCL][Perf][G][QR] Upto ~9% Performance drop observed while running Video Composition subtest of Compubench; Faulty CL#1306133
	- Use the original logic without DMA flush. Flush on staging write helps with a blocking op only, but currently VDI doesn't have that information.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#122 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#7 edit
2016-08-29 18:31:20 -04:00
foreman 862e3a1a79 P4 to Git Change 1306133 by gandryey@gera-w8 on 2016/08/23 14:00:09
SWDEV-79445 - OCL generic changes and code clean-up
	- Update staging copy path with a flush so CPU copy and SDMA transfer could run asynchronously.
	- Tune chunk size for transfers

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#121 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#6 edit
2016-08-23 14:12:24 -04:00
foreman c0b7901c61 P4 to Git Change 1288179 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 20:14:24
SWDEV-3 - [X86] Add shuffle mask rescaling helper function. NFCI.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274476 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: e52fac8db47f9fda742974bb51a38341c444a08e

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/X86/X86ISelLowering.cpp#18 edit
2016-07-07 03:55:10 -04:00
foreman 739bdacc65 P4 to Git Change 1288113 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:23:12
SWDEV-3 - AMDGPU: Expand unaligned accesses early

	Due to visit order problems, in the case of an unaligned copy
	the legalized DAG fails to eliminate extra instructions introduced
	by the expansion of both unaligned parts.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274397 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: d4452f8fcf496a2e19c1a1c9792f5f063f4e9703

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/AMDGPU/AMDGPUISelLowering.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/AMDGPU/AMDGPUISelLowering.h#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/cvt_f32_ubyte.ll#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/sext-in-reg-failure-r600.ll#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/sext-in-reg.ll#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/unaligned-load-store.ll#2 edit
2016-07-07 03:52:33 -04:00
foreman e9185dda22 P4 to Git Change 1288315 by jsjodin@jsjodin-git2p4-lld on 2016/07/06 22:35:20
SWDEV-3 - COFF: Switch to new archive writer interface (D21721).

	Differential Revision: http://reviews.llvm.org/D21722

	git-svn-id: https://llvm.org/svn/llvm-project/lld/trunk@274184 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 99e6679883180439d118958c40448b7bd3cef8c6

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lld.git/COFF/DriverUtils.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/lld.git/COFF/Error.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lld.git/COFF/Error.h#2 edit
2016-07-07 03:44:48 -04:00
foreman 89153addab P4 to Git Change 1288111 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:20:21
SWDEV-3 - [msan] Fix __msan_maybe_ for non-standard type sizes.

	Fix incorrect calculation of the type size for __msan_maybe_warning_N
	call that resulted in an invalid (narrowing) zext instruction and
	\"Assertion `castIsValid(op, S, Ty) && \"Invalid cast!\"' failed.\"

	Only happens in very large functions (with more than 3500 MSan
	checks) operating on integer types that are not power-of-two.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274395 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: dcfa1b5241a1d0484ad1a67485329b1c7c13b575

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Instrumentation/MemorySanitizer.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Instrumentation/MemorySanitizer/with-call-type-size.ll#1 add
2016-07-07 03:40:09 -04:00
foreman 476d5fff29 P4 to Git Change 1288161 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:42:25
SWDEV-3 - add vector bool select tests and regenerate checks for scalar bool select tests

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274460 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: b4d7428f8ad572d7f1b9511fd30b01de58a99b57

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/InstCombine/select.ll#2 edit
2016-07-07 03:22:33 -04:00
foreman a4d709447f P4 to Git Change 1288106 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:17:28
SWDEV-3 - AMDGPU: Improve load/store of illegal types.

	There was a combine before to handle the simple copy case.
	Split this into handling loads and stores separately.

	We might want to change how this handles some of the vector
	extloads, since this can result in large code size increases.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274394 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: df587174ebb04a30c8b55aef601121c25af11986

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/AMDGPU/AMDGPUISelLowering.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/AMDGPU/AMDGPUISelLowering.h#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/AMDGPU/SIISelLowering.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/add.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/copy-illegal-type.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/ctpop64.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/cvt_f32_ubyte.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/half.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/insert_vector_elt.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-constant-i16.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-constant-i32.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-constant-i64.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-constant-i8.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-global-f64.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-global-i16.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-global-i32.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-global-i64.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-global-i8.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-local-f32.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-local-i16.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-local-i8.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/load-weird-sizes.ll#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/scalar_to_vector.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/sext-in-reg.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/shift-and-i128-ubfe.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/store-barrier.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/store.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/sub.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/trunc-bitcast-vector.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/AMDGPU/trunc-store.ll#2 edit
2016-07-07 03:19:07 -04:00
foreman cb5fc90bd1 P4 to Git Change 1288152 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:35:43
SWDEV-3 - [PM] Port LoopAccessInfo analysis to new PM

	It is implemented as a LoopAnalysis pass as
	discussed and agreed upon.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274452 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 10b22c88941392b23543cf8c8eb84044e30a3c7b

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/Analysis/LoopAccessAnalysis.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Analysis/LoopAccessAnalysis.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Passes/PassBuilder.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Passes/PassRegistry.def#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/backward-dep-different-types.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/forward-loop-carried.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/forward-loop-independent.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/independent-interleaved.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/memcheck-for-loop-invariant.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/multiple-strides-rt-memory-checks.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/non-wrapping-pointer.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/nullptr.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/number-of-memchecks.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/pointer-with-unknown-bounds.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/resort-to-memchecks-only.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/reverse-memcheck-bounds.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/safe-no-checks.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/safe-with-dep-distance.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/store-to-invariant-check2.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/store-to-invariant-check3.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/stride-access-dependence.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/underlying-objects-1.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/underlying-objects-2.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/unsafe-and-rt-checks.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Analysis/LoopAccessAnalysis/wrapping-pointer-versioning.ll#2 edit
2016-07-07 02:56:39 -04:00
foreman 2a7b7edab9 P4 to Git Change 1288098 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:12:30
SWDEV-3 - [codeview] Don't record UDTs for anonymous structs

	MSVC makes up names for these anonymous structs, but we don't (yet).
	Eventually Clang should use getTypedefNameForAnonDecl() to put some name
	in the debug info, and we can update the test case when that happens.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274391 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 613f19910964eb95a63bd906b0b75d9aa20d9b06

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/CodeGen/AsmPrinter/CodeViewDebug.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/DebugInfo/COFF/udts.ll#2 edit
2016-07-07 02:53:10 -04:00
foreman d77019c542 P4 to Git Change 1288149 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:30:52
SWDEV-3 - [X86][AVX512] Add support for lowering shuffles to VPERMILPD

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274450 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: c032badeb41a5888f9627b4e7ba4d07a9a1653dc

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/X86/X86ISelLowering.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/X86/vector-shuffle-512-v8.ll#3 edit
2016-07-07 02:41:14 -04:00
foreman 162c07fcb4 P4 to Git Change 1288095 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:08:24
SWDEV-3 - IR: Set TargetPrefix for some X86 and AArch64 intrinsics where it was missing

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274390 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: f0a4c116041f7c2aef7796c8b067f0947b69602d

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/IR/IntrinsicsAArch64.td#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/IR/IntrinsicsX86.td#2 edit
2016-07-07 02:37:58 -04:00
foreman f9a2bb53b6 P4 to Git Change 1288218 by jsjodin@jsjodin-git2p4-clang on 2016/07/06 21:28:39
SWDEV-3 - [X86][SSE2] Updated tests to match llvm	est\CodeGen\X86\sse2-intrinsics-fast-isel-x86_64.ll

	git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@274126 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 5f57c65083ce901e984d6456f8a9f1d78b0f1e7f

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/clang.git/test/CodeGen/sse2-builtins.c#2 edit
2016-07-07 02:11:39 -04:00
foreman cb7e7442c8 P4 to Git Change 1288093 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:05:07
SWDEV-3 - Address two correctness issues in LoadStoreVectorizer

	Summary:
	GetBoundryInstruction returns the last instruction as the instruction which follows or end(). Otherwise the last instruction in the boundry set is not being tested by isVectorizable().
	Partially solve reordering of instructions. More extensive solution to follow.

	Reviewers: tstellarAMD, llvm-commits, jlebar

	Subscribers: escha, arsenm, mzolotukhin

	Differential Revision: http://reviews.llvm.org/D21934

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274389 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 1e53a5fcec984e0f1cefe43dba3939e4b72a533f

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/LoadStoreVectorizer/AMDGPU/interleaved-mayalias-store.ll#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/LoadStoreVectorizer/X86/lit.local.cfg#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/LoadStoreVectorizer/X86/preserve-order32.ll#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/LoadStoreVectorizer/X86/preserve-order64.ll#1 add
2016-07-07 02:07:12 -04:00
foreman 719ebdcc13 P4 to Git Change 1288145 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:22:31
SWDEV-3 - [PM] Preparatory cleanups to ArgumentPromotion.

	This pulls some obvious changes out of http://reviews.llvm.org/D21921 to
	minimize the diff.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274445 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 197a7516a32b69da7d1243308cb8eb6c5f29de0c

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/IPO/ArgumentPromotion.cpp#2 edit
2016-07-07 02:04:25 -04:00
foreman 1045e2c965 P4 to Git Change 1288091 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:00:18
SWDEV-3 - [Hexagon] Revert r274381: that was actually wrong

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274384 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 364f2492a3d31efedfec3127f78b0aa6c171bc4b

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/Hexagon/HexagonExpandCondsets.cpp#6 edit
2016-07-07 02:02:02 -04:00
foreman cba6feb4eb P4 to Git Change 1288141 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:16:54
SWDEV-3 - [PM] Fix a small typo from when I ported JumpThreading

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274440 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: ea9886a5909183770b8d0baa9061150adf664b1a

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Scalar/JumpThreading.cpp#2 edit
2016-07-07 01:24:03 -04:00
foreman 2c48978f49 P4 to Git Change 1288089 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 17:56:13
SWDEV-3 - [Hexagon] Use MachineOperand::readsReg instead of isUse

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274381 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 7993722a44b56787f0bf68895501a6f36d2aa8ea

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/Hexagon/HexagonExpandCondsets.cpp#5 edit
2016-07-07 01:20:31 -04:00
foreman d32e5e2c1f P4 to Git Change 1288138 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:11:36
SWDEV-3 - [Hexagon] Create global std::map lazily.

	This could of course be a simple binary search with no global state
	involved at all if someone cares enough. Just don't make everyone
	linking the hexagon backend pay for it on process startup and shutdown.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274437 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: b4e53350f9349677e2a0178bde5b8b0c3b743b5e

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/Hexagon/MCTargetDesc/HexagonMCDuplexInfo.cpp#2 edit
2016-07-07 01:03:13 -04:00
foreman cd9d1dfd35 P4 to Git Change 1288063 by bsumner@bsumner-lin-opencl on 2016/07/06 17:07:15
SWDEV-97663 - optimize cbrt for AMD devices

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/ocml/src/cbrtF.cl#5 edit
... //depot/stg/opencl/drivers/opencl/library/ocml/src/rcbrtF.cl#4 edit
2016-07-07 00:56:14 -04:00
foreman 7e285f4a16 P4 to Git Change 1288136 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:09:01
SWDEV-3 - [X86][AVX512] Add support for lowering shuffles to MOVDDUP/MOVSLDUP/MOVSHDUP

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274436 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 2e1720f4556e47b98353c3a901913d0f1ba54944

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/X86/X86ISelLowering.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/X86/vector-shuffle-512-v16.ll#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/X86/vector-shuffle-512-v8.ll#2 edit
2016-07-07 00:48:12 -04:00
foreman e6f8cce17c P4 to Git Change 1288058 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 16:59:31
SWDEV-3 - CodeGen: Use MachineInstr& in SlotIndexes.cpp, NFC

	Avoid implicit conversions from iterator to pointer by preferring
	MachineInstr& and using range-based for loops.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274354 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: effa4cc200078395a74decd1ae2d1e380c79a2f7

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/CodeGen/SlotIndexes.cpp#2 edit
2016-07-07 00:41:51 -04:00
foreman 352d3839bf P4 to Git Change 1288134 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:06:29
SWDEV-3 - [X86][AVX512] Add test cases that should lower to MOVSLDUP/MOVSHDUP

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274435 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 6c48317f16292a574f41875e2a5c433718c3171f

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/CodeGen/X86/vector-shuffle-512-v16.ll#2 edit
2016-07-07 00:32:57 -04:00
foreman b846d72de1 P4 to Git Change 1288056 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 16:54:18
SWDEV-3 - CodeGen: Use MachineInstr& in RegAllocFast, NFC

	Use MachineInstr& instead of MachineInstr* in RegAllocFast to avoid
	implicit conversions from MachineInstrBundleIterator.  RAFast::spillAll
	and RAFast::spillVirtReg still take iterators, since their argument may
	be an end iterator from MachineBasicBlock::getFirstTerminator.

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274353 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: ce5fdc00e7ed9f05c643b056d0561a8133b5438b

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/CodeGen/RegAllocFast.cpp#2 edit
2016-07-07 00:29:17 -04:00
foreman 219a3c3b4f P4 to Git Change 1288154 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 19:37:08
SWDEV-3 - Fix wrong comment

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274453 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 6f6a939a2e6fbbeb8e8ff4cc808934b6c1f58b39

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/Analysis/LoopAccessAnalysis.h#7 edit
2016-07-07 00:18:38 -04:00
foreman 73ce4dea9f P4 to Git Change 1288053 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 16:51:12
SWDEV-3 - [CMake] Add LLVM_BUILD_32_BITS to LLVMConfig.cmake

	Previously out-of-tree passes could detect if LLVM was built with
	LLVM_BUILD_32_BITS by looking for -m32 in LLVM_DEFINITIONS, but as of r271871
	it no longer appears there. Resolve this by instead emitting LLVM_BUILD_32_BITS
	in LLVMConfig so it can be checked for directly.

	Differential Revision: http://reviews.llvm.org/D21434

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274351 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: e6124112ab41442b4df11207eaf004bb6066c021

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/cmake/modules/LLVMConfig.cmake.in#6 edit
2016-07-07 00:15:12 -04:00
foreman cc1b6af6f6 P4 to Git Change 1288128 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:48:55
SWDEV-3 - [PM] Port ConstantHoisting to the new Pass Manager

	Differential Revision: http://reviews.llvm.org/D21945

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274411 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: 3fae04db5074e0e6d8137d04c9eca3164e7c6967

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/InitializePasses.h#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/include/llvm/Transforms/Scalar/ConstantHoisting.h#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Passes/PassBuilder.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Passes/PassRegistry.def#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Scalar/ConstantHoisting.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Scalar/Scalar.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/test/Transforms/ConstantHoisting/X86/cast-inst.ll#2 edit
2016-07-06 23:52:32 -04:00
foreman 8db67e10da P4 to Git Change 1288051 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 16:46:07
SWDEV-3 - [ARM] Refactor Thumb2 mul instruction descs

	No functional changes. Just created wrapper classes around the 3
	and 4 reg mult and mac instruction classes.

	Differential Revision: http://reviews.llvm.org/D21549

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274347 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: b5755a89959882b64dc9adc3a963b5ba920b392f

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Target/ARM/ARMInstrThumb2.td#2 edit
2016-07-06 23:48:31 -04:00
foreman 7a8d3d1c44 P4 to Git Change 1288126 by jsjodin@jsjodin-git2p4-llvm on 2016/07/06 18:41:40
SWDEV-3 - LoadStoreVectorizer: Fix warning about extra semicolon

	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@274406 91177308-0d34-0410-b5e6-96231b3b80d8

	GitHash: c1be1f58839a87dc774bd1f6d1af3c3e1174ce7b

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm.git/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp#12 edit
2016-07-06 23:42:12 -04:00