커밋 그래프

1473 커밋

작성자 SHA1 메시지 날짜
foreman b89d0bea0f P4 to Git Change 1316123 by gandryey@gera-w8 on 2016/09/19 18:22:55
SWDEV-102286 - [CQE OCL][OpenCL on PAL] 32/64 "DX9" OpenCL SDK Sample is crashing
	- Correct DX9 device binding

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#22 edit


[ROCm/clr commit: b65af32fe1]
2016-09-19 20:13:01 -04:00
foreman 321fc0490a P4 to Git Change 1315824 by asalmanp@asalmanp-opencl-stg on 2016/09/19 12:04:15
SWDEV-102417 - Forum [205433] : Memory leak with printf statement inside kernel code

	A memory leak can occur if a printf statement is inside the .cl source code but it is not used inside the __kernel code (e.g., a function inside .cl code that uses printf but never called by the __kernel). In this case compiler generates the printf metadata but printf is not used by the __kernel (i.e., the printf buffer is empty).

	To fix this issue, release the transfer buffer object before returning false in PrintfDbgHSA::output function.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11394/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.cpp#3 edit


[ROCm/clr commit: 0dd7a03954]
2016-09-19 12:13:23 -04:00
foreman 75f80ffbfb P4 to Git Change 1313945 by wchau@wchau_OCL_boltzmann on 2016/09/13 15:57:45
SWDEV-94605 - [OCL-LC-ROCm] ability to load offline (and online) generated binaries using the HSA code object format

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#203 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#83 edit


[ROCm/clr commit: 0495f895a2]
2016-09-13 16:16:35 -04:00
foreman 89175ce61b P4 to Git Change 1313578 by lmoriche@lmoriche_opencl_dev on 2016/09/13 00:14:36
SWDEV-94610 - Target features are only needed in the CL->IR stage. The attributes remain on the function, so they should not be set again in the IR->ISA stage.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#34 edit


[ROCm/clr commit: a1009a5d11]
2016-09-12 23:23:59 -05:00
foreman 322917f699 P4 to Git Change 1313559 by lmoriche@lmoriche_opencl_dev on 2016/09/12 22:58:58
SWDEV-94610 - Don't use the -cl-denorms-are-zero, but instead set the fp32/fp64 denorms with the target features +fp32-denormals and +fp64-denormals. fp64-denormals is always set, fp32-denormals in only set if device >= gfx900 and -cl-denorms-are-zero is not set.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#33 edit


[ROCm/clr commit: 7239172265]
2016-09-12 23:08:18 -04:00
foreman 73a973b006 P4 to Git Change 1313484 by lmoriche@lmoriche_opencl_dev on 2016/09/12 17:32:10
SWDEV-94610 - Fix the HSAIL Runtime build.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#32 edit


[ROCm/clr commit: 978a25ba95]
2016-09-12 16:43:09 -05:00
foreman e52e05fe56 P4 to Git Change 1313477 by lmoriche@lmoriche_opencl_dev on 2016/09/12 17:22:37
SWDEV-94610 - Split the HSAILProgram::hsailOptions into preprocessorOptions amd codegenOptions. Pass the codegenOptions to the IR->ISA stage (linkImpl).

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#13 edit


[ROCm/clr commit: 64ed800f46]
2016-09-12 17:30:21 -04:00
foreman bb8ebe5ed4 P4 to Git Change 1313439 by lmoriche@lmoriche_opencl_dev on 2016/09/12 16:05:39
SWDEV-94610 - Revert to the HSA 1.0 API for now.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#16 edit


[ROCm/clr commit: fe6ef85918]
2016-09-12 16:12:26 -04:00
foreman 9a0b935753 P4 to Git Change 1313121 by lmoriche@lmoriche_opencl_dev on 2016/09/11 15:21:47
SWDEV-94611 - [OCL-LC-ROCm] Use GFX IP for device name. Set the name to "gfx[M][m][s]" (M:major,m:minor,stepping). Removed the device name strings from the DeviceInfo table. Keep the machineTarget_ field until the compiler is changed to accept gfxip strings.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdefs.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#15 edit


[ROCm/clr commit: c828fa8498]
2016-09-11 15:30:45 -04:00
foreman 120b9b53c7 P4 to Git Change 1313106 by lmoriche@lmoriche_opencl_dev on 2016/09/10 22:38:21
SWDEV-94644 - Run prepare-builtins from the modules build directory, instead of right before generating the include files. Renamed the files to match the opensource build names (except for the .amdgcn suffix). Automatically generate a single include file for all libraries.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/build/Makefile.library#54 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/irif/build/Makefile.irif#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ockl/build/Makefile.ockl#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#10 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ocml/build/Makefile.ocml#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/opencl/build/Makefile.opencl#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#30 edit


[ROCm/clr commit: 8bb15b463b]
2016-09-10 22:46:52 -04:00
foreman 2a99a81d61 P4 to Git Change 1312856 by lmoriche@lmoriche_opencl_dev on 2016/09/09 11:52:59
SWDEV-94610 - Build fix for the HSAIL Runtime Foundation build

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#29 edit


[ROCm/clr commit: 70ef99eb26]
2016-09-09 12:02:35 -04:00
foreman c832965ae1 P4 to Git Change 1312846 by gandryey@gera-w8 on 2016/09/09 11:33:02
SWDEV-86035 - Add PAL backend to OpenCL
	- Add (PAL) suffix to the driver version

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#556 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#21 edit


[ROCm/clr commit: ebb29b8d36]
2016-09-09 11:40:57 -04:00
foreman a030ebfe49 P4 to Git Change 1312587 by cpaquot@hog-ocl on 2016/09/08 19:46:03
SWDEV-96354 - Wrong usage of hsaImageData_ and deviceMemory_.

	Use hsaImageData_ as the original pointer before alignment and only for that purpose. The deviceMemory_ is where the data is located. No one ever needs to use hsaImageData_ really. This is only an issue with tiled images

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11331/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.hpp#4 edit


[ROCm/clr commit: 51948f577c]
2016-09-08 20:34:24 -04:00
foreman 376b1f2bfa P4 to Git Change 1312581 by lmoriche@lmoriche_opencl_dev on 2016/09/08 19:11:07
SWDEV-102033 - [OCL-LC-ROCm] Change runtime metadata KeyArgKind
	- Required runtime changes to use the new hidden argument metadata

Affected files ...

... //depot/stg/opencl/drivers/opencl/make/llvm.git/llvmlibs#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#14 edit


[ROCm/clr commit: 8fadd2c115]
2016-09-08 20:29:42 -04:00
foreman b602575d44 P4 to Git Change 1312566 by lmoriche@lmoriche_opencl_dev on 2016/09/08 18:25:02
SWDEV-94610 - Make sure each kernarg segment sits on a different cache line (align the kernargs on cache lines at minimum). Minor misc cleanups.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#13 edit


[ROCm/clr commit: 3a61b24dd5]
2016-09-08 19:52:04 -04:00
foreman aaa1bf6ee7 P4 to Git Change 1312399 by skudchad@skudchad_test_win_opencl2 on 2016/09/08 15:10:25
SWDEV-101383 - [RS_DVR][MGPU] Slave GPU is blocked from going into BACO when DVR process is active (no recording or instant replay)
	- Fix a memory leak
	- Also make sure to use VALIDATE_ONLY flag properly as bindExternalDevice can be called even during context creation for which we cant close the adaper

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11330/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#555 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#174 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#62 edit


[ROCm/clr commit: 8705eca425]
2016-09-08 15:14:05 -04:00
foreman d773c9dc27 P4 to Git Change 1312342 by rili@rili-opencl-pal-stg on 2016/09/08 13:27:45
SWDEV-101790 -  Cherry-pick CL#1293292 from 16.20.1010
	                            Support DriverStore in OCL

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#278 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#554 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#173 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#14 edit


[ROCm/clr commit: fbb90f4504]
2016-09-08 13:36:56 -04:00
foreman 08580f51b5 P4 to Git Change 1311856 by asalmanp@asalmanp-ocl-stg-pal on 2016/09/07 15:28:58
SWDEV-79278 - [OpenCL][PAL] fixing a regression in gfx9 after CL#1309875 which caused all the OCLTST tests to fail on gfx9 emulator. Don’t add any extra entry to the GfxIpDeviceInfo table as this table must match with GfxIpLevel enum (located in //depot/stg/pal/inc/core/palDevice.h).

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11313/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#11 edit


[ROCm/clr commit: 8764785ce1]
2016-09-07 15:38:56 -04:00
foreman 3f66d55686 P4 to Git Change 1311624 by nhaustov@nhaustov_hsa on 2016/09/07 05:22:23
SWDEV-101272 - Remove call to OptimizeLLVMBitcode.

	It's no longer needed after LLVM fix https://reviews.llvm.org/D24023 .

	Reviewed by: Laurent Morichetti, German Andryeyev.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#26 edit


[ROCm/clr commit: 6a5ec3a6b0]
2016-09-07 05:29:23 -04:00
foreman 862f54db3e P4 to Git Change 1311385 by gandryey@gera-w8 on 2016/09/06 16:51:05
SWDEV-101448 - [CQE OCL][Brahma][PERF][QR] ~21% perf drop is observed with lulesh-cl subtest of ComputeApps tests : Faulty CL # 1306133
	- Use the logic for transfer size before CL#1306133

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#124 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#10 edit


[ROCm/clr commit: b750057405]
2016-09-06 17:00:06 -04:00
foreman afebe549f0 P4 to Git Change 1311278 by jatang@jatang-opencl-hsa-stg2 on 2016/09/06 14:13:56
SWDEV-101315 - Fix PerfCounter not working under CodeXL.

	1. Need to map ORCA PerfCounter block to PAL PerfCounter block/instance.
	2. CodeXL could try to create PerfCouters that don't exist in HW, so need to handle that and return 0 as result.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#21 edit


[ROCm/clr commit: 93e45cff4e]
2016-09-06 14:24:40 -04:00
foreman c42b53faac P4 to Git Change 1311277 by lmoriche@lmoriche_opencl_dev on 2016/09/06 14:11:25
SWDEV-101853 - Fix the build, add a "return NULL" after the assert.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#7 edit


[ROCm/clr commit: 59ca1d70e6]
2016-09-06 14:19:41 -04:00
foreman 0a916189b5 P4 to Git Change 1311274 by lmoriche@lmoriche_opencl_dev on 2016/09/06 14:02:55
SWDEV-94610 - Fill the compileSize_ and compileSizeHint_ info from the LC metadata.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#13 edit


[ROCm/clr commit: 7d5471671b]
2016-09-06 14:09:20 -04:00
foreman d5898bbcfc P4 to Git Change 1311256 by lmoriche@lmoriche_opencl_dev on 2016/09/06 13:16:58
SWDEV-101853 - roc::Kernel cleanups:
	- Remove unused classes & member functions/variables.
	- Flatten vector arguments for the HSAIL path to  remove the need for numElem_.
	- Consolidate initArguments in a single loop for the HSAIL path.
	- Use the Kernel::Argument to fill the OCL descriptor as much as possible.
	- Set the access qualifier for both buffers and images.
	- Fix the indentation and coding conventions.
	- Add new ROC_ARG_TYPE type for hidden arguments
	- Add an index_ field the roc::Kernel::Argument to record the OCL signature index for this argument, or -1 for hidden arguments
	- Handle the hidden arguments as any other argument at dispatch (now included in the hsailArgList_)
	- roc::Kernel::hsailArgAt(int) now returns the kernel argument for the given position in the OCL signature, not the position the the hsailArgList_.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#12 edit


[ROCm/clr commit: 1c37625aa2]
2016-09-06 13:21:14 -04:00
foreman e4f95d2cc7 P4 to Git Change 1311210 by chui@chui-cnchuiw8-ocl-hsa-stg on 2016/09/06 11:53:29
SWDEV-101383 - Back out CL1310033 as it is causing Carrizo Win 10 Sanity test to crash at ocltst module ocldx.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#553 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#172 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#61 edit


[ROCm/clr commit: 73bb49eb7b]
2016-09-06 12:01:14 -04:00
foreman 79c5cc6014 P4 to Git Change 1310723 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/04 03:00:08
SWDEV-2 - Change OpenCL version number from 2216 to 2217.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1963 edit


[ROCm/clr commit: 9951309118]
2016-09-04 03:09:27 -04:00
foreman d318c61b0c P4 to Git Change 1310684 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/03 03:00:09
SWDEV-2 - Change OpenCL version number from 2215 to 2216.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1962 edit


[ROCm/clr commit: d914798797]
2016-09-03 03:09:06 -04:00
foreman 764d09238f P4 to Git Change 1310397 by lmoriche@lmoriche_opencl_dev on 2016/09/02 13:55:00
SWDEV-101169 - Compile the PCH file from <stdin> instead of a file reference. This removes the requirement to have the original file present when using the PCH file.

Affected files ...

... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/headers/build/Makefile.headers#9 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/irif/build/Makefile.irif#6 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ockl/build/Makefile.ockl#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#9 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ocml/build/Makefile.ocml#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/opencl/build/Makefile.opencl#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#14 edit


[ROCm/clr commit: 7937fd1812]
2016-09-02 14:05:40 -04:00
foreman f90f49613f P4 to Git Change 1310350 by lmoriche@lmoriche_opencl_dev on 2016/09/02 12:29:12
SWDEV-101853 - Use the PointeeAlignment metadata to align the dynamic LDS allocations at dispatch.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#11 edit


[ROCm/clr commit: b581f2510b]
2016-09-02 12:33:55 -04:00
foreman 0af3db0c1e P4 to Git Change 1310134 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/02 03:00:07
SWDEV-2 - Change OpenCL version number from 2214 to 2215.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1961 edit


[ROCm/clr commit: 411cf8cc1d]
2016-09-02 03:05:01 -04:00
foreman c4aef18415 P4 to Git Change 1310081 by lmoriche@lmoriche_opencl_dev on 2016/09/01 21:27:48
SWDEV-94610 - The spec says that the value returned for HSA_EXECUTABLE_SYMBOL_INFO_NAME_LENGTH does not include the NUL terminator. We should add one before using the string.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#25 edit


[ROCm/clr commit: 557d2bfddf]
2016-09-01 21:34:31 -04:00
foreman 991054eec6 P4 to Git Change 1310046 by gandryey@gera-w8 on 2016/09/01 18:43:53
SWDEV-101621 - [CQE OCL][OpenCL on PAL] 6 WF Conformance tests are failing
	- Fix fillimage with DEPTH16 format

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#9 edit


[ROCm/clr commit: 45eedbec7c]
2016-09-01 18:50:31 -04:00
foreman 7e33afa9c4 P4 to Git Change 1310033 by skudchad@skudchad_test_win_opencl2 on 2016/09/01 18:10:18
SWDEV-101383 - [RS_DVR][MGPU] Slave GPU is blocked from going into BACO when DVR process is active (no recording or instant replay)
	- if the OS is Win10, no need to do extensive adapter init.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11241/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#552 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#171 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#60 edit


[ROCm/clr commit: b232b2bbab]
2016-09-01 18:15:08 -04:00
foreman 319308cce0 P4 to Git Change 1310014 by gandryey@gera-w8 on 2016/09/01 17:32:18
SWDEV-101621 - [CQE OCL][OpenCL on PAL] 6 WF Conformance tests are failing
	- Make sure the rowPitch is aligned to pixels for images created from buffer

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#10 edit


[ROCm/clr commit: 09c96ae91c]
2016-09-01 17:44:19 -04:00
foreman 388f4e3b5f P4 to Git Change 1310000 by asalmanp@asalmanp-ocl-stg-pal on 2016/09/01 17:13:37
SWDEV-79278 - [OpenCL][PAL] force Vega10(gfx9)(aka: Greenland) to use PAL backend

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11279/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#551 edit


[ROCm/clr commit: 02a04ecb2c]
2016-09-01 17:23:35 -04:00
foreman 2f60226768 P4 to Git Change 1309875 by smekhano@stas-rampitec-hsa on 2016/09/01 14:11:31
SWDEV-101534 - Lexa bringup: correct capability to 8:0:4 and add complib code
	Testing: smoke, precheckin
	Reviewed by Brian Sumner and Evgeny Mankov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/backends/gpu/scwrapper/SI/devStateSI.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#47 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#41 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#37 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#141 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#10 edit


[ROCm/clr commit: 5787de6741]
2016-09-01 14:20:14 -04:00
foreman 55c1acd44a P4 to Git Change 1309866 by gandryey@gera-w8 on 2016/09/01 13:50:12
SWDEV-79445 - OCL generic changes and code clean-up
	- Improve image fill performance with multiple writes in a single thread. The current split has 3 regions

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#123 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#40 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#4 edit


[ROCm/clr commit: 57043d662d]
2016-09-01 14:01:08 -04:00
foreman 216f9cf1b1 P4 to Git Change 1309857 by lmoriche@lmoriche_opencl_dev on 2016/09/01 13:09:36
SWDEV-94610 - Restore the amdgpu_metadata.[ch]pp namespaces (amd::hsa::code)

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#12 edit


[ROCm/clr commit: d6fbd59c1a]
2016-09-01 13:18:08 -04:00
foreman 9485e60962 P4 to Git Change 1309815 by lmoriche@lmoriche_opencl_dev on 2016/09/01 11:55:16
SWDEV-94610 - Restore the amdgpu_metadata.[ch]pp files. We need to share these files between different projects, and should avoid branching them. Ideally, they would be part of a metadata utility library.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#1 add
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#1 branch
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmetadata.cpp#3 delete
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmetadata.hpp#4 delete
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#11 edit


[ROCm/clr commit: ee978692f5]
2016-09-01 12:05:26 -04:00
foreman e3081c5a39 P4 to Git Change 1309585 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/01 03:00:07
SWDEV-2 - Change OpenCL version number from 2213 to 2214.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1960 edit


[ROCm/clr commit: 51b56820d8]
2016-09-01 03:08:43 -04:00
foreman 1c9fdc27d3 P4 to Git Change 1309526 by lmoriche@lmoriche_opencl_dev on 2016/08/31 22:39:22
SWDEV-94610 - Use the metadata to set the correct size for pointer arguments. Pointers to different address spaces may be of different sizes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#11 edit


[ROCm/clr commit: 741816f873]
2016-08-31 22:44:50 -04:00
foreman ad2a806fb0 P4 to Git Change 1309486 by lmoriche@lmoriche_opencl_dev on 2016/08/31 20:40:56
SWDEV-94610 - Fix the indentation and function declarations (coding standards).

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#10 edit


[ROCm/clr commit: 8d309ccae6]
2016-08-31 20:46:12 -04:00
foreman d9d9ee6414 P4 to Git Change 1309479 by lmoriche@lmoriche_opencl_dev on 2016/08/31 20:14:13
SWDEV-94610 - Fix the argName length issue. The string returned by the ROCR is already NUL-terminated.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#22 edit


[ROCm/clr commit: 52e3652f92]
2016-08-31 20:21:06 -04:00
foreman 0b63f2d19f P4 to Git Change 1309472 by lmoriche@lmoriche_opencl_dev on 2016/08/31 19:27:07
SWDEV-94610 - Fix the API::get_kernel_arg_info conformance test failure. The runtime metadata needs to return references from Name() and TypeName() instead of temporary strings. Name().c_str() should be valid until the program is destroyed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmetadata.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmetadata.hpp#3 edit


[ROCm/clr commit: 29a3f5d56a]
2016-08-31 19:36:48 -04:00
foreman ac10a39e22 P4 to Git Change 1309426 by lmoriche@lmoriche_opencl_dev on 2016/08/31 18:05:23
SWDEV-94610 - roundUp the kernarg segment size to the hidden arguments alignment (sizeof(size_t)). Fixes ocltst::oclperf::OCLPerfLDSReadSpeed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#21 edit


[ROCm/clr commit: ed155ace2e]
2016-08-31 18:20:16 -04:00
foreman 59fbb048ea P4 to Git Change 1309300 by wchau@wchau_WIN_OCL_HSA on 2016/08/31 14:57:02
SWDEV-100272 - [OCL] Support AMD DVR Core functionalities - Distinct Single Display (Changes in OCL runtime)

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#27 edit


[ROCm/clr commit: 07339a999b]
2016-08-31 15:04:44 -04:00
foreman 5338d11731 P4 to Git Change 1309060 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/08/31 03:00:08
SWDEV-2 - Change OpenCL version number from 2212 to 2213.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1959 edit


[ROCm/clr commit: 55c2ad2f12]
2016-08-31 03:08:49 -04:00
foreman 0de4de0907 P4 to Git Change 1309005 by lmoriche@lmoriche_opencl_dev on 2016/08/31 00:08:03
SWDEV-94644 - Generate a relocatable PCH for opencl-c.h. Use -isysroot to point to $(LLVM_BIN)/../../include where opencl-c.h should be installed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/headers/build/Makefile.headers#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/irif/build/Makefile.irif#5 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ockl/build/Makefile.ockl#6 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ocml/build/Makefile.ocml#6 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/opencl/build/Makefile.opencl#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#20 edit


[ROCm/clr commit: 1d6b47602d]
2016-08-31 00:13:59 -04:00
foreman 973e214c15 P4 to Git Change 1308969 by lmoriche@lmoriche_opencl_dev on 2016/08/30 21:30:32
SWDEV-94644 - Run prepare-builtins on the control functions.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/build/Makefile.library#53 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#19 edit


[ROCm/clr commit: 6cb1ec0047]
2016-08-30 21:40:25 -04:00
foreman fd2ff06502 P4 to Git Change 1308933 by gandryey@gera-w8 on 2016/08/30 18:42:21
SWDEV-86035 - Enable PAL for GFX9 by default
	- GPU_ENABLE_PAL=0 will force GSL backend for GFX9

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#550 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#256 edit


[ROCm/clr commit: e0f15c2613]
2016-08-30 19:13:14 -04:00