Commit Graph

1485 Commits

Author SHA1 Message Date
foreman be0df52adf P4 to Git Change 1317944 by johtaylo@johtaylo-jtincrementor-increment on 2016/09/23 03:00:05
SWDEV-2 - Change OpenCL version number from 2222 to 2223.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1969 edit


[ROCm/clr commit: 7f531afdb4]
2016-09-23 03:18:40 -04:00
foreman 4624b20760 P4 to Git Change 1317781 by lmoriche@lmoriche_opencl_dev on 2016/09/22 19:28:45
SWDEV-94610 - Add runtime support for Printf. Parse the metadata strings and build the PrintInfo structure expected by the runtime.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#10 edit


[ROCm/clr commit: c5b3373da2]
2016-09-22 19:35:10 -04:00
foreman 8870b4d28d P4 to Git Change 1317395 by johtaylo@johtaylo-jtincrementor-increment on 2016/09/22 03:00:04
SWDEV-2 - Change OpenCL version number from 2221 to 2222.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1968 edit


[ROCm/clr commit: 4dd808cd6d]
2016-09-22 03:11:20 -04:00
foreman d6f7ddd5c5 P4 to Git Change 1317285 by lmoriche@lmoriche_opencl_dev on 2016/09/21 20:36:55
SWDEV-102540 - Fill the device info_.boardName_ field from the info returned by ROCR's HSA_AMD_AGENT_INFO_PRODUCT_NAME query.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#19 edit


[ROCm/clr commit: 0dd6aae926]
2016-09-21 20:43:02 -04:00
foreman 067afc2c02 P4 to Git Change 1317211 by gandryey@gera-w8 on 2016/09/21 16:53:02
SWDEV-102971 - [CQE OCL][OpenCL on PAL] "computer on low memory" issue observed folllowed by tdr while running few WF Conformance tests together
	- Use custom memory allocator to avoid memory grow in PAL
	- Extra clean-ups in resource alloc, including a fix for image pitch in the persistent memory

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#9 edit


[ROCm/clr commit: 48590452f1]
2016-09-21 17:08:14 -04:00
foreman 6846b8ee16 P4 to Git Change 1316856 by johtaylo@johtaylo-jtincrementor-increment on 2016/09/21 03:00:05
SWDEV-2 - Change OpenCL version number from 2220 to 2221.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1967 edit


[ROCm/clr commit: fcc6d6637b]
2016-09-21 03:05:42 -04:00
foreman 70acf47d09 P4 to Git Change 1316785 by lmoriche@lmoriche_opencl_dev on 2016/09/20 20:47:40
SWDEV-94610 - Add gfx700 to the list of suported targets in HSAILProgram::linkImpl_LC. When dumping the source (-save-temps), print the options actually sent to clang as well as the options passed to OpenCL.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/build/Makefile.library#56 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#36 edit


[ROCm/clr commit: 383e97425b]
2016-09-20 20:53:50 -04:00
foreman 69fc4580e8 P4 to Git Change 1316718 by johtaylo@johtaylo-jtincrementor-increment on 2016/09/20 17:52:05
SWDEV-2 - Change OpenCL version number from 2219 to 2220.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1966 edit


[ROCm/clr commit: e9c1fb06c7]
2016-09-20 18:02:57 -04:00
foreman 1907fcb485 P4 to Git Change 1316686 by johtaylo@johtaylo-jtincrementor-increment on 2016/09/20 17:08:15
SWDEV-2 - Change OpenCL version number from 2218 to 2219.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1965 edit


[ROCm/clr commit: f6529ae04a]
2016-09-20 17:22:31 -04:00
foreman c54ba8bfa1 P4 to Git Change 1316519 by johtaylo@johtaylo_L7_stg on 2016/09/20 12:50:43
SWDEV-2 - Change OpenCL version number from 2217 to 2218.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1964 edit


[ROCm/clr commit: 34e0d956ec]
2016-09-20 12:56:44 -04:00
foreman 75418f30e6 P4 to Git Change 1316270 by scharupa@scharupa_temp on 2016/09/20 03:14:55
SWDEV-96479 - changing tab to space as Victor pointed out

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#18 edit


[ROCm/clr commit: c2b57553dc]
2016-09-20 03:20:15 -04:00
foreman ae76b26df1 P4 to Git Change 1316221 by scharupa@scharupa_temp on 2016/09/20 01:44:43
SWDEV-96479 - bug fix, exposing atomics capabilities based on GFX IP. For Hawaii the atomics is not supported hence disabled.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#17 edit


[ROCm/clr commit: d793001561]
2016-09-20 01:52:18 -04:00
foreman b89d0bea0f P4 to Git Change 1316123 by gandryey@gera-w8 on 2016/09/19 18:22:55
SWDEV-102286 - [CQE OCL][OpenCL on PAL] 32/64 "DX9" OpenCL SDK Sample is crashing
	- Correct DX9 device binding

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#22 edit


[ROCm/clr commit: b65af32fe1]
2016-09-19 20:13:01 -04:00
foreman 321fc0490a P4 to Git Change 1315824 by asalmanp@asalmanp-opencl-stg on 2016/09/19 12:04:15
SWDEV-102417 - Forum [205433] : Memory leak with printf statement inside kernel code

	A memory leak can occur if a printf statement is inside the .cl source code but it is not used inside the __kernel code (e.g., a function inside .cl code that uses printf but never called by the __kernel). In this case compiler generates the printf metadata but printf is not used by the __kernel (i.e., the printf buffer is empty).

	To fix this issue, release the transfer buffer object before returning false in PrintfDbgHSA::output function.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11394/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprintf.cpp#3 edit


[ROCm/clr commit: 0dd7a03954]
2016-09-19 12:13:23 -04:00
foreman 75f80ffbfb P4 to Git Change 1313945 by wchau@wchau_OCL_boltzmann on 2016/09/13 15:57:45
SWDEV-94605 - [OCL-LC-ROCm] ability to load offline (and online) generated binaries using the HSA code object format

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#203 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#83 edit


[ROCm/clr commit: 0495f895a2]
2016-09-13 16:16:35 -04:00
foreman 89175ce61b P4 to Git Change 1313578 by lmoriche@lmoriche_opencl_dev on 2016/09/13 00:14:36
SWDEV-94610 - Target features are only needed in the CL->IR stage. The attributes remain on the function, so they should not be set again in the IR->ISA stage.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#34 edit


[ROCm/clr commit: a1009a5d11]
2016-09-12 23:23:59 -05:00
foreman 322917f699 P4 to Git Change 1313559 by lmoriche@lmoriche_opencl_dev on 2016/09/12 22:58:58
SWDEV-94610 - Don't use the -cl-denorms-are-zero, but instead set the fp32/fp64 denorms with the target features +fp32-denormals and +fp64-denormals. fp64-denormals is always set, fp32-denormals in only set if device >= gfx900 and -cl-denorms-are-zero is not set.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#33 edit


[ROCm/clr commit: 7239172265]
2016-09-12 23:08:18 -04:00
foreman 73a973b006 P4 to Git Change 1313484 by lmoriche@lmoriche_opencl_dev on 2016/09/12 17:32:10
SWDEV-94610 - Fix the HSAIL Runtime build.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#32 edit


[ROCm/clr commit: 978a25ba95]
2016-09-12 16:43:09 -05:00
foreman e52e05fe56 P4 to Git Change 1313477 by lmoriche@lmoriche_opencl_dev on 2016/09/12 17:22:37
SWDEV-94610 - Split the HSAILProgram::hsailOptions into preprocessorOptions amd codegenOptions. Pass the codegenOptions to the IR->ISA stage (linkImpl).

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#13 edit


[ROCm/clr commit: 64ed800f46]
2016-09-12 17:30:21 -04:00
foreman bb8ebe5ed4 P4 to Git Change 1313439 by lmoriche@lmoriche_opencl_dev on 2016/09/12 16:05:39
SWDEV-94610 - Revert to the HSA 1.0 API for now.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#16 edit


[ROCm/clr commit: fe6ef85918]
2016-09-12 16:12:26 -04:00
foreman 9a0b935753 P4 to Git Change 1313121 by lmoriche@lmoriche_opencl_dev on 2016/09/11 15:21:47
SWDEV-94611 - [OCL-LC-ROCm] Use GFX IP for device name. Set the name to "gfx[M][m][s]" (M:major,m:minor,stepping). Removed the device name strings from the DeviceInfo table. Keep the machineTarget_ field until the compiler is changed to accept gfxip strings.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdefs.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#15 edit


[ROCm/clr commit: c828fa8498]
2016-09-11 15:30:45 -04:00
foreman 120b9b53c7 P4 to Git Change 1313106 by lmoriche@lmoriche_opencl_dev on 2016/09/10 22:38:21
SWDEV-94644 - Run prepare-builtins from the modules build directory, instead of right before generating the include files. Renamed the files to match the opensource build names (except for the .amdgcn suffix). Automatically generate a single include file for all libraries.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/build/Makefile.library#54 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/irif/build/Makefile.irif#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ockl/build/Makefile.ockl#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#10 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ocml/build/Makefile.ocml#8 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/opencl/build/Makefile.opencl#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#30 edit


[ROCm/clr commit: 8bb15b463b]
2016-09-10 22:46:52 -04:00
foreman 2a99a81d61 P4 to Git Change 1312856 by lmoriche@lmoriche_opencl_dev on 2016/09/09 11:52:59
SWDEV-94610 - Build fix for the HSAIL Runtime Foundation build

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#29 edit


[ROCm/clr commit: 70ef99eb26]
2016-09-09 12:02:35 -04:00
foreman c832965ae1 P4 to Git Change 1312846 by gandryey@gera-w8 on 2016/09/09 11:33:02
SWDEV-86035 - Add PAL backend to OpenCL
	- Add (PAL) suffix to the driver version

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#556 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#21 edit


[ROCm/clr commit: ebb29b8d36]
2016-09-09 11:40:57 -04:00
foreman a030ebfe49 P4 to Git Change 1312587 by cpaquot@hog-ocl on 2016/09/08 19:46:03
SWDEV-96354 - Wrong usage of hsaImageData_ and deviceMemory_.

	Use hsaImageData_ as the original pointer before alignment and only for that purpose. The deviceMemory_ is where the data is located. No one ever needs to use hsaImageData_ really. This is only an issue with tiled images

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11331/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocmemory.hpp#4 edit


[ROCm/clr commit: 51948f577c]
2016-09-08 20:34:24 -04:00
foreman 376b1f2bfa P4 to Git Change 1312581 by lmoriche@lmoriche_opencl_dev on 2016/09/08 19:11:07
SWDEV-102033 - [OCL-LC-ROCm] Change runtime metadata KeyArgKind
	- Required runtime changes to use the new hidden argument metadata

Affected files ...

... //depot/stg/opencl/drivers/opencl/make/llvm.git/llvmlibs#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#14 edit


[ROCm/clr commit: 8fadd2c115]
2016-09-08 20:29:42 -04:00
foreman b602575d44 P4 to Git Change 1312566 by lmoriche@lmoriche_opencl_dev on 2016/09/08 18:25:02
SWDEV-94610 - Make sure each kernarg segment sits on a different cache line (align the kernargs on cache lines at minimum). Minor misc cleanups.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#13 edit


[ROCm/clr commit: 3a61b24dd5]
2016-09-08 19:52:04 -04:00
foreman aaa1bf6ee7 P4 to Git Change 1312399 by skudchad@skudchad_test_win_opencl2 on 2016/09/08 15:10:25
SWDEV-101383 - [RS_DVR][MGPU] Slave GPU is blocked from going into BACO when DVR process is active (no recording or instant replay)
	- Fix a memory leak
	- Also make sure to use VALIDATE_ONLY flag properly as bindExternalDevice can be called even during context creation for which we cant close the adaper

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11330/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#555 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#174 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#62 edit


[ROCm/clr commit: 8705eca425]
2016-09-08 15:14:05 -04:00
foreman d773c9dc27 P4 to Git Change 1312342 by rili@rili-opencl-pal-stg on 2016/09/08 13:27:45
SWDEV-101790 -  Cherry-pick CL#1293292 from 16.20.1010
	                            Support DriverStore in OCL

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#278 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#554 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#173 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#14 edit


[ROCm/clr commit: fbb90f4504]
2016-09-08 13:36:56 -04:00
foreman 08580f51b5 P4 to Git Change 1311856 by asalmanp@asalmanp-ocl-stg-pal on 2016/09/07 15:28:58
SWDEV-79278 - [OpenCL][PAL] fixing a regression in gfx9 after CL#1309875 which caused all the OCLTST tests to fail on gfx9 emulator. Don’t add any extra entry to the GfxIpDeviceInfo table as this table must match with GfxIpLevel enum (located in //depot/stg/pal/inc/core/palDevice.h).

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11313/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#11 edit


[ROCm/clr commit: 8764785ce1]
2016-09-07 15:38:56 -04:00
foreman 3f66d55686 P4 to Git Change 1311624 by nhaustov@nhaustov_hsa on 2016/09/07 05:22:23
SWDEV-101272 - Remove call to OptimizeLLVMBitcode.

	It's no longer needed after LLVM fix https://reviews.llvm.org/D24023 .

	Reviewed by: Laurent Morichetti, German Andryeyev.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#26 edit


[ROCm/clr commit: 6a5ec3a6b0]
2016-09-07 05:29:23 -04:00
foreman 862f54db3e P4 to Git Change 1311385 by gandryey@gera-w8 on 2016/09/06 16:51:05
SWDEV-101448 - [CQE OCL][Brahma][PERF][QR] ~21% perf drop is observed with lulesh-cl subtest of ComputeApps tests : Faulty CL # 1306133
	- Use the logic for transfer size before CL#1306133

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#124 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#10 edit


[ROCm/clr commit: b750057405]
2016-09-06 17:00:06 -04:00
foreman afebe549f0 P4 to Git Change 1311278 by jatang@jatang-opencl-hsa-stg2 on 2016/09/06 14:13:56
SWDEV-101315 - Fix PerfCounter not working under CodeXL.

	1. Need to map ORCA PerfCounter block to PAL PerfCounter block/instance.
	2. CodeXL could try to create PerfCouters that don't exist in HW, so need to handle that and return 0 as result.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#21 edit


[ROCm/clr commit: 93e45cff4e]
2016-09-06 14:24:40 -04:00
foreman c42b53faac P4 to Git Change 1311277 by lmoriche@lmoriche_opencl_dev on 2016/09/06 14:11:25
SWDEV-101853 - Fix the build, add a "return NULL" after the assert.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#7 edit


[ROCm/clr commit: 59ca1d70e6]
2016-09-06 14:19:41 -04:00
foreman 0a916189b5 P4 to Git Change 1311274 by lmoriche@lmoriche_opencl_dev on 2016/09/06 14:02:55
SWDEV-94610 - Fill the compileSize_ and compileSizeHint_ info from the LC metadata.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#13 edit


[ROCm/clr commit: 7d5471671b]
2016-09-06 14:09:20 -04:00
foreman d5898bbcfc P4 to Git Change 1311256 by lmoriche@lmoriche_opencl_dev on 2016/09/06 13:16:58
SWDEV-101853 - roc::Kernel cleanups:
	- Remove unused classes & member functions/variables.
	- Flatten vector arguments for the HSAIL path to  remove the need for numElem_.
	- Consolidate initArguments in a single loop for the HSAIL path.
	- Use the Kernel::Argument to fill the OCL descriptor as much as possible.
	- Set the access qualifier for both buffers and images.
	- Fix the indentation and coding conventions.
	- Add new ROC_ARG_TYPE type for hidden arguments
	- Add an index_ field the roc::Kernel::Argument to record the OCL signature index for this argument, or -1 for hidden arguments
	- Handle the hidden arguments as any other argument at dispatch (now included in the hsailArgList_)
	- roc::Kernel::hsailArgAt(int) now returns the kernel argument for the given position in the OCL signature, not the position the the hsailArgList_.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#12 edit


[ROCm/clr commit: 1c37625aa2]
2016-09-06 13:21:14 -04:00
foreman e4f95d2cc7 P4 to Git Change 1311210 by chui@chui-cnchuiw8-ocl-hsa-stg on 2016/09/06 11:53:29
SWDEV-101383 - Back out CL1310033 as it is causing Carrizo Win 10 Sanity test to crash at ocltst module ocldx.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#553 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#172 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#61 edit


[ROCm/clr commit: 73bb49eb7b]
2016-09-06 12:01:14 -04:00
foreman 79c5cc6014 P4 to Git Change 1310723 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/04 03:00:08
SWDEV-2 - Change OpenCL version number from 2216 to 2217.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1963 edit


[ROCm/clr commit: 9951309118]
2016-09-04 03:09:27 -04:00
foreman d318c61b0c P4 to Git Change 1310684 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/03 03:00:09
SWDEV-2 - Change OpenCL version number from 2215 to 2216.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1962 edit


[ROCm/clr commit: d914798797]
2016-09-03 03:09:06 -04:00
foreman 764d09238f P4 to Git Change 1310397 by lmoriche@lmoriche_opencl_dev on 2016/09/02 13:55:00
SWDEV-101169 - Compile the PCH file from <stdin> instead of a file reference. This removes the requirement to have the original file present when using the PCH file.

Affected files ...

... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/headers/build/Makefile.headers#9 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/irif/build/Makefile.irif#6 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ockl/build/Makefile.ockl#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/oclc/build/Makefile.oclc#9 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/ocml/build/Makefile.ocml#7 edit
... //depot/stg/opencl/drivers/opencl/make/amdgcn.git/opencl/build/Makefile.opencl#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/roccompiler.cpp#14 edit


[ROCm/clr commit: 7937fd1812]
2016-09-02 14:05:40 -04:00
foreman f90f49613f P4 to Git Change 1310350 by lmoriche@lmoriche_opencl_dev on 2016/09/02 12:29:12
SWDEV-101853 - Use the PointeeAlignment metadata to align the dynamic LDS allocations at dispatch.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#11 edit


[ROCm/clr commit: b581f2510b]
2016-09-02 12:33:55 -04:00
foreman 0af3db0c1e P4 to Git Change 1310134 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/09/02 03:00:07
SWDEV-2 - Change OpenCL version number from 2214 to 2215.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1961 edit


[ROCm/clr commit: 411cf8cc1d]
2016-09-02 03:05:01 -04:00
foreman c4aef18415 P4 to Git Change 1310081 by lmoriche@lmoriche_opencl_dev on 2016/09/01 21:27:48
SWDEV-94610 - The spec says that the value returned for HSA_EXECUTABLE_SYMBOL_INFO_NAME_LENGTH does not include the NUL terminator. We should add one before using the string.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#25 edit


[ROCm/clr commit: 557d2bfddf]
2016-09-01 21:34:31 -04:00
foreman 991054eec6 P4 to Git Change 1310046 by gandryey@gera-w8 on 2016/09/01 18:43:53
SWDEV-101621 - [CQE OCL][OpenCL on PAL] 6 WF Conformance tests are failing
	- Fix fillimage with DEPTH16 format

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#9 edit


[ROCm/clr commit: 45eedbec7c]
2016-09-01 18:50:31 -04:00
foreman 7e33afa9c4 P4 to Git Change 1310033 by skudchad@skudchad_test_win_opencl2 on 2016/09/01 18:10:18
SWDEV-101383 - [RS_DVR][MGPU] Slave GPU is blocked from going into BACO when DVR process is active (no recording or instant replay)
	- if the OS is Win10, no need to do extensive adapter init.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11241/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#552 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#171 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#60 edit


[ROCm/clr commit: b232b2bbab]
2016-09-01 18:15:08 -04:00
foreman 319308cce0 P4 to Git Change 1310014 by gandryey@gera-w8 on 2016/09/01 17:32:18
SWDEV-101621 - [CQE OCL][OpenCL on PAL] 6 WF Conformance tests are failing
	- Make sure the rowPitch is aligned to pixels for images created from buffer

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#10 edit


[ROCm/clr commit: 09c96ae91c]
2016-09-01 17:44:19 -04:00
foreman 388f4e3b5f P4 to Git Change 1310000 by asalmanp@asalmanp-ocl-stg-pal on 2016/09/01 17:13:37
SWDEV-79278 - [OpenCL][PAL] force Vega10(gfx9)(aka: Greenland) to use PAL backend

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/11279/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#551 edit


[ROCm/clr commit: 02a04ecb2c]
2016-09-01 17:23:35 -04:00
foreman 2f60226768 P4 to Git Change 1309875 by smekhano@stas-rampitec-hsa on 2016/09/01 14:11:31
SWDEV-101534 - Lexa bringup: correct capability to 8:0:4 and add complib code
	Testing: smoke, precheckin
	Reviewed by Brian Sumner and Evgeny Mankov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/backends/gpu/scwrapper/SI/devStateSI.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#47 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#41 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#37 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#141 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#10 edit


[ROCm/clr commit: 5787de6741]
2016-09-01 14:20:14 -04:00
foreman 55c1acd44a P4 to Git Change 1309866 by gandryey@gera-w8 on 2016/09/01 13:50:12
SWDEV-79445 - OCL generic changes and code clean-up
	- Improve image fill performance with multiple writes in a single thread. The current split has 3 regions

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#123 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#40 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#4 edit


[ROCm/clr commit: 57043d662d]
2016-09-01 14:01:08 -04:00
foreman 216f9cf1b1 P4 to Git Change 1309857 by lmoriche@lmoriche_opencl_dev on 2016/09/01 13:09:36
SWDEV-94610 - Restore the amdgpu_metadata.[ch]pp namespaces (amd::hsa::code)

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/amdgpu_metadata.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#12 edit


[ROCm/clr commit: d6fbd59c1a]
2016-09-01 13:18:08 -04:00