نمودار کامیت

408 کامیت‌ها

مولف SHA1 پیام تاریخ
foreman fffbe5bb3b P4 to Git Change 1236618 by weizhang@msdnweizhang-opencl on 2016/02/11 22:12:34
SWDEV-78299 - Back out changelist 1236441 since OCLCreateBuffer fails.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#541 edit
2016-02-11 22:20:56 -05:00
foreman 31c030ff18 P4 to Git Change 1236441 by weizhang@weizhang-lnx-opencl on 2016/02/11 12:33:49
SWDEV-78299 - [Brahma] Setting max single allocation size by comparing cardMemAvailableBytes with cardExtMemAvailableBytes on Brahma.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#540 edit
2016-02-11 12:41:22 -05:00
foreman c16578eb71 P4 to Git Change 1236419 by asalmanp@asalmanp-opencl-stg on 2016/02/11 11:36:16
SWDEV-79278 - [OpenCL][Ellesmere BU] Exposing Ellesmere in mainline for 16.20 release

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9750/

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings.h#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil.h#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil64.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_hsail.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_hsail64.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#39 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#34 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#30 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#29 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#24 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/AMDILSubtarget.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/Internal/InternalProcessors.td#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#133 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLDeviceQueries.cpp#33 edit
2016-02-11 11:49:08 -05:00
foreman 45715f0c48 P4 to Git Change 1235782 by asalmanp@asalmanp-opencl-stg on 2016/02/09 15:30:16
SWDEV-68792 - [OpenCL][HWS/CWSR] Adding support for Hardware Scheduler and Compute Wave Save restore (CWSR) feature on ORCA

	Adding a temporary w/a for a CP uCode bug in HWS mode. Due to this bug, CP uCode loops through a RUNLIST unless there is a submission on all queues in HWS mode. This causes some overhead and performance drop in PCMark8 on CZ in HWS mode. To work around this issue, it was suggested to submit a dummy packet during initialization on all available queues on HWS mode so that CP uCode can break the loop. This w/a should be removed once CP uCode provides a final fix for this issue.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9616/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#160 edit
2016-02-09 17:25:20 -05:00
foreman c22b282854 P4 to Git Change 1235269 by asalmanp@asalmanp-opencl-stg on 2016/02/08 12:27:46
SWDEV-79278 - [OpenCL][Baffin BU] Exposing Baffin in mainline for 16.20 release
	ReviewBoardURL = http://ocltc.amd.com/reviews/r/8969/

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil.h#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_amdil64.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_hsail.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings_hsail64.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#29 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#28 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/AMDILSubtarget.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/Internal/InternalProcessors.td#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#132 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLDeviceQueries.cpp#32 edit
2016-02-08 12:36:54 -05:00
foreman 27c1ec4857 P4 to Git Change 1234990 by bcrane@MSDNBCRANE99-OCL on 2016/02/05 15:44:10
SWDEV-86885 - [CQE OCL][2.0][QR][CFX] Few SDK 64 and 32 bit Samples resulting in Soft/hard hangs with faulty cl: 1233743

	Unsubmit CL1233743 because of problems with CFX.

	I have not been able to figure out how Cl1233743 would cause a problem.  It sets a flag, to disable new code... I would expect the new code to be the problem, not the disable.

	So, in case this unsubmit does not elminate the problem, CL1233686 is shelved, to use #ifndef CAL_SUPPORT around the new code in CL1226184.

	CL1233743 was submitted for:
	SWDEV-86253 - [QR] 6 to 7% performance drop is observed in BasemarkCL test

	CL1226184 adds serialization to LHIO, because pxproxy accesses global state, without locks, leading to crash, etc.

	To fix OpenCL perf regressions, allow unserialized access to pxproxy.  It may be that OpenCL design leads away from racy behavior, so it may be safe.  This was checked in to gather information.  What software gets perf drops?  Does any software get fixed?

	The dangerous functions are CreateDevice, CreateContext, because the handles from the OS are saved in global caches, and if those global caches are modified in one thread, while other threads are looking-up from the caches - that is a problem.

	CL1226184 was submitted for:
	SWDEV-80442 - [QR][Adobe Premier Pro CS6] TDR/App Crash observed while resizing the video window within workspace

	TC: http://ocltc:8111/viewModification.html?modId=66278&personal=true

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#159 edit
2016-02-05 15:55:28 -05:00
foreman bb433385db P4 to Git Change 1234162 by xcui@merged_opencl_jxcwin on 2016/02/03 18:17:16
SWDEV-84299 - added support for svmmemcpy to handle the situation that src/dst pointers may or may not in the SVM space

	code review:
	http://ocltc.amd.com/reviews/r/9645/
	precheckin:
	http://ocltc.amd.com:8111/viewModification.html?modId=66292&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#394 edit
2016-02-03 18:24:32 -05:00
foreman bd54838ca7 P4 to Git Change 1233743 by bcrane@MSDNBCRANE99-OCL on 2016/02/02 20:45:13
SWDEV-86253 - [QR] 6 to 7% performance drop is observed in BasemarkCL test

	CL1226184 adds serialization to LHIO, because pxproxy accesses global state, without locks, leading to crash, etc.

	To fix OpenCL perf regressions, allow unserialized access to pxproxy.  It may be that OpenCL design leads away from racy behavior, so it may be safe.  This was checked in to gather information.  What software gets perf drops?  Does any software get fixed?

	The dangerous functions are CreateDevice, CreateContext, because the handles from the OS are saved in global caches, and if those global caches are modified in one thread, while other threads are looking-up from the caches - that is a problem.

	TC: http://ocltc:8111/viewModification.html?modId=66278&personal=true

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#158 edit
2016-02-02 20:52:50 -05:00
foreman 434a3ff361 P4 to Git Change 1232550 by skudchad@skudchad_test_win_opencl2 on 2016/01/29 14:32:21
SWDEV-77172 - Enable ThreadTrace for SVM.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9671/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#342 edit
2016-01-29 14:39:05 -05:00
foreman f009f13ac4 P4 to Git Change 1232190 by gandryey@gera-w8 on 2016/01/28 15:54:15
SWDEV-86312 - Report correct available LDS size for clGetKernelInfoAMD
	- return LDS size.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#35 edit
2016-01-28 16:02:41 -05:00
foreman 4ad4539710 P4 to Git Change 1231595 by weizhang@weizhang-lnx-opencl on 2016/01/27 12:13:15
SWDEV-85649 - The return of owner() needs to be casted to get amd::Image.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#125 edit
2016-01-27 12:28:37 -05:00
foreman 80ab916a5b P4 to Git Change 1231549 by wchau@wchau_WIN_OCL_HSA on 2016/01/27 11:23:48
SWDEV-79308 - Use 64-bit to calculate the scratch buffer size for OCL.  We observed that the computed scratch buffer size could be > 4G when compilier optimization option is not used.
	Cross branch change - requires CL1231547.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#157 edit
2016-01-27 11:36:07 -05:00
foreman 14179df14f P4 to Git Change 1228972 by smekhano@stas-nova-hsa on 2016/01/19 14:21:26
SWDEV-85602 - rename hsail-64 arch to hsail64
	This is to match other existing llvm targets, such as spir64 and amdil64, as well as to match behavior of open source HSAIL BE.
	For legacy users there is alias "-hsail-64" provided in the aoc2 only.

	Testing: smoke, precheckin
	Reviewed by Matthew Arsenault, Evgeny Mankov and Nikolay Haustov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/backends/common/codegen.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/backends/common/frontend.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/utils/v0_8/target_mappings.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/codegen.cpp#66 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/frontend.cpp#37 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#37 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILTargetMachine.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/TargetInfo/HSAILTargetInfo.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/tools/aacl/aa.h#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#80 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/amp_libm/build/Makefile.amp_libm#4 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/gcn/build/Makefile.gcn#20 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/gcndev/build/Makefile.gcndev#3 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/build/Makefile.hsail#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#153 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#222 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsacompiler.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/bin/test_driver.pl#25 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/tlst/complib.tlst#21 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/tlst/ocl_debug.tlst#9 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/tlst/ocl_regression.tlst#25 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/spir/SPIRBase.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/spir/SPIRVBasic.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/spir/SPIRVDropIn.cpp#5 edit
2016-01-19 14:31:59 -05:00
foreman db94c3fb66 P4 to Git Change 1228628 by wchau@wchau_WIN_OCL_HSA on 2016/01/18 17:02:40
SWDEV-79308 -  Resubmit of CL1228064 with restriction of mininum scratch buffer size of 64K if a scratch buffer is needed.
	  Reduce the total scratch buffer size by a factor of 4, which in effect reducing the max. scratch waves from 32 to 8, to avoid the required total scratch buffer size exceeds the available local memory.
	  Made sure the scratch buffer size is aligned with 64K boundary

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#235 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#156 edit
2016-01-18 17:19:01 -05:00
foreman e84d8eb04d P4 to Git Change 1228237 by skudchad@skudchad_test_win_opencl2 on 2016/01/15 22:24:47
SWDEV-77172 - Back out changelist 1228194

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#221 edit
2016-01-15 22:32:16 -05:00
foreman 3345a97263 P4 to Git Change 1228194 by skudchad@skudchad_test_win_opencl2 on 2016/01/15 17:13:28
SWDEV-77172 - Choose isa handle on CZ based on whether SVM is supported or not as SVM may need additional SC Options to be passed as default.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9531/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#220 edit
2016-01-15 17:23:45 -05:00
foreman 01dda0e716 P4 to Git Change 1228137 by wchau@wchau_WIN_OCL_HSA on 2016/01/15 14:53:24
SWDEV-79308 - Back out changelist 1228064

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#234 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#155 edit
2016-01-15 15:04:14 -05:00
foreman 685d1164ee P4 to Git Change 1228064 by wchau@wchau_WIN_OCL_HSA on 2016/01/15 11:18:23
SWDEV-79308 -  Resubmit of CL1226881 with the fix of the SC sanity check issue.  Reduce the total scratch buffer size by a factor of 4, which in effect reducing the max. scratch waves from 32 to 8, to avoid the required total scratch buffer
	size exceeds the available local memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#233 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#154 edit
2016-01-15 11:26:41 -05:00
foreman ec51f85c8e P4 to Git Change 1227495 by skudchad@skudchad_test_win_opencl2 on 2016/01/13 18:12:59
SWDEV-77172 - Disable ThreadTrace on SVM as its causing hang until a solution can be found.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9502/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#341 edit
2016-01-13 18:18:51 -05:00
foreman e66e70e316 P4 to Git Change 1226990 by johtaylo@johtaylo_L7_stg on 2016/01/12 17:44:18
SWDEV-79308 -  Back out changelist 1226881

	Causes failures in execution model, math and pipes

	http://ocltc.amd.com:8111/viewLog.html?buildId=14142599&tab=buildResultsDiv&buildTypeId=TestsOpenCLScSanity_BonaireConformanceWin764bit

	Reduce the total scratch buffer size by a factor of 4, which in effect reducing the max. scratch waves from 32 to 8, to avoid the required total scratch buffer
	size exceeds the available local memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#153 edit
2016-01-12 17:51:14 -05:00
foreman a319c0e35c P4 to Git Change 1226881 by wchau@wchau_WIN_OCL_HSA on 2016/01/12 14:03:39
SWDEV-79308 -  Reduce the total scratch buffer size by a factor of 4, which in effect reducing the max. scratch waves from 32 to 8, to avoid the required total scratch buffer
	size exceeds the available local memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#152 edit
2016-01-12 14:11:46 -05:00
foreman 6a0e72acc5 P4 to Git Change 1226780 by nhaustov@nhaustov_hsa on 2016/01/12 07:20:04
SWDEV-77584 - ORCA RT/Compiler Lib: Add 8:0:4, 8:1:0 and 9:0:1 compute capabilities.

	Changes by Evgeniy Mankov.

	Also cherry-pick CL 1226741 for 8:1:0 compute capability.

	Testing: smoke, pre-checkin

	[Reviewers] Nikolay Haustov, German Andryeyev

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/ext/finalizer/program.cpp#20 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/ext/loader/loaders.cpp#7 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/ext/loader/loaders.hpp#7 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/tests/tlst/finalizer_offline.tlst#15 integrate
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#131 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#219 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#63 edit
2016-01-12 07:27:39 -05:00
foreman bc822d60d1 P4 to Git Change 1226078 by rili@rili_opencl_stg on 2016/01/08 15:03:58
SWDEV-84309 - Using agpMemAvailableCacheableBytes instead of agpMemAvailableBytes when calculating free memory for viPlus_ apu.
	                         When memory allocation is in system memory, only agpMemAvailableCacheableBytes is changed

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#539 edit
2016-01-08 15:16:55 -05:00
foreman b4f5366fe4 P4 to Git Change 1223703 by nhaustov@nhaustov_hsa on 2015/12/28 05:46:00
SWDEV-77584 - Fix ocl_conformance compiler failures.

	1. If compiling in debug mode, linkImpl wasn't called and kernelNames wasn't set
	which led to CL_INVALID_KERNEL_NAME errors in debug configs in TeamCity. Looking at AMDIL
	code, there is no reason to skip linkImpl in debug mode.

	2. Set types to TYPE_LIBRARY/TYPE_EXECUTABLE. This fixes ocl_conformance compiler program_binary_type.

	Reviewed by: Evgeniy Mankov
	Testing: smoke, pre-checkin, OCLSeparateCompile.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#218 edit
2015-12-28 05:53:39 -05:00
foreman f15514da95 P4 to Git Change 1223278 by wchau@wchau_WIN_OCL_HSA on 2015/12/23 15:37:26
SWDEV-66693 - OpenCL Runtime HW Debug support development - use flag, instead of getenv() call, in IOL to indicate the enablement of HW Debug.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#151 edit
2015-12-23 15:47:22 -05:00
foreman ef2a00d6ce P4 to Git Change 1223026 by skudchad@skudchad_test_win_opencl2 on 2015/12/22 18:08:38
SWDEV-77172 - IOMMUv2 changes for Windows 10
	- Enable DX interop for SVM

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9340/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#340 edit
2015-12-22 18:15:46 -05:00
foreman 26be053088 P4 to Git Change 1221147 by ashi1@ashi1_win50 on 2015/12/15 15:58:09
SWDEV-83467 - [SPIRV] Add support of SPIRV to CPU
	Modifying runtime and compile time to allow SPIRV binaries to run on CPU since it only runs on HSAIL GPU
	Added changes to allow conversion of CPU's llvmBinaryIsSpir boolean into compiler library's oclElfSections enum
	Cpuprogram.cpp's llvmBinaryIsSpir flag renamed to elfSectionType will now support LLVMIR, SPIR, and SPIRV
	Added SPIRV to compiler lib's elf as new oclElfSections enum
	cpuprogram.cpp changes also made to gpuprogram.cpp's NullProgram to allow compilation

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/elf/elf.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/elf/elf.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuprogram.cpp#69 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#191 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#266 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#152 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#217 edit
2015-12-15 16:09:35 -05:00
foreman 4df676dc36 P4 to Git Change 1221023 by yaxunl@yaxunl_stg_win50 on 2015/12/15 12:50:27
SWDEV-83705 - [SPIRV] Refactor aclUpdateRawFile as a static function called by SPIRVToModule.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#88 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/acl.h#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#216 edit
2015-12-15 13:07:52 -05:00
foreman f5acde74f7 P4 to Git Change 1220218 by yaxunl@yaxunl_stg_win50 on 2015/12/11 16:35:30
SWDEV-83705 - [SPIRV] Fix failure in SPIR-V conformance test compile_and_link.

	e_rawfile of elf needs to be updated after LLVM section is inserted into the elf containing SPIR-V.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#87 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/acl.h#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bifbase.cpp#54 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bifbase.hpp#24 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#215 edit
2015-12-11 16:46:21 -05:00
foreman 157116503e P4 to Git Change 1219975 by fdaniil@spb_fdaniil_amd_hsa_brigvar_test on 2015/12/11 08:44:02
SWDEV-77584 - enable cl_khr_int64_base_atomics extension in 32bit HSAIL compiler - it's already supported. the extension used in OpenMM library and FAHBench.

	significantly reduces perfomance drop in bug 11198 http://ocltc.amd.com/bugs/show_bug.cgi?id=11198

	http://ocltc.amd.com/reviews/r/9223/

	reviewer: German Andryeyev
	testing: smoke, precheckin

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#339 edit
2015-12-11 08:51:00 -05:00
foreman 9b2fc7a77e P4 to Git Change 1219652 by asalmanp@asalmanp-opencl-stg on 2015/12/10 14:20:26
SWDEV-82296 - [CQE OCL][2.0][HWSC][16.10]SDK Sample "AtomicCounters" 32/64bit failed with HWSC driver

	Disabling the cl_ext_atomic_counters_32 extension since there is no support for this extension on HSAIL and HWS.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9221/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#265 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#538 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#338 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsasettings.cpp#11 edit
2015-12-10 14:27:54 -05:00
foreman 05ac0307b7 P4 to Git Change 1217843 by gandryey@gera-w8 on 2015/12/04 16:33:36
SWDEV-79445 - Enable CL_AMD_FP16 extension and allow to force CL_KHR_FP16

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#264 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#337 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#247 edit
2015-12-04 16:43:59 -05:00
foreman 6b762d400f P4 to Git Change 1216440 by rili@rili_opencl_stg on 2015/12/01 17:23:55
SWDEV-82256 - Limit the workaround for Win 7 only because KMD has fixed TDR issue on Win 8.1/10

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#336 edit
2015-12-01 17:34:48 -05:00
foreman 217ef518c4 P4 to Git Change 1215524 by rili@rili_opencl_stg on 2015/11/27 16:00:18
SWDEV-82205 - Increased workloard to pass this test.
	                       - This is workaround because KMD don't have solution to fix TDR issue yet in 15.30.
	                       - This workaround including CL#1201765 should be reverted once KMD has a fix

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#335 edit
2015-11-27 16:07:10 -05:00
foreman f697264676 P4 to Git Change 1213547 by skudchad@skudchad_test_win_opencl2 on 2015/11/20 18:45:03
SWDEV-77172 - IOMMUv2 changes for Windows 10 (Part 2)
	- Fix SVM FGS malloc arguments being passed to kernel
	- Move setting allocation attributes to runtime
	- Fix some bugs from my previous checkins

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/9035/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#311 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#232 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.hpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#393 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#150 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#11 edit
2015-11-20 18:51:27 -05:00
foreman b03a0deeaa P4 to Git Change 1211599 by rili@rili_opencl_stg on 2015/11/16 14:38:36
SWDEV-78240 - Reduce workload for Stoney

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#334 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#392 edit
2015-11-16 14:45:37 -05:00
foreman 768a8e7e78 P4 to Git Change 1211287 by xcui@merged_opencl_jxcwin on 2015/11/13 19:19:52
SWDEV-77172 - temporary disable the FGS optimization for a failure on stg.

	code review:
	http://ocltc.amd.com/reviews/r/8992/
	precheckin:
	http://ocltc.amd.com:8111/viewModification.html?modId=62238&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#263 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#537 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#156 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#310 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#391 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/context.cpp#37 edit
2015-11-13 19:47:56 -05:00
foreman 8a7bac7048 P4 to Git Change 1211165 by gandryey@gera-ubuntu14 on 2015/11/13 14:36:37
SWDEV-78467 - OpenCL LiquidFlash feature
	- Add staging transfer support for invisible memory
	- Don't fallback to USWC memory if persistent allocation failed

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#262 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#231 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#390 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#137 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#73 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#80 edit
2015-11-13 15:09:15 -05:00
foreman 2abb8e1d6a P4 to Git Change 1210812 by emankov@em-hsa-amd on 2015/11/12 18:22:08
SWDEV-77584 - ORCA RT: Preparations for enabling HSAIL on OpenCL 1.2 by default. Fix for rare regressions with AMDIL on CI+, when forcing AMDIL by -legacy.

	[Testing] Pre-checkin:
	http://ocltc.amd.com:8111/viewModification.html?modId=62184&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

	[Reviewers] German Andryeyev, Brian Sumner
	http://ocltc.amd.com/reviews/r/8975

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#536 edit
2015-11-12 20:11:59 -05:00
foreman 64a7c5df25 P4 to Git Change 1210662 by rili@rili_opencl_stg on 2015/11/12 15:16:48
SWDEV-80061 - Copy flag HostMemoryDirectAccess from parent to view

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#124 edit
2015-11-12 15:31:34 -05:00
foreman 7237cf514f P4 to Git Change 1210047 by wchau@wchau_WINDOWS7_OCL on 2015/11/11 13:06:48
SWDEV-80450 - Fix the issue of app context reference count > 0 after app termination by using device context for the mapped buffer/image resource.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#155 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#123 edit
2015-11-11 13:14:27 -05:00
foreman eea9bc6733 P4 to Git Change 1209566 by smekhano@stas-nova-hsa on 2015/11/10 14:25:56
SWDEV-80874 - fixed out of bound access to the printf format string
	We do not really need two separate induction variables, pos and i, and we had a bug of not incrementing i as needed.
	The only reason it used to work is because all strings we used for testing ended with '\n'.
	The bug resulted in ignoring this '\n', but the code unconditionally adds '\n', so nobody noticed.
	If you try to print anything having any other escape, '\n' not at the end, or a colon, there will be assertion.

	That is fixed, and newline now is only added if last symbol in user's format was not newline, because otherwise
	we would now print 2 new lines. NB, I prefer to use bool variable rather then addressing last symbol of the string
	which could be empty.

	A side node, why do we run flex scanner past the last colon? If we do not we would not need this double encoding at all.

	Testing: smoke, precheckin, conformance printf with HSAIL forced, custom test
	Reviewed by German Andreev

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#309 edit
2015-11-10 14:33:05 -05:00
foreman 8c29379c3f P4 to Git Change 1209406 by emankov@em-hsa-amd on 2015/11/10 08:12:09
SWDEV-77584 - ORCA RT: Preparations for enabling HSAIL on OpenCL 1.2 by default. Fix regression by CL 1208929 in blit compilation.

	[Regression]
	CL 1208929 causes the following OpenCL Sanity Lnx/Win7/Win8.1 32-bit tests on CI+ GPU (Kalindi, Bonaire, Hawaii):
	BufferBandwidth, CplusplusWrapper, DynamicOpenCLDetection, ImageBandwidth, KernelLaunch
	crashes with:
	:2:..\..\..\gpucompiler.cpp:441: aclCompile failed :
	:1:..\..\..\gpudevice.cpp:955: Couldn't create blit kernels!

	[Testing]
	OCL SDK 2.9 (32bit/64bit), pre-checkin:
	http://ocltc.amd.com:8111/viewModification.html?modId=62012&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#535 edit
2015-11-10 08:17:59 -05:00
foreman 539fef47eb P4 to Git Change 1208929 by emankov@em-hsa-amd on 2015/11/09 10:49:06
SWDEV-77584 - ORCA RT: Preparations for enabling HSAIL on OpenCL 1.2 by default. Integrate new algorithm for device program choice.

	[Reasons]
	1. Make the switching change as less as possible.
	2. Give a chance to test HSA_foundation device work on OCL 1.2 beforehand (asked by Nikolay).

	Almost already reviewed:
	http://ocltc.amd.com/reviews/r/8850/

	Additionally:
	1. Linking logic was changed: if the target of one of the binaries is hsail-(64) linking goes through HSAIL, otherwise - through AMDIL. Previously -cl-std=CL2.0 in any of the linking binaries was a criterion for HSAIL, what will be wrong for HSAIL 1.2 after switching. -clang & -edg options are set now to distinguish the path while linking.
	2. -cl-std=CL2.0 as a criterion for HSAIL was returned back in isHSAILProgram() method; -clang & -edg options were also added as a criterion.

	[ToDo] After enabling HSAIL by default remove -cl-std, -clang & -edg checks from the code.

	[Testing] Pre-checkin
	http://ocltc.amd.com:8111/viewModification.html?modId=61929&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

	[Reviewers] German Andryeyev, Nikolay Haustov

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#279 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#93 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#261 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#534 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#154 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.cpp#47 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#76 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.hpp#38 edit
2015-11-09 10:56:13 -05:00
foreman 6f0457c510 P4 to Git Change 1208925 by wchau@wchau_WINDOWS7_OCL on 2015/11/09 10:34:19
SWDEV-79957 - use system memory to calculate the largest available memory size on Linux APU system.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#533 edit
2015-11-09 10:40:09 -05:00
foreman b0b6b55051 P4 to Git Change 1208618 by skudchad@skudchad_test_win_opencl2 on 2015/11/06 17:27:01
SWDEV-77172 - IOMMUv2 changes for Windows 10
	- Clear passing SVM flag from top level and fix GL interop on SVM
	- Add\Remove gpuvmOffset before WDDM calls as its added manually for SUA model

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/8914/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#230 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#25 edit
2015-11-06 17:35:18 -05:00
foreman e18cd1d76e P4 to Git Change 1208596 by smekhano@stas-rampitec-hsa on 2015/11/06 16:26:28
SWDEV-80874 - fixed staging buffer overflow with HSA printf

	Staging buffer is ~2 times smaller than allocated printf buffer, so if amount of data in printf buffer exceeds the size of the staging buffer
	we hit assertion in the memory copy. To hit the assertion that is enough to print 2 integers with 64K workitems.
	Added loop to read printf buffer into staging in portions.

	Testing: smoke, precheckin, conformance printf with HSAIL forced, custom tests
	Reviewed by German Andreev

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#41 edit
2015-11-06 16:37:38 -05:00
foreman 047f87bb4f P4 to Git Change 1208448 by smekhano@stas-rampitec-hsa on 2015/11/06 12:02:54
SWDEV-80874 - Fixed ORCA RT HSA printf buffer indexing issues

	The format of the buffer is: printf_id, <arg1>, <arg2>, ...
	The RT did not advance index for printf_id field, so for example for a format string "%d" we have been printing printf_id instead of actual argument for every other string.
	The other issue is that outputDbgBuffer is adjusting its last argument (idx) by the number of consumed DWORD values,
	but PrintfDbgHSA::output() is also ajusting dbgBufferPtr, so we had adjustment done twice, printing only half of the actual data and then printing zeroes from the buffer.
	The resolution for both is to always pass 1 as index to outputDbgBuffer(). 1 because 0 is printf_id.

	Testing: smoke, precheckin, conformance printf with HSAIL forced, custom tests
	Reviewed by Brian Sumner and German Andreev

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#40 edit
2015-11-06 12:20:00 -05:00
foreman b9fcb50bbc P4 to Git Change 1208096 by ashi1@ashi1_win50 on 2015/11/05 18:20:58
SWDEV-80864 - HSAIL Metadata Workgroup Size Hint and Vec Type Hint added to HSAIL Runtime

	Runtime changes required for the use of these two metadata:
	  - Runtime's gpukernel.cpp requires new aclQueries during HSAILKernel::Init
	    - One for quering WorkGroupSizeHint's array
	    - Two for size of VecTypeHint and fetching VecTypeHint's string
	  - initArgList needs to be moved to end of HSAILKernel::init to allow createSignature to get non empty values
	  - Compiler lib's workgroup hint (wsh) needs to match runtime's type (size_t)
	  - In Kernel constructor, instead of using memset which corrupts std::string, specifically set default workGroupInfo struct's variables

	Also fixed wavesPerSimdHint to use size_t to match runtime.
	Updated CLAssumptionCheck.cpp since aclMetadata structure was modified.

	Note: This is the runtime counterpart to submitted CL#1204512. (Post Review#8808, SWDEV-79695)

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/include/v0_8/aclStructs.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclStructs.h#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#260 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#308 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLAssumptionCheck.cpp#48 edit
2015-11-05 18:28:26 -05:00
foreman 3e4332a85f P4 to Git Change 1207850 by emankov@em-hsa-amd on 2015/11/05 06:54:22
SWDEV-77584 - ORCA RT: Preparations for enabling HSAIL on OpenCL 1.2 by default. Restore setBinary while linking GPU program.

	After CL 1205315 "Call aclLink always even for single library/binary." setting OCL Binary should work without setting library type for the program by setType(TYPE_LIBRARY).

	[Testing] pre check-in
	http://ocltc.amd.com:8111/viewModification.html?modId=61694&personal=true&buildTypeId=&tab=vcsModificationBuilds&show_all_builds=true

	[Reviewer] German Andryeyev
	http://ocltc.amd.com/reviews/r/8850

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#214 edit
2015-11-05 07:02:25 -05:00