Граф коммитов

180 Коммитов

Автор SHA1 Сообщение Дата
foreman 239faab75e P4 to Git Change 1079952 by yaxunl@yaxunl_stg_win50 on 2014/09/23 12:31:16
ECR #377625 - Workaround for Blender performance issue. Lower available VGPRs to improve waves per CU.

	Added BuildOptsAppend to OCL app profile.
	Read BuildOptsAppend and append to build options.
	Added specific wave optimization option for Blender.

Affected files ...

... //depot/stg/opencl/drivers/opencl/appprofiles/oclappprofile.xml#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#45 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#116 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/appprofile.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/appprofile.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#170 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#230 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#63 edit


[ROCm/clr commit: 16f8ca9aae]
2014-09-23 12:44:50 -04:00
foreman e023435e44 P4 to Git Change 1079692 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/23 03:00:11
EPR #010002 - Change OpenCL version number from 1641 to 1642.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1388 edit


[ROCm/clr commit: 5ef908c5e7]
2014-09-23 03:05:33 -04:00
foreman 065fd02d4d P4 to Git Change 1079573 by bdhanase@bala_workpc_ocl on 2014/09/22 18:26:26
EPR #405824 - set apusystem_ to 1 for CZ

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#282 edit


[ROCm/clr commit: 5c6eb9d62e]
2014-09-22 18:33:46 -04:00
foreman a56fe8c8a2 P4 to Git Change 1079546 by bdhanase@bala_workpc_ocl on 2014/09/22 17:41:33
EPR #405824 - set apusystem_ to 1 for CZ

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#281 edit


[ROCm/clr commit: 3e550e63ef]
2014-09-22 17:47:58 -04:00
foreman 1983649921 P4 to Git Change 1079525 by bwicakso@opencl-hsa-stg-bwicakso on 2014/09/22 16:53:56
ECR #333755 - Disable image support on granite

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.cpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsasettings.cpp#35 edit


[ROCm/clr commit: a9f1a1f7da]
2014-09-22 17:11:17 -04:00
foreman 77a950b631 P4 to Git Change 1079216 by emankov@em-hsa-amd on 2014/09/22 08:18:24
ECR #333753 - Compiler Lib/RT: Metadata related code refactor, annotation, minor fixes & additional checks

	+ refactor if_aclQueryInfo() in order to simplify code and to avoid direct usage of aclMetadata struct members types
	+ annotation on why we need to use deserializeCLMetadata on "serialized" (to NULL) pointers
	+ erroneously forgotten RT_KERNEL_NAME was added to aclQueryType enum
	+ OCLRTGetInfo, CLEnumCheck tests from ocltst oclcomplib was updated to use RT_KERNEL_NAME
	+ testing of printf is added to OCLRTGetInfo
	+ minor fixes and additional checks

	tests: pre check-in, ocltst -m oclcomplib

	Reviewers: Artem Tamazov, Brian Sumner, German Andryeyev

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#49 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclEnums.h#12 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclStructs.h#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#265 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLEnumCheck.cpp#36 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/clSourceShaders.h#5 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/oclrtGetInfo.cpp#14 edit


[ROCm/clr commit: d50fa706e3]
2014-09-22 08:24:39 -04:00
foreman 39623a8c5f P4 to Git Change 1079052 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/21 03:00:12
EPR #010002 - Change OpenCL version number from 1640 to 1641.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1387 edit


[ROCm/clr commit: 9f99843ca0]
2014-09-21 03:06:13 -04:00
foreman b8e25d60fe P4 to Git Change 1079030 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/20 03:00:11
EPR #010002 - Change OpenCL version number from 1639 to 1640.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1386 edit


[ROCm/clr commit: 5df649cb7c]
2014-09-20 03:03:54 -04:00
foreman dfc011e789 P4 to Git Change 1078881 by gandryey@gera-dev-w7 on 2014/09/19 15:04:45
ECR #304775 - Keep CB warm-up with CP write for SI+ only

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#193 edit


[ROCm/clr commit: 9ed2f459e6]
2014-09-19 15:22:00 -04:00
foreman 2336e8d5bc P4 to Git Change 1078871 by gandryey@gera-dev-w7 on 2014/09/19 14:45:35
ECR #304775 - Add extra CP write operation for the resource warm-up
	- Vidmm will page in the constant buffers before the actual usage

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#192 edit


[ROCm/clr commit: 2e23538a01]
2014-09-19 14:59:34 -04:00
foreman 4dc4a5238f P4 to Git Change 1078639 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/19 03:00:12
EPR #010002 - Change OpenCL version number from 1638 to 1639.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1385 edit


[ROCm/clr commit: 52fa4fec8a]
2014-09-19 03:12:33 -04:00
foreman 38ad781d31 P4 to Git Change 1078478 by xcui@merged_opencl_jxcwin on 2014/09/18 17:18:02
EPR #397491 - disable OpenCL 2.0 for mainline when there are multiple devices in the system, because svm test will fail even test on the first device

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#462 edit


[ROCm/clr commit: e7b10515af]
2014-09-18 18:15:20 -04:00
foreman 61c96486bb P4 to Git Change 1078307 by mbareghe@mbareghe_staging_win30 on 2014/09/18 13:54:53
ECR #304775 - clp re-implementation - Back out changelist 1077873 to investigate smoke_clang failures

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/clc/clpSrc/build/Makefile.clp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/clc/clpSrc/clpVectorExpansion.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#112 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/commonConversions.cl#15 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/build/Makefile.hsail#34 edit
... //depot/stg/opencl/drivers/opencl/library/x86/gen/build/Makefile.gen#15 edit


[ROCm/clr commit: 09c85802ff]
2014-09-18 14:03:26 -04:00
foreman c48a55626f P4 to Git Change 1078099 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/18 03:00:11
EPR #010002 - Change OpenCL version number from 1637 to 1638.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1384 edit


[ROCm/clr commit: ddf76db1d4]
2014-09-18 03:11:33 -04:00
foreman 10a5371f59 P4 to Git Change 1078026 by yaxunl@yaxunl_stg_win50 on 2014/09/17 21:51:23
EPR #402000 - Add option to control SC reg alloc strategy.

	[CQE OCL][Perf][QR] ~6-7% perf drop in CompuCL Benchmark (Graphics: T-Rex subtest).

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scStateSI.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#115 edit


[ROCm/clr commit: 478f10bc24]
2014-09-17 21:58:55 -04:00
foreman c2bdf9f6c8 P4 to Git Change 1077873 by mbareghe@mbareghe_staging_win30 on 2014/09/17 13:47:49
ECR #304775 - clp re-implementation - refactoring and generalization of clpVectorExpansion to work on both edg and clang

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/clc/clpSrc/build/Makefile.clp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/clc/clpSrc/clpVectorExpansion.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#111 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/commonConversions.cl#14 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/build/Makefile.hsail#33 edit
... //depot/stg/opencl/drivers/opencl/library/x86/gen/build/Makefile.gen#14 edit


[ROCm/clr commit: 4099ccc281]
2014-09-17 13:58:52 -04:00
foreman 70f74f658c P4 to Git Change 1077715 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/17 03:00:15
EPR #010002 - Change OpenCL version number from 1636 to 1637.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1383 edit


[ROCm/clr commit: 0ae51a1467]
2014-09-17 03:25:51 -04:00
foreman b3a2e596d3 P4 to Git Change 1077642 by yaxunl@yaxunl_stg_win50 on 2014/09/16 23:00:53
EPR #402000 - [CQE OCL][Perf][QR] ~6-7% perf drop in CompuCL Benchmark (Graphics: T-Rex subtest).

	Add option to disable SC merge memory loads and stores. By default it is disabled. Will decide whether to enable it by default after performance runs.
	cherrypick 1076590 and CL#1077419 from sc stg for adding option in sc.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/Interface/SCCommon.h#42 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/Src/CompilerBase.cpp#51 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/Src/CompilerBase.hpp#35 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/Src/HwUtils.cpp#36 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scState.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#114 edit


[ROCm/clr commit: a49ebf6f6e]
2014-09-16 23:14:51 -04:00
foreman 97365d23c4 P4 to Git Change 1077444 by gandryey@gera-dev-w7 on 2014/09/16 14:31:35
ECR #304775 - Add capability to enable large allocations >4GB
	- Update the blit kernels to consider a buffer size >4GB

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/blitcl.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#110 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#280 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsablit.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#214 edit


[ROCm/clr commit: b672b6c4da]
2014-09-16 14:43:17 -04:00
foreman 942888ab4a P4 to Git Change 1077370 by emankov@em-hsa-amd on 2014/09/16 12:02:36
ECR #333753 - HSA RT: avoiding superfluous recompilations on ORCA RT/HSA path

	Next compilation stage determination based on binary sections and options (while linkImpl).

	If current HSAILProgram options are equal to binary’s ones:
	- Do not generate BRIG if BRIG sections are already presented in binary.
	- Do not finalize BRIG->ISA if ISA is already presented in binary.
	- Perform only CG phase if HSAIL is absent in binary.

	Always perform only brig loading (even in case of ISA presented).

	Testing: pre check-in, compile & basic ocl conformance 2.0 tests

	Reviewer: German Andryeyev

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#150 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#264 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#101 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#177 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#55 edit


[ROCm/clr commit: ff7ab4a0b2]
2014-09-16 12:21:42 -04:00
foreman 8c5bc4a9ca P4 to Git Change 1077336 by emankov@em-hsa-amd on 2014/09/16 11:13:20
ECR #333753 - Compiler Lib/RT: Fix RT_DEVICE_ENQUEUE & RT_KERNEL_INDEX querying

	tests: pre check-in, compiler ocl conformance tests

	Reviewer: German Andryeyev, Vinay Madhusudan

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#263 edit


[ROCm/clr commit: 09fb6c961b]
2014-09-16 11:54:47 -04:00
foreman 2fe29882fa P4 to Git Change 1077270 by emankov@em-hsa-amd on 2014/09/16 09:26:09
ECR #333753 - Compiler Lib: Bug fix in string options comparison

	Tests: smoke, precheck-in

	Reviewer: Brian Sumner, Artem Tamazov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/options.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/options.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#149 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#176 edit


[ROCm/clr commit: 9c02c73c2f]
2014-09-16 09:39:53 -04:00
foreman 6f9b379fd7 P4 to Git Change 1077181 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/16 03:00:11
EPR #010002 - Change OpenCL version number from 1635 to 1636.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1382 edit


[ROCm/clr commit: 44b425a7a5]
2014-09-16 03:09:12 -04:00
foreman 0774d9ddbb P4 to Git Change 1076975 by jgolds@jgolds_MSDNfractal5_opencl_stg on 2014/09/15 13:22:38
ECR #304775 - Update resource cache behavior

	Currently, the resource cache is fixed at 64MB regardless of available video memory size. Changed the logic to use max(1/8th video memory, 64MB). This is still overrideable with the env. var. GPU_RESOURCE_CACHE_SIZE.

	Improvements with changes: 18% decrease in video chat face detect time on 95w Kaveri (no change in PCMark8 score as we already achieved 30fps), 14% improvement on 19w Kaveri (this does result in an improvement in PCMark8 since the APU is slower).

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#279 edit


[ROCm/clr commit: fc2687df3a]
2014-09-15 13:28:54 -04:00
foreman 512752e821 P4 to Git Change 1076784 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/15 03:00:16
EPR #010002 - Change OpenCL version number from 1634 to 1635.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1381 edit


[ROCm/clr commit: fe0dedc497]
2014-09-15 03:12:34 -04:00
foreman ab4b371960 P4 to Git Change 1076725 by bdhanase@bala_workpc_ocl on 2014/09/14 23:52:04
EPR #399601 - Report CZ as a 2.0 device

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#278 edit


[ROCm/clr commit: 93b88902dc]
2014-09-14 23:58:52 -04:00
foreman 6b9155cf5a P4 to Git Change 1076634 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/13 03:00:12
EPR #010002 - Change OpenCL version number from 1633 to 1634.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1380 edit


[ROCm/clr commit: 585664f084]
2014-09-13 03:08:06 -04:00
foreman fb6d8c3c1c P4 to Git Change 1076600 by gandryey@gera-dev-w7 on 2014/09/12 18:30:59
ECR #304775 - Correct OPENCL_VERSION key check for staging

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#461 edit


[ROCm/clr commit: f33c12010a]
2014-09-12 18:36:05 -04:00
foreman 8f12eae36d P4 to Git Change 1076493 by yaxunl@yaxunl_stg_win50 on 2014/09/12 14:45:24
ECR #377625 - AMDIL Function support: Calculate total private memory usage by a kernel including memory used by called functions.

	This cannot be done by IPA since stack size is known only after register allocation due to potential register spill, but MachineFunctionAnalysis cannot persist after CGSCC pass with current LLVM version.

	This change adds private memory usage metadata for non-kernel functions. The total private memory usage by a kernel is calculated when AMDIL is split for different kernels. BIF will contain total private memory size.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/amdilUtils.cpp#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/amdilUtils.hpp#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/AMDILKernelManager.cpp#451 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/AMDILKernelManager.h#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#175 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#54 edit


[ROCm/clr commit: 42f4b2af97]
2014-09-12 15:02:32 -04:00
foreman b449a9d680 P4 to Git Change 1076356 by smekhano@rampitec-HSA on 2014/09/12 09:35:57
ECR #333753 - HSA HLC: remove code changing the default filetype which is set by an external tool such as llc
	Effectively llc will produce text hsail file by default as a standard llc behaviour. Use -filetype=obj to obtain brig.
	Note, test_driver.pl is already patched to preserve old behaviour.

	Testing: smoke, smoke_clang, precheckin
	Reviewed by Nikolay Haustov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/codegen.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILTargetMachine.cpp#33 edit


[ROCm/clr commit: fea6100aa9]
2014-09-12 09:42:39 -04:00
foreman 9b14cee39c P4 to Git Change 1076270 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/12 03:00:13
EPR #010002 - Change OpenCL version number from 1632 to 1633.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1379 edit


[ROCm/clr commit: 6fddbe6449]
2014-09-12 03:09:18 -04:00
foreman e4a0212bbe P4 to Git Change 1076147 by rili@rili_opencl_stg on 2014/09/11 17:44:11
EPR #405753 - Fixed incorrect value of slicePitch returned from clEnqueueMapimage for 1Darray.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#111 edit


[ROCm/clr commit: edb288692d]
2014-09-11 18:10:22 -04:00
foreman b0868e0475 P4 to Git Change 1075901 by gandryey@gera-dev-w7 on 2014/09/11 11:15:25
ECR #304775 - Make sure runtime detects USWC memory for subbuffers

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#110 edit


[ROCm/clr commit: c4c488b53f]
2014-09-11 11:32:24 -04:00
foreman ef43050562 P4 to Git Change 1075755 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/11 03:00:11
EPR #010002 - Change OpenCL version number from 1631 to 1632.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1378 edit


[ROCm/clr commit: 93ee6bb034]
2014-09-11 03:25:34 -04:00
foreman 7cafd57a3f P4 to Git Change 1075489 by marsenau@marsenau_ma on 2014/09/10 15:05:23
EPR #405194 - Change unroll threshold to LLVM default to partially work around Linpack performance problem.

	Prior to CL 1058428, which increased the unroll threshold to 200, this was only 100 which is lower than the LLVM default. Linpack's new ISA has increased register usage, but decreasing the unroll threshold to the previous level does not reduce the register count to its previous level. The increased register usage is probably a new SC problem, so this should probably be increased again in the future. There is no change in register usage with 100 vs. 150 on Linpack.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#113 edit


[ROCm/clr commit: d5f7502ca8]
2014-09-10 15:52:30 -04:00
foreman a0506d2387 P4 to Git Change 1075485 by marsenau@marsenau_ma on 2014/09/10 15:00:38
ECR #304775 - Remove _ in hsail_64 triple enum name. It isn't consistent with itself, or most other targets. The string form is already "hsail64", but the target name is sometimes "hsail-64". Does not remove the - in "hsail-64" for the target name since users could be depending on that, although that should also be fixed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/codegen.cpp#57 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#110 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/ADT/Triple.h#36 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/AMDLLVMContextHook.h#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/MC/MCObjectFileInfo.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Support/Triple.cpp#47 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/TargetInfo/HSAILTargetInfo.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/SPIR/AMDSPIRLoader.cpp#82 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/Scalar/AMDLowerAtomics.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/Utils/AMDUtils.cpp#2 edit


[ROCm/clr commit: c02dacedb2]
2014-09-10 15:30:51 -04:00
foreman 525f1b5360 P4 to Git Change 1075455 by shxiao@profilerworksp on 2014/09/10 14:12:27
EPR #403493 - Block index error for CI and VI, OCL code change

	Problem description: The OCL implementation requires HSA to used different block index values for CI and VI. However, the same index value is used for the same counter block in both CI and VI, which in turn caused a segmentation fault.

	Root cause: HSA implementation does not know this situation before hand.

	Solution: Fix to use different counter block index in CI from that in VI

	Functional area: HSA perf counter implementation...

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsacounters.cpp#4 edit


[ROCm/clr commit: f517eefd51]
2014-09-10 15:04:08 -04:00
foreman 7c5ee80511 P4 to Git Change 1075322 by smekhano@rampitec-HSA on 2014/09/10 11:07:04
ECR #333756 - HSA Finalizer: added runtime option to force buffer instructions for global access
	This can be used under ORCA RT.

	Testing: smoke, smoke_clang, precheckin, clbas dgemm
	Reviewed by Nikolay Haustov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scStateSI.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#112 edit


[ROCm/clr commit: d35be99f01]
2014-09-10 11:25:14 -04:00
foreman e0157cd07a P4 to Git Change 1075183 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/10 03:00:11
EPR #010002 - Change OpenCL version number from 1630 to 1631.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1377 edit


[ROCm/clr commit: 612ed6149b]
2014-09-10 03:13:37 -04:00
foreman add534ffcf P4 to Git Change 1075101 by yaxunl@yaxunl_stg_win50 on 2014/09/09 21:48:45
ECR #304775 - Modify Makefiles to let TC SC_PRM use the same SC component for AMDIL and HSAIL path.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/build/Makefile.api#91 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/aclLoaders.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_9/aclLoaders.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/Makefile#43 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scCompileBase.cpp#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/build/Makefile.complib#72 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/complibdefs#38 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/llvmdefs#35 edit
... //depot/stg/opencl/drivers/opencl/opencldefs#142 edit


[ROCm/clr commit: 4f6f6aadc0]
2014-09-09 21:57:54 -04:00
foreman c48e0b854b P4 to Git Change 1075041 by gandryey@gera-dev-w7 on 2014/09/09 18:36:42
ECR #304775 - Device enqueue
	- Update the template header

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#29 edit


[ROCm/clr commit: 9c6c074122]
2014-09-09 19:12:37 -04:00
foreman bc421db094 P4 to Git Change 1074863 by xcui@merged_opencl_jxcwin on 2014/09/09 13:34:10
EPR #397491 - only using OpenCL 1.2 onr multiple GPUs on mainline by default.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#460 edit


[ROCm/clr commit: f2c0eaf67d]
2014-09-09 14:05:07 -04:00
foreman fd758b29de P4 to Git Change 1074626 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/09 03:00:12
EPR #010002 - Change OpenCL version number from 1629 to 1630.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1376 edit


[ROCm/clr commit: 8751996be9]
2014-09-09 03:11:00 -04:00
foreman 6bcb543bce P4 to Git Change 1074473 by rili@rili_opencl_stg on 2014/09/08 16:30:03
EPR #400016 - Keep the path of temp folder if the app is WIndows app

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/os/os_win32.cpp#39 edit


[ROCm/clr commit: 8f5b43ffd1]
2014-09-08 16:44:37 -04:00
foreman fb71ff26ca P4 to Git Change 1074167 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/06 03:00:11
EPR #010002 - Change OpenCL version number from 1628 to 1629.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1375 edit


[ROCm/clr commit: c777c3e198]
2014-09-06 03:11:04 -04:00
foreman 3623685db8 P4 to Git Change 1074077 by jatang@jatang-ocl-lnx1 on 2014/09/05 16:58:30
EPR #405458 - clinfo segfaults when ENABLE_CAL_SHUTDOWN=1.

	For the global variables of:

	std::map <std::string, int> OptionNameMap[2];
	std::map <std::string, int> NoneSeparatorOptionMap[2];
	std::map <std::string, int> FOptionMap;
	std::map <std::string, int> MOptionMap;

	We don't need to call the clear() method explicitly, since the std::map destructor will clean things up (valgrind mem-check doesn't report any leak related to these global variables after this change). Besides, on Linux amd::option::teardown() is called after the global variables' destructors are called, and it will cause segfault.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/options.cpp#29 edit


[ROCm/clr commit: b9e695d254]
2014-09-05 20:58:50 -04:00
foreman 6026f2550c P4 to Git Change 1073720 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/05 03:00:12
EPR #010002 - Change OpenCL version number from 1627 to 1628.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1374 edit


[ROCm/clr commit: 97c9f5611c]
2014-09-05 05:01:16 -04:00
foreman 704d424a7f P4 to Git Change 1073180 by nhaustov@nhaustov_hsa on 2014/09/04 03:54:52
ECR #333756 - Enable HSAIL 0.95+ images in Orca Runtime, HLC and Finalizer.

	Testing: pre-checkin, image tests on Granite and Obsidian fail as expected.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc-hsa/HSAIL/hsail-fin/HSAILCodeGen.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/CodeGen/CodeGen.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/BRIGAsmPrinter.cpp#78 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILISelDAGToDAG.cpp#29 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILISelLowering.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILImages.td#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILPropImageOps.cpp#4 delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILTargetMachine.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#213 edit


[ROCm/clr commit: 43cb9c9fc1]
2014-09-04 04:10:00 -04:00
foreman f245362986 P4 to Git Change 1073164 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/04 03:00:11
EPR #010002 - Change OpenCL version number from 1626 to 1627.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1373 edit


[ROCm/clr commit: 126e8c33e1]
2014-09-04 03:07:54 -04:00
foreman 1d54653e39 P4 to Git Change 1072691 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/09/03 03:00:11
EPR #010002 - Change OpenCL version number from 1625 to 1626.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1372 edit


[ROCm/clr commit: 7bd8bf4f9c]
2014-09-03 03:14:26 -04:00