Gráfico de commits

336 Commits

Autor SHA1 Mensaje Fecha
foreman 0293e30a65 P4 to Git Change 1104627 by gandryey@gera-dev-w7 on 2014/12/11 12:01:30
EPR #411058 - [CQE OCL][Lnx][QR][CZ]MultiDevice_Context fails in 2.0 conformance wimpyfull due to CL# 1101352
	- The detection of different map types is overcomplicated with possibility of multiple maps and multithreading environment. Thus keep USWC indirect map optimization based on the allocation flags.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#114 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#342 edit


[ROCm/clr commit: 593d1e3b8d]
2014-12-11 12:07:05 -05:00
foreman c27a1ebdd5 P4 to Git Change 1104508 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/11 03:00:09
EPR #010002 - Change OpenCL version number from 1701 to 1702.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1448 edit


[ROCm/clr commit: c722a0a2da]
2014-12-11 03:10:05 -05:00
foreman 48857846cd P4 to Git Change 1104077 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/10 03:00:10
EPR #010002 - Change OpenCL version number from 1700 to 1701.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1447 edit


[ROCm/clr commit: 68108a505c]
2014-12-10 03:10:40 -05:00
foreman a709ffc57e P4 to Git Change 1103884 by yaxunl@yaxunl_stg_win50 on 2014/12/09 15:30:25
EPR #405889 - Added option to set VGPR/SGPR/LDS usage in ISA to certain value greater than actual usage for debugging purpose. If the given value is smaller than actual value, this option has no effect.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#52 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scHWShaderInfo.h#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#121 edit


[ROCm/clr commit: 9f760b7bf0]
2014-12-09 16:10:21 -05:00
foreman a1b445747a P4 to Git Change 1103629 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/09 03:00:55
EPR #010002 - Change OpenCL version number from 1699 to 1700.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1446 edit


[ROCm/clr commit: ed3642807b]
2014-12-09 03:20:34 -05:00
foreman 38a2aa6461 P4 to Git Change 1103206 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/06 03:00:11
EPR #010002 - Change OpenCL version number from 1698 to 1699.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1445 edit


[ROCm/clr commit: f4addd58c9]
2014-12-06 03:08:32 -05:00
foreman 9571fcfc9b P4 to Git Change 1103193 by wchau@wchau_WINDOWS7_OCL on 2014/12/05 22:31:28
ECR #399840 - OpenCL Runtime HW Debug support development - APIs

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_context.cpp#40 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_debugger_amd.cpp#1 add
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_debugger_amd.h#1 add
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#173 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#235 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hwdebug.hpp#1 add


[ROCm/clr commit: 801fcec2fe]
2014-12-05 22:36:08 -05:00
foreman 100a1e5fd5 P4 to Git Change 1102960 by gandryey@gera-dev-w7 on 2014/12/05 10:51:47
EPR #410824 - [CQE OCL][CZ][S/G][QR] Two Bolt sample failing on CPU; Faulty CL: 1101352
	- The test performs double maps with different map flags. Optimization could choose different map schemes for each call and memory coherency could be broken. Add extra conditions to detect multiple maps and use the same  path as the first map.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#45 edit


[ROCm/clr commit: a1202e54be]
2014-12-05 11:07:57 -05:00
foreman 4ebfb6d910 P4 to Git Change 1102842 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/05 03:03:04
EPR #010002 - Change OpenCL version number from 1697 to 1698.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1444 edit


[ROCm/clr commit: 3444e16d99]
2014-12-05 03:15:11 -05:00
foreman 5aa6a01f52 P4 to Git Change 1102693 by jsjodin@jsjodin_linux_avx_hsa on 2014/12/04 17:11:26
ECR #304775 - Change the way SIGFPE is handled.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpucommand.hpp#38 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os_posix.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/os/os_win32.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/thread/thread.hpp#14 edit


[ROCm/clr commit: 024acc392e]
2014-12-04 17:19:31 -05:00
foreman 75a107518d P4 to Git Change 1102435 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/04 03:00:13
EPR #010002 - Change OpenCL version number from 1696 to 1697.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1443 edit


[ROCm/clr commit: 2d58fc417b]
2014-12-04 03:12:23 -05:00
foreman fba461ffe5 P4 to Git Change 1102328 by gandryey@gera-dev-w7 on 2014/12/03 18:56:06
EPR #410736 - [CQE OCL][ISV][QR][G] FFMPEG app generating corrupted video output; Faulty CL:1101352
	- Add detection for AHP allocation.
	 FFmpeg uses AHP allocations with CL_MAP_READ flag, but actually performs CPU write into the buffer. With indirect map runtime executes useless transfer on map and doesn't write updated memory on unmap, because a wrong flag sent by the app.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#113 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#341 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/perf/TestList.cpp#40 edit


[ROCm/clr commit: f9f5df731e]
2014-12-03 19:02:40 -05:00
foreman 0e671f4cb6 P4 to Git Change 1102071 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/03 03:00:15
EPR #010002 - Change OpenCL version number from 1695 to 1696.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1442 edit


[ROCm/clr commit: 750e1bf9bd]
2014-12-03 14:12:22 -05:00
foreman a5f74e1dbb P4 to Git Change 1101483 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/12/01 03:00:18
EPR #010002 - Change OpenCL version number from 1694 to 1695.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1441 edit


[ROCm/clr commit: 6824541acd]
2014-12-03 01:01:52 -05:00
foreman 332cebf3b8 P4 to Git Change 1101372 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/29 03:00:11
EPR #010002 - Change OpenCL version number from 1693 to 1694.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1440 edit


[ROCm/clr commit: ba8e6fefbe]
2014-11-29 03:11:48 -05:00
foreman c2a64b89f6 P4 to Git Change 1101359 by weizhang@msdnweizhang-opencl on 2014/11/28 18:31:24
EPR #397491 - Replace "switch" with "if" so that new ASIC id doesn't need to be added.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#103 edit


[ROCm/clr commit: afe1835f56]
2014-11-28 18:47:54 -05:00
foreman f43897e51e P4 to Git Change 1101352 by gandryey@gera-dev-w7 on 2014/11/28 18:03:18
ECR #304775 - Make optimization for read map of USWC memory
	- If runtime detects USWC map with read operation, then it will switch to indirect map. This should improve map-read  performance on APU(s)  when USWC memory is used instead of frame buffer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#72 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_svm.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#269 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#89 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#172 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#234 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#486 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#134 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#112 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.hpp#43 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#340 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.cpp#88 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.hpp#45 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsamemory.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsamemory.hpp#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsavirtual.cpp#98 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsamemory.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsamemory.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsavirtual.cpp#26 edit


[ROCm/clr commit: 6d464be252]
2014-11-28 18:11:36 -05:00
foreman de5e6ef163 P4 to Git Change 1101351 by skudchad@skudchad_test_win_opencl2 on 2014/11/28 17:32:17
EPR #403782 - IOMMU2/SVM
	- For finegrainsystem, the app can pass a malloced pointer directly to the kernel. Copy pointer directly to the aqlArgBuf without exiting.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/6378/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#269 edit


[ROCm/clr commit: 2ba0f2a112]
2014-11-28 17:45:30 -05:00
foreman 8dd771017b P4 to Git Change 1100847 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/27 03:00:11
EPR #010002 - Change OpenCL version number from 1692 to 1693.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1439 edit


[ROCm/clr commit: 7bf07ad054]
2014-11-27 03:12:03 -05:00
foreman 79182aae87 P4 to Git Change 1100525 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/26 03:00:11
EPR #010002 - Change OpenCL version number from 1691 to 1692.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1438 edit


[ROCm/clr commit: 9583bf4f36]
2014-11-26 03:12:16 -05:00
foreman aad43cf1dc P4 to Git Change 1100156 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/25 03:00:11
EPR #010002 - Change OpenCL version number from 1690 to 1691.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1437 edit


[ROCm/clr commit: e0f3106f52]
2014-11-25 03:14:10 -05:00
foreman 6c75a04b12 P4 to Git Change 1099900 by yaxunl@yaxunl_stg_win50 on 2014/11/24 13:59:14
EPR #409950 - [IV][OCL] Multiple OCL samples crashed on multiple machines for 32-bit OS.

	There are two issues:

	1. the SC dll should be dynamically loaded only when it is available. This is to allow apps to run on CPU device without the SC dll. This CL fixes it. It also allows user to use env var AMD_OCL_SC_LIB to provide the name or complete path of SC dll to load.

	2. The test fails because amdhsasc.dll is not included in base driver for 32 bit OS. The proper solution should be ask package team to include amdhsasc.dll in the base driver. Also amdhsasc.dll should be renamed amdoclsc.dll since it is not only used for HSAIL but also used by AMDIL. The benefit of separate SC component as a shared library is decreased build time since changes in SC does not require rebuild of amdocl.dll, and ease of debugging and regression analysis by allowing swapping SC comopnent.

	However since 15.10 branch is close, there is not enough time to make changes to package. Therefore this CL implements a workaround for this issue without change to the package.  We will implement the proper fix in the next relase.

	The workaround implemented by this CL embeds SC statically in amdocl.dll. The runtime loads SC dll specified by env var AMD_OCL_SC_LIB only if it is available. If the SC dll is not available, it will use the embeded SC.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/build/Makefile.api#96 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/aclLoaders.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/Makefile#44 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sclibdefs.opencl#20 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclStructs.h#13 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclTypes.h#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/build/Makefile.aoc2#21 edit
... //depot/stg/opencl/drivers/opencl/opencldefs#148 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#485 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#220 edit


[ROCm/clr commit: 16ebf68e43]
2014-11-24 14:08:05 -05:00
foreman db05ab434f P4 to Git Change 1099819 by asalmanp@asalmanp-opencl-stg1 on 2014/11/24 09:47:35
EPR #394115 - [OpenCL][AI BU] - Adding support for Ellesmere (gfx8)

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/6350/

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/devStateSI.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/devState.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scCompileBase.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/amuabi/amuABI.h#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/amuabi/amuABIMultiBinary.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#28 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#20 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#19 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_9/target_mappings.h#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_9/target_mappings_amdil.h#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_9/target_mappings_amdil64.h#24 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/AMDIL.h#107 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/AMDILSubtarget.cpp#205 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/Internal/AMDILInternalDeviceFlags.h#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/AMDIL/Internal/InternalProcessors.td#19 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#117 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#484 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#294 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/include/cal/cal.h#30 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#102 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFEnumCheck.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLTargetInfo.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/math/OCLMathFunc.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLDeviceQueries.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLOfflineCompilation.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLPerfCounters.cpp#20 edit


[ROCm/clr commit: 06d3b47141]
2014-11-24 09:53:40 -05:00
foreman ce86a04e40 P4 to Git Change 1099664 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/22 03:00:11
EPR #010002 - Change OpenCL version number from 1689 to 1690.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1436 edit


[ROCm/clr commit: 05afab8ccf]
2014-11-22 03:09:37 -05:00
foreman 1907dd1786 P4 to Git Change 1099390 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/21 03:00:12
EPR #010002 - Change OpenCL version number from 1688 to 1689.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1435 edit


[ROCm/clr commit: 48dbc6d01e]
2014-11-21 03:15:48 -05:00
foreman 8a53c7fcf6 P4 to Git Change 1099279 by gandryey@gera-dev-w7 on 2014/11/20 18:54:15
ECR #304775 - Properly report the new depth stencil formats

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#114 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.hpp#87 edit


[ROCm/clr commit: 0d4acaeeaf]
2014-11-20 19:05:14 -05:00
foreman 62985c95f3 P4 to Git Change 1098964 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/20 03:00:14
EPR #010002 - Change OpenCL version number from 1687 to 1688.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1434 edit


[ROCm/clr commit: 6d7aaf21a6]
2014-11-20 03:11:35 -05:00
foreman 8d023afdf3 P4 to Git Change 1098949 by emankov@em-hsa-amd on 2014/11/20 02:16:39
ECR #333753 - Compiler Lib/aoc2/devloader: move devloader functionality into aoc2

	[Purpose] To get rid of obsolete runtimenew dependency in compiler

	1.      Devloader functionality moved into aoc2;
	2.      Devloader is removed from the tree & make system;
	3.      Related changes in test_driver.pl;
	4.      Functions alignedMalloc & alignedFree are moved to libUtils.h;
	5.      Function aclHsaLoader is renamed to _aclHsaLoader to indicate that it is not a Compiler Lib API's function.

	[Testing] make smoke, pre check-in

	[Reviewers] Nikolay Haustov, Brian Sumner

Affected files ...

... //depot/stg/opencl/drivers/opencl/Makefile#48 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.def.in#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.map.in#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/amdoclcl.def.in#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/amdoclcl.map.in#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/brig_loader.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scClientAPI.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#13 edit
... //depot/stg/opencl/drivers/opencl/compiler/loader/devloader/Makefile#8 delete
... //depot/stg/opencl/drivers/opencl/compiler/loader/devloader/build/Makefile#3 delete
... //depot/stg/opencl/drivers/opencl/compiler/loader/devloader/build/Makefile.devloader#11 delete
... //depot/stg/opencl/drivers/opencl/compiler/loader/devloader/devloader.cpp#6 delete
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/build/Makefile.aoc2#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#185 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/bin/test_driver.pl#5 edit


[ROCm/clr commit: 6244599f99]
2014-11-20 02:27:54 -05:00
foreman 7e8ccfc3c5 P4 to Git Change 1098843 by mbareghe@mbareghe_staging_win30 on 2014/11/19 18:08:14
EPR #409798 - clCompileProgram and clLinkProgram regression for SPIR - set the correct IR type while extracting from binary (aclSPIR, aclLLVMIR) for single SPIR module for CPU.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuprogram.cpp#62 edit


[ROCm/clr commit: 57a45f9066]
2014-11-19 18:38:13 -05:00
foreman d75064b500 P4 to Git Change 1098569 by mbareghe@mbareghe_staging_win30 on 2014/11/19 10:23:56
EPR #409840 - [CQE OCL][LNX][QR] OpenCL SPIR Conf test "Compile_and_link" failed in all Asics due to CL#1098110 - Set the IR type to SPIR only for single SPIR modules.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#184 edit


[ROCm/clr commit: f324bf5f80]
2014-11-19 10:38:39 -05:00
foreman d599b91524 P4 to Git Change 1098567 by rili@rili_opencl_stg on 2014/11/19 10:20:42
EPR #407215 - re-submit CL#1098846
	                         Added checking for VM

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#483 edit


[ROCm/clr commit: 42627826ff]
2014-11-19 10:30:26 -05:00
foreman 5db3dece74 P4 to Git Change 1098439 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/19 03:00:12
EPR #010002 - Change OpenCL version number from 1686 to 1687.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1433 edit


[ROCm/clr commit: 5573f4f56b]
2014-11-19 03:34:04 -05:00
foreman a786643b56 P4 to Git Change 1098110 by mbareghe@mbareghe_staging_win30 on 2014/11/18 13:21:17
EPR #409798 - clCompileProgram and clLinkProgram regression for SPIR - set the correct IR type while extracting from binary (aclSPIR, aclLLVMIR)

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#183 edit


[ROCm/clr commit: d604c03916]
2014-11-18 13:28:05 -05:00
foreman a412e8e9ac P4 to Git Change 1097951 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/18 03:00:12
EPR #010002 - Change OpenCL version number from 1685 to 1686.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1432 edit


[ROCm/clr commit: 1b0c2439f2]
2014-11-18 03:15:14 -05:00
foreman f1b2db9a85 P4 to Git Change 1097741 by efinger@efinger_BDCW7-EFINGER on 2014/11/17 15:47:10
EPR #092474 - Fix OpenCL build

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#26 edit


[ROCm/clr commit: b12970cbe2]
2014-11-17 15:56:59 -05:00
foreman 51722b42f7 P4 to Git Change 1097349 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/15 03:00:12
EPR #010002 - Change OpenCL version number from 1684 to 1685.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1431 edit


[ROCm/clr commit: fca0dee30a]
2014-11-16 15:54:39 -05:00
foreman 172c34d207 P4 to Git Change 1097207 by mbareghe@mbareghe_staging_win30 on 2014/11/14 14:11:59
ECR #304775 - Replace Trivial Relational Functions

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#117 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/SPIR/AMDSPIRLoader.cpp#88 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/tools/llvm-link/llvm-link.cpp#50 edit
... //depot/stg/opencl/drivers/opencl/library/common/src/commonRelationals.cl#13 edit
... //depot/stg/opencl/drivers/opencl/library/gpu/gen/build/Makefile.gen#7 edit


[ROCm/clr commit: 60e766397d]
2014-11-14 14:22:12 -05:00
foreman f7d489a445 P4 to Git Change 1097200 by gandryey@gera-dev-w7 on 2014/11/14 13:59:46
ECR #304775 - Optimize oclBandwidthTest from nVidia SDK
	- Cache pinned memory, since the benchmark sends the same transfer in a single batch. Thus we could avoid pin/unpin
	- Swap SDMA engine allocation order. Blit manager allocates a queue on device, thus the first app queue was getting the paging second SDMA.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#112 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#339 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#121 edit


[ROCm/clr commit: dc8a3205ce]
2014-11-14 14:07:55 -05:00
foreman f80e78e32a P4 to Git Change 1097122 by gandryey@gera-dev-w7 on 2014/11/14 11:06:31
ECR #304775 - Rename GPU_MAX_ALLOC_PERCENT to GPU_SINGLE_ALLOC_PERCENT

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#482 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#219 edit


[ROCm/clr commit: 0fc8ceaa73]
2014-11-14 11:30:20 -05:00
foreman d7995f9960 P4 to Git Change 1097001 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/14 03:00:12
EPR #010002 - Change OpenCL version number from 1683 to 1684.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1430 edit


[ROCm/clr commit: ab2a9ee5fc]
2014-11-14 03:15:51 -05:00
foreman 0826cf70be P4 to Git Change 1096863 by rili@rili_opencl_stg on 2014/11/13 16:43:47
EPR #407215 - Revert CL#1096846

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#481 edit


[ROCm/clr commit: 83a6efb085]
2014-11-13 17:23:29 -05:00
foreman e73ae9c166 P4 to Git Change 1096846 by rili@rili_opencl_stg on 2014/11/13 16:23:44
EPR #407215 - reset host memory pointer of a image view based on original image
	                         ReviewBoard #6301

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#480 edit


[ROCm/clr commit: b3f320ce7d]
2014-11-13 16:56:11 -05:00
foreman 3bd8a66fad P4 to Git Change 1096552 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/13 03:00:12
EPR #010002 - Change OpenCL version number from 1682 to 1683.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1429 edit


[ROCm/clr commit: 3f1af9d6c4]
2014-11-13 03:12:25 -05:00
foreman 4074d9668f P4 to Git Change 1096144 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/12 03:00:11
EPR #010002 - Change OpenCL version number from 1681 to 1682.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1428 edit


[ROCm/clr commit: 5658b6a1b4]
2014-11-12 03:12:31 -05:00
foreman 91b247014f P4 to Git Change 1095935 by bwicakso@opencl-hsa-stg-bwicakso on 2014/11/11 16:00:10
ECR #333755 - Part 2- Update to foundation spec 1.0 20141019:
	- hsa_dispatch_packet_t now becomes hsa_kernel_dispatch_packet_t
	- all bit mask in a struct are removed and replaced by enums that indicates the bit position and width.

	Test: TC precheckin

	Review: Hari, Fan, Shucai, German, Yunjun.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#268 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#103 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#338 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsavirtual.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsavirtual.hpp#12 edit


[ROCm/clr commit: c7988f7209]
2014-11-11 16:27:13 -05:00
foreman a3cd8fcbb3 P4 to Git Change 1095644 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/11 03:00:13
EPR #010002 - Change OpenCL version number from 1680 to 1681.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1427 edit


[ROCm/clr commit: 381e955dbf]
2014-11-11 03:11:00 -05:00
foreman 3d866ee5df P4 to Git Change 1095445 by emankov@em-hsa-amd on 2014/11/10 13:48:06
ECR #333753 - hsa_foundation RT/Compiler Lib: recompilation algorithm rework

	1. Recompilation algorithm rework in order to avoid superfluous recompilations.
	2. Replace aclExtractSymbol/Section with aclQueryInfo for symbol/section detection.

	The replaced calls in RT previously performed actual extraction of the sections from the BIF with memory allocation and copying. But what is needed in fact is only to determine whether the section exists in BIF or not to make a further decision on needed recompilations. With aclQueryInfo and new added enums RT_CONTAINS_LLVMIR, RT_CONTAINS_OPTIONS, RT_CONTAINS_BRIG, RT_CONTAINS_HSAIL, RT_CONTAINS_ISA Runtime starts querying not the whole sections but the bool flag which indicates the existence of the corresponding section(s) without any memory allocations. Every compilation on RT starting from LLVMIR is affected by the change including compilation of blit kernels.

	3. Fix in Compiler Lib for correct ACL_INVALID_ARG detection (for wrong/unsupported compilations).

	[Side Effects] performance improvement, memory consumption reduction

	[ToDo] Do not finalize program if ISA is already provided in BIF and options are unchanged.

	[Testing] pre check-in, ocltst complib, ocl conformance 2.0 compiler & api

	[Reviewers] German Andryeyev, Brian Sumner

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsaprogram.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsaprogram.hpp#3 edit


[ROCm/clr commit: 9c4a22118e]
2014-11-10 13:55:17 -05:00
foreman b678aa1eb3 P4 to Git Change 1095169 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/11/08 03:00:12
EPR #010002 - Change OpenCL version number from 1679 to 1680.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1426 edit


[ROCm/clr commit: f858ed6336]
2014-11-08 03:08:12 -05:00
foreman c04dd4d6dc P4 to Git Change 1095130 by xcui@merged_opencl_jxcwin on 2014/11/07 18:09:55
EPR #408459 - changed the implementation of svmAlloc, so that the first device can create amd::Memory object, and the rest of devices only added gpu memory to it. This is part of changes for mgpu support for svmalloc
	code review:
	http://ocltc.amd.com/reviews/r/6245/
	precheckin testing results:
	http://ocltc.amd.com:8111/viewModification.html?modId=43136&personal=true&buildTypeId=&tab=vcsModificationTests

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#88 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#233 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#479 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#133 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.cpp#87 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.hpp#44 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/context.cpp#34 edit


[ROCm/clr commit: efbedb25be]
2014-11-07 18:20:08 -05:00
foreman f02e1e3ed8 P4 to Git Change 1095076 by rili@rili_opencl_stg on 2014/11/07 16:25:17
EPR #408185 - Use pinned memory if directaccess is true and remoteAlloc is used.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#478 edit


[ROCm/clr commit: c24b46e708]
2014-11-07 16:37:32 -05:00