76333 Commits

Author SHA1 Message Date
foreman a028d44e88 P4 to Git Change 1061395 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/31 03:00:11
EPR #010002 - Change OpenCL version number from 1600 to 1601.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1347 edit
2014-07-31 03:12:54 -04:00
foreman 49d1896484 P4 to Git Change 1061376 by ssahasra@ssahasra_opencl_windows on 2014/07/31 02:22:23
ECR #333753 - linker: rely on builtins library triple instead of elf binary

	This is the first of multiple changes aimed at unifying the
	offline linker (llvm-link) with the online linker in the compiler
	library. The online linker is considered state-of-the-art and the
	code there needs to be made available to the offline linker.

	This change teaches the online linker to determine the target by
	examining the target triple on the builtins library modules,
	instead of checking the elf binary target. The assumption is that
	the builtins library always matches the actual target as
	confirmed in CL 1041226. This removes one dependence of compiler
	library functions so that the affected code can eventually be
	moved to llvm/lib and shared with the offline linker.

	The change passes smoke_clang (Orca build), smoke (HSA build) and Teamcity pre-checkin.

	Reviewed by Brian Sumner, Yaxun Liu, Stanislav Mekhanoshin

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#107 edit


[ROCm/clr commit: 4396288d55]
2014-07-31 02:31:05 -04:00
foreman 4396288d55 P4 to Git Change 1061376 by ssahasra@ssahasra_opencl_windows on 2014/07/31 02:22:23
ECR #333753 - linker: rely on builtins library triple instead of elf binary

	This is the first of multiple changes aimed at unifying the
	offline linker (llvm-link) with the online linker in the compiler
	library. The online linker is considered state-of-the-art and the
	code there needs to be made available to the offline linker.

	This change teaches the online linker to determine the target by
	examining the target triple on the builtins library modules,
	instead of checking the elf binary target. The assumption is that
	the builtins library always matches the actual target as
	confirmed in CL 1041226. This removes one dependence of compiler
	library functions so that the affected code can eventually be
	moved to llvm/lib and shared with the offline linker.

	The change passes smoke_clang (Orca build), smoke (HSA build) and Teamcity pre-checkin.

	Reviewed by Brian Sumner, Yaxun Liu, Stanislav Mekhanoshin

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#107 edit
2014-07-31 02:31:05 -04:00
foreman 8e8b28279d P4 to Git Change 1061045 by weizhang@msdnweizhang-opencl on 2014/07/30 12:01:56
ECR #392041 - Implement high performance state on Linux

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.cpp#7 edit


[ROCm/clr commit: 7be05e924e]
2014-07-30 12:27:50 -04:00
foreman 7be05e924e P4 to Git Change 1061045 by weizhang@msdnweizhang-opencl on 2014/07/30 12:01:56
ECR #392041 - Implement high performance state on Linux

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuappprofile.cpp#7 edit
2014-07-30 12:27:50 -04:00
foreman 30bd2100aa P4 to Git Change 1060846 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/30 03:00:11
EPR #010002 - Change OpenCL version number from 1599 to 1600.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1346 edit


[ROCm/clr commit: e5523be947]
2014-07-30 03:15:05 -04:00
foreman e5523be947 P4 to Git Change 1060846 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/30 03:00:11
EPR #010002 - Change OpenCL version number from 1599 to 1600.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1346 edit
2014-07-30 03:15:05 -04:00
foreman 805d1e4465 P4 to Git Change 1060466 by skudchad@skudchad_test_win_opencl2 on 2014/07/29 13:45:27
ECR #304775 - Use accelerated copy path for read/writeRect if the host memory has offsets. This avoids re-pinning the memory giving nearly a 100% perf boost for such copies.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/5371/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#328 edit


[ROCm/clr commit: 0758f1e95b]
2014-07-29 13:52:27 -04:00
foreman 0758f1e95b P4 to Git Change 1060466 by skudchad@skudchad_test_win_opencl2 on 2014/07/29 13:45:27
ECR #304775 - Use accelerated copy path for read/writeRect if the host memory has offsets. This avoids re-pinning the memory giving nearly a 100% perf boost for such copies.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/5371/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#328 edit
2014-07-29 13:52:27 -04:00
foreman 8fc6a2fa8c P4 to Git Change 1060225 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/29 03:00:12
EPR #010002 - Change OpenCL version number from 1598 to 1599.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1345 edit


[ROCm/clr commit: d42ad806ad]
2014-07-29 03:11:45 -04:00
foreman d42ad806ad P4 to Git Change 1060225 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/29 03:00:12
EPR #010002 - Change OpenCL version number from 1598 to 1599.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1345 edit
2014-07-29 03:11:45 -04:00
foreman ecb5c50de5 P4 to Git Change 1059594 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/26 03:00:12
EPR #010002 - Change OpenCL version number from 1597 to 1598.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1344 edit


[ROCm/clr commit: 0ac5d305af]
2014-07-26 03:09:24 -04:00
foreman 0ac5d305af P4 to Git Change 1059594 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/26 03:00:12
EPR #010002 - Change OpenCL version number from 1597 to 1598.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1344 edit
2014-07-26 03:09:24 -04:00
foreman f8dc564915 P4 to Git Change 1059564 by gandryey@gera-dev-w7 on 2014/07/25 18:14:33
ECR #304775 - Device enqueuing
	- Run the scheduler in the host queue by default.
	- GPU_USE_DEVICE_QUEUE=1 can force execution in the device queue

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#451 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#260 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#273 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#327 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#119 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#208 edit


[ROCm/clr commit: 3a4400135d]
2014-07-25 20:41:05 -04:00
foreman 3a4400135d P4 to Git Change 1059564 by gandryey@gera-dev-w7 on 2014/07/25 18:14:33
ECR #304775 - Device enqueuing
	- Run the scheduler in the host queue by default.
	- GPU_USE_DEVICE_QUEUE=1 can force execution in the device queue

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#451 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#260 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#273 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#327 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#119 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#208 edit
2014-07-25 20:41:05 -04:00
foreman 9aca4fedc5 P4 to Git Change 1059226 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/25 03:00:11
EPR #010002 - Change OpenCL version number from 1596 to 1597.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1343 edit


[ROCm/clr commit: d12b4d2364]
2014-07-25 03:17:59 -04:00
foreman d12b4d2364 P4 to Git Change 1059226 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/25 03:00:11
EPR #010002 - Change OpenCL version number from 1596 to 1597.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1343 edit
2014-07-25 03:17:59 -04:00
foreman 1ff4bbda6a P4 to Git Change 1058915 by rili@rili_opencl_stg on 2014/07/24 12:24:49
EPR #399808 - Fix the value of HSA image channel order for CL_RGB

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#188 edit


[ROCm/clr commit: 5f93384dbc]
2014-07-24 12:45:47 -04:00
foreman 5f93384dbc P4 to Git Change 1058915 by rili@rili_opencl_stg on 2014/07/24 12:24:49
EPR #399808 - Fix the value of HSA image channel order for CL_RGB

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#188 edit
2014-07-24 12:45:47 -04:00
foreman 6e04cbf3c2 P4 to Git Change 1058675 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/24 03:00:13
EPR #010002 - Change OpenCL version number from 1595 to 1596.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1342 edit


[ROCm/clr commit: 8431455a87]
2014-07-24 03:12:45 -04:00
foreman 8431455a87 P4 to Git Change 1058675 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/24 03:00:13
EPR #010002 - Change OpenCL version number from 1595 to 1596.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1342 edit
2014-07-24 03:12:45 -04:00
foreman aa79c6851f P4 to Git Change 1058650 by shivara@shivaram_llvmmerge_dt on 2014/07/24 01:26:41
ECR #397495 - Adding macros which are missing in clang path.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/frontend_clang.cpp#16 edit


[ROCm/clr commit: cde1570514]
2014-07-24 01:39:00 -04:00
foreman cde1570514 P4 to Git Change 1058650 by shivara@shivaram_llvmmerge_dt on 2014/07/24 01:26:41
ECR #397495 - Adding macros which are missing in clang path.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/frontend_clang.cpp#16 edit
2014-07-24 01:39:00 -04:00
foreman 4d4cb007af P4 to Git Change 1058429 by gandryey@gera-dev-w7 on 2014/07/23 14:41:31
ECR #304775 - Device enqueuing
	- Add 1 to count the queue for the blit engine

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#326 edit


[ROCm/clr commit: 431b0d1047]
2014-07-23 15:23:50 -04:00
foreman 431b0d1047 P4 to Git Change 1058429 by gandryey@gera-dev-w7 on 2014/07/23 14:41:31
ECR #304775 - Device enqueuing
	- Add 1 to count the queue for the blit engine

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#326 edit
2014-07-23 15:23:50 -04:00
foreman 649664914e P4 to Git Change 1058428 by marsenau@marsenau_cl on 2014/07/23 14:39:25
ECR #304775 - Bug 10112 - Raise default unroll threshold. The current default is 100, which is even lower than the LLVM default of 150. Increasing to 200 is a modest increase, and this should probably be even higher.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#111 edit


[ROCm/clr commit: 2c5424663c]
2014-07-23 15:19:00 -04:00
foreman 2c5424663c P4 to Git Change 1058428 by marsenau@marsenau_cl on 2014/07/23 14:39:25
ECR #304775 - Bug 10112 - Raise default unroll threshold. The current default is 100, which is even lower than the LLVM default of 150. Increasing to 200 is a modest increase, and this should probably be even higher.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#111 edit
2014-07-23 15:19:00 -04:00
foreman 71a25ff296 P4 to Git Change 1058395 by emankov@em-hsa-amd on 2014/07/23 13:32:42
ECR #333753 - Compiler Lib: Resubmitting double checked changes from CL 1056755 & 1056683

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#43 edit


[ROCm/clr commit: 3eca90a7db]
2014-07-23 13:54:37 -04:00
foreman 3eca90a7db P4 to Git Change 1058395 by emankov@em-hsa-amd on 2014/07/23 13:32:42
ECR #333753 - Compiler Lib: Resubmitting double checked changes from CL 1056755 & 1056683

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#43 edit
2014-07-23 13:54:37 -04:00
foreman 77795f8c89 P4 to Git Change 1058368 by gandryey@gera-dev-w7 on 2014/07/23 12:46:38
ECR #304775 - Attempt to fix BUG#10085
	- Align the scratch buffer size to 16 bytes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#24 edit


[ROCm/clr commit: cec48dd7b8]
2014-07-23 12:54:37 -04:00
foreman cec48dd7b8 P4 to Git Change 1058368 by gandryey@gera-dev-w7 on 2014/07/23 12:46:38
ECR #304775 - Attempt to fix BUG#10085
	- Align the scratch buffer size to 16 bytes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#24 edit
2014-07-23 12:54:37 -04:00
foreman 5b53524966 P4 to Git Change 1058342 by jatang@jatang-ocl-lnx3 on 2014/07/23 11:50:50
EPR #402935 - Reset Resource::pinOffset_ if gslResource couldn't be created for pinned memory.

	When the pinned memory to be created is too large, gslResource couldn't be created, and a local memory will be created instead. If pinOffset_ is NOT reset in this case, it will mess up future copying of the local memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#187 edit


[ROCm/clr commit: 1e0a5f64f5]
2014-07-23 12:19:19 -04:00
foreman 1e0a5f64f5 P4 to Git Change 1058342 by jatang@jatang-ocl-lnx3 on 2014/07/23 11:50:50
EPR #402935 - Reset Resource::pinOffset_ if gslResource couldn't be created for pinned memory.

	When the pinned memory to be created is too large, gslResource couldn't be created, and a local memory will be created instead. If pinOffset_ is NOT reset in this case, it will mess up future copying of the local memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#187 edit
2014-07-23 12:19:19 -04:00
foreman 868a67346b P4 to Git Change 1058301 by weizhang@msdnweizhang-opencl on 2014/07/23 10:08:38
EPR #397491 - Enable platform atomics in stg but not in mainline.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#450 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#272 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#207 edit


[ROCm/clr commit: 5b7c1dd22c]
2014-07-23 10:21:39 -04:00
foreman 5b7c1dd22c P4 to Git Change 1058301 by weizhang@msdnweizhang-opencl on 2014/07/23 10:08:38
EPR #397491 - Enable platform atomics in stg but not in mainline.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#450 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#272 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#207 edit
2014-07-23 10:21:39 -04:00
foreman 351ae12237 P4 to Git Change 1058289 by emankov@em-hsa-amd on 2014/07/23 08:19:09
ECR #333753 - Compiler Lib: back out CL 1056755 & 1056683

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#42 edit


[ROCm/clr commit: f57b7f8c29]
2014-07-23 08:27:56 -04:00
foreman f57b7f8c29 P4 to Git Change 1058289 by emankov@em-hsa-amd on 2014/07/23 08:19:09
ECR #333753 - Compiler Lib: back out CL 1056755 & 1056683

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#42 edit
2014-07-23 08:27:56 -04:00
foreman 0598d0de9b P4 to Git Change 1058200 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/23 03:00:12
EPR #010002 - Change OpenCL version number from 1594 to 1595.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1341 edit


[ROCm/clr commit: a7c60aeaed]
2014-07-23 03:18:34 -04:00
foreman a7c60aeaed P4 to Git Change 1058200 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/23 03:00:12
EPR #010002 - Change OpenCL version number from 1594 to 1595.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1341 edit
2014-07-23 03:18:34 -04:00
foreman 7250d75fcd P4 to Git Change 1058007 by rili@rili_opencl_stg_01 on 2014/07/22 17:28:41
EPR #399808 - Fixed wrong conversion of sRGBA when using host copy instead of blit kernel transfer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#68 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#107 edit


[ROCm/clr commit: 0160f6f1ed]
2014-07-22 17:42:44 -04:00
foreman 0160f6f1ed P4 to Git Change 1058007 by rili@rili_opencl_stg_01 on 2014/07/22 17:28:41
EPR #399808 - Fixed wrong conversion of sRGBA when using host copy instead of blit kernel transfer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#68 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#107 edit
2014-07-22 17:42:44 -04:00
foreman 7cc9d4601a P4 to Git Change 1058007 by rili@rili_opencl_stg_01 on 2014/07/22 17:28:41
EPR #399808 - Fixed wrong conversion of sRGBA when using host copy instead of blit kernel transfer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#68 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#107 edit


[ROCm/clr commit: 1681dd142f]
2014-07-22 17:42:44 -04:00
foreman 1681dd142f P4 to Git Change 1058007 by rili@rili_opencl_stg_01 on 2014/07/22 17:28:41
EPR #399808 - Fixed wrong conversion of sRGBA when using host copy instead of blit kernel transfer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_memobj.cpp#68 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blit.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#107 edit
2014-07-22 17:42:44 -04:00
foreman f998e7094b P4 to Git Change 1057998 by gandryey@gera-dev-w7 on 2014/07/22 17:15:58
ECR #304775 - Device enqueuing
	- Use atomic fetch for enqueue flags
	- Switch to a multithreaded scheduler
	- Add a workaround for Linux host_multi_queue failures. Linux has only 2 queues, but the test allocates multiple host queues and the same HW ring can be used

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#106 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#449 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#325 edit


[ROCm/clr commit: d2b905f18e]
2014-07-22 17:30:56 -04:00
foreman d2b905f18e P4 to Git Change 1057998 by gandryey@gera-dev-w7 on 2014/07/22 17:15:58
ECR #304775 - Device enqueuing
	- Use atomic fetch for enqueue flags
	- Switch to a multithreaded scheduler
	- Add a workaround for Linux host_multi_queue failures. Linux has only 2 queues, but the test allocates multiple host queues and the same HW ring can be used

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#106 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#449 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#325 edit
2014-07-22 17:30:56 -04:00
foreman c6c4492380 P4 to Git Change 1057669 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/22 03:00:12
EPR #010002 - Change OpenCL version number from 1593 to 1594.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1340 edit


[ROCm/clr commit: 1e8c506c75]
2014-07-22 03:10:32 -04:00
foreman 1e8c506c75 P4 to Git Change 1057669 by johtaylo@johtaylo-JTBUILDER03-increment on 2014/07/22 03:00:12
EPR #010002 - Change OpenCL version number from 1593 to 1594.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1340 edit
2014-07-22 03:10:32 -04:00
foreman 1d845a69b2 P4 to Git Change 1057571 by gandryey@gera-dev-w7 on 2014/07/21 18:35:02
ECR #304775 - Device enqueuing
	- Add L2 cache flush after the scheduler execution. Although CP has to work with L2 cache, it seems some functionality relies on direct memory access and without explicit L2 flush CP can pick old values in the template.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#324 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#35 edit


[ROCm/clr commit: 4599bd0d4a]
2014-07-21 18:43:54 -04:00
foreman 4599bd0d4a P4 to Git Change 1057571 by gandryey@gera-dev-w7 on 2014/07/21 18:35:02
ECR #304775 - Device enqueuing
	- Add L2 cache flush after the scheduler execution. Although CP has to work with L2 cache, it seems some functionality relies on direct memory access and without explicit L2 flush CP can pick old values in the template.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#324 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#61 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#35 edit
2014-07-21 18:43:54 -04:00
foreman 944dca18e8 P4 to Git Change 1057460 by gandryey@gera-dev-w7 on 2014/07/21 14:56:59
ECR #304775 - Device enqueuing
	- Report proper size for the device queue.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#259 edit


[ROCm/clr commit: e87e2d4c11]
2014-07-21 15:06:08 -04:00