Граф коммитов

17 Коммитов

Автор SHA1 Сообщение Дата
foreman e25e8f4561 P4 to Git Change 1089024 by ssahasra@ssahasra_opencl_windows on 2014/10/20 00:05:43
ECR #333753 - clc2: disallow implicit function declarations (bug 10328)

	In addition, the change fixes the following side-effects:
	1. Fix a typo in runtime/.../gpuschedcl.cpp, which fails due to the stricter check in Clang.
	2. Unconditionally add sub_group builtins for pipes, without checking if the extension is enabled. See bug 10366.
	3. Also added a test in ocl_features_clang to check for the sub_group builtins.

	Passes smoke, smoke_clang, precheckin.
	Additionally passes new tests added in ocl_features.

	Reviewed by Brian Sumner.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/clang/lib/Sema/SemaLookup.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/clc2/wrapper/ClangWrapper.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#33 edit
2014-10-20 00:17:58 -04:00
foreman af02d04050 P4 to Git Change 1087396 by bsumner@bsumner-lin-opencl on 2014/10/14 15:21:38
ECR #304775 - move majority of rutime kernels into library

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#3 add
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#3 add
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/devenq.h#6 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#3 add
... //depot/stg/opencl/drivers/opencl/library/x86/gen/build/Makefile.gen#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blitcl.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#32 edit
2014-10-14 15:57:52 -04:00
foreman b737b1b1ab P4 to Git Change 1083488 by bsumner@bsumner-lin-opencl on 2014/10/02 09:01:39
ECR #304775 - back out 1083437

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#2 delete
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#2 delete
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/devenq.h#5 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#2 delete
... //depot/stg/opencl/drivers/opencl/library/x86/gen/build/Makefile.gen#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blitcl.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#31 edit
2014-10-02 12:55:08 -04:00
foreman 72fd13e445 P4 to Git Change 1083437 by bsumner@bsumner-lin-opencl on 2014/10/01 20:06:47
ECR #304775 - move runtime kernels mostly into library

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/common.hsa/src/blitKernels.cl#1 add
... //depot/stg/opencl/drivers/opencl/library/common/src/blitKernels.cl#1 add
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/devenq.h#4 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#1 add
... //depot/stg/opencl/drivers/opencl/library/x86/gen/build/Makefile.gen#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/blitcl.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#30 edit
2014-10-01 20:12:43 -04:00
foreman 9c6c074122 P4 to Git Change 1075041 by gandryey@gera-dev-w7 on 2014/09/09 18:36:42
ECR #304775 - Device enqueue
	- Update the template header

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#29 edit
2014-09-09 19:12:37 -04:00
foreman 2738b30287 P4 to Git Change 1072528 by gandryey@gera-dev-w7 on 2014/09/02 17:00:16
ECR #304775 - Add batching to the device enqueue for possible asynchronous execution
	- Increase the max device queue size to 512KB. That will allow to pass conformance tests that enqueue more jobs than the queue size.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#459 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#333 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#65 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#39 edit
2014-09-02 17:15:39 -04:00
foreman 87e6a3fa02 P4 to Git Change 1071782 by gandryey@gera-dev-w7 on 2014/08/29 16:43:04
ECR #304775 - Combine BUSY and DONE processing in the scheduler

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#27 edit
2014-08-29 16:48:31 -04:00
foreman 96a59a73b8 P4 to Git Change 1071369 by gandryey@gera-dev-w7 on 2014/08/28 18:09:49
ECR #304775 - Add a pointer to the profiling_capture_event_info values.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#26 edit
2014-08-28 18:45:10 -04:00
foreman a27a480c5f P4 to Git Change 1068415 by gandryey@gera-dev-w7 on 2014/08/20 10:41:55
ECR #304775 - Device enqueue
	- Fix a copy/paste typo with the "failure" argument

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#25 edit
2014-08-20 11:00:32 -04:00
foreman 8fa184db38 P4 to Git Change 1065597 by gandryey@gera-dev-w7 on 2014/08/12 18:38:45
ECR #304775 - Device enqueuing
	- Provide scratch buffer offset for generic address space
	- Use single scratch buffer for all available queues. Each queue will have a unique subbuffer in the global buffer

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#454 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#129 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#329 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#120 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#37 edit
2014-08-12 18:49:08 -04:00
foreman 3a4400135d P4 to Git Change 1059564 by gandryey@gera-dev-w7 on 2014/07/25 18:14:33
ECR #304775 - Device enqueuing
	- Run the scheduler in the host queue by default.
	- GPU_USE_DEVICE_QUEUE=1 can force execution in the device queue

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#451 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#260 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#273 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#86 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#327 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#119 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#208 edit
2014-07-25 20:41:05 -04:00
foreman d2b905f18e P4 to Git Change 1057998 by gandryey@gera-dev-w7 on 2014/07/22 17:15:58
ECR #304775 - Device enqueuing
	- Use atomic fetch for enqueue flags
	- Switch to a multithreaded scheduler
	- Add a workaround for Linux host_multi_queue failures. Linux has only 2 queues, but the test allocates multiple host queues and the same HW ring can be used

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#106 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#449 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#325 edit
2014-07-22 17:30:56 -04:00
foreman 6627cf5cf7 P4 to Git Change 1056280 by gandryey@gera-dev-w7 on 2014/07/17 11:55:30
ECR #304775 - Device enqueuing
	- Add extra sync point for the scheduler exit.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#323 edit
2014-07-17 12:20:49 -04:00
foreman 6314b334ba P4 to Git Change 1055054 by gandryey@gera-dev-w7 on 2014/07/14 20:18:53
ECR #304775 - Device enqueuing
	- Switch to the single thread scheduler for now(the current version isn't friendly for single thread). Hopefully it's a temporary solution until synchronization issue with multithreaded scheduler will be identified.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#104 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#20 edit
2014-07-14 20:24:58 -04:00
foreman 977f314094 P4 to Git Change 1053268 by gandryey@gera-dev-w7 on 2014/07/08 15:08:51
ECR #304775 - Device enqueuing
	- Add printing of the waiting events
	- Add early exit in the scheduler if nothing to launch

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#321 edit
2014-07-08 15:30:04 -04:00
foreman cd3fefb00d P4 to Git Change 1052832 by gandryey@gera-dev-w7 on 2014/07/07 18:44:29
ECR #304775 - Device enqueuing
	- Update the scheduler to handle event mask

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuschedcl.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#320 edit
2014-07-07 18:58:52 -04:00
foreman 3694ab2ce8 initial commit 2014-07-04 16:17:05 -04:00