Commit Graph

261 Commitit

Tekijä SHA1 Viesti Päivämäärä
foreman 5b3ecedc9b P4 to Git Change 1171714 by marsenau@marsenau_home on 2015/07/17 19:29:59
ECR #304775 - First batch of build fixes for clang.

	Fixes hard source errors and a handful of simple warnings, but leaves most other warnings for later. Other errors not fixed here are from adding compile flags that are not understood.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/clc/src/e2lCommon.h#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/opt_level.hpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/BRIGAsmPrinter.cpp#117 edit
... //depot/stg/opencl/drivers/opencl/opencldefs#162 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#289 edit
2015-07-17 19:50:05 -04:00
foreman 0767cbd2e6 P4 to Git Change 1171462 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/07/17 03:00:12
EPR #010002 - Change OpenCL version number from 1849 to 1850.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1596 edit
2015-07-17 03:43:16 -04:00
foreman fc727295b7 P4 to Git Change 1171095 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/07/16 03:00:48
EPR #010002 - Change OpenCL version number from 1848 to 1849.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1595 edit
2015-07-17 01:24:04 -04:00
foreman e3e54cabd9 P4 to Git Change 1171338 by gandryey@gera-dev-w7 on 2015/07/16 17:12:56
ECR #304775 - Remove the obsolete code

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#513 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/include/cal/calcl.h#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#74 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#125 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#49 edit
2015-07-16 21:38:11 -04:00
foreman 7ab78d6db2 P4 to Git Change 1170805 by nhaustov@nhaustov_hsa on 2015/07/15 10:59:34
ECR #333756 - Fix missing add of offset in ORCAHSALoaderContext::SegmentAddress.

	Testing: pre-checkin

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#195 edit
2015-07-16 08:30:58 -04:00
foreman 8cc3f47661 P4 to Git Change 1170297 by nhaustov@nhaustov_hsa on 2015/07/14 05:36:10
ECR #333753 - ORCA RT/Compiler Lib: HSA Code Object/RT independent loader introducing/integration into OpenCL.

	Changes by Evgeniy Mankov.

	       Purpose:
	       Use the same Finalizer & loader for both HSA & ORCA RT.
	       AMDIL path is not affected.

	       Changes:
	       1. The whole BRIG is finalized now instead of per kernel finalization (both in gpuprogram & hsail_be).
	       2. HSALoader is changed in order to work with CodeObject and new HSA Loader's API <96> Context. Now it is in ORCA<92>s gpuprogram instead of Compiler Lib.
	       3. brig_loader.cpp is removed from compiler lib, as well as __aclHSALoader function exports from the whole stack.
	       4. BIF .text section now contains the whole finalized HSA CodeObject instead of separate symbols for finalized kernels.
	       5. ORCA RT now works directly with amd_kernel_code_t and doesn't need any SC metadata anymore.
	       6. aoc2 is supplemented with fake offline loader correspondingly.
	       7. amdocl/complib make sytem changes.
	       8. test_driver.pl update.

	       ToDo:
	       1. Implement disassemble() & BuildLog() functions to support ISA dumping & SC error handling (Konstantin).
	       2. Global variables initialization by pragma reference (Konstantin). Test to verify: test_basic progvar_prog_scope_init.
	       3. Code Object without kernels support (Nikolay - ready). Test to verify: test_generic_address_space.exe library_function

	       testing: windows smoke, pre check-in, ocl conformance 2.0, ocl SDK 2.9

	       Reviewers: Nikolay Haustov, German Andryeyev

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.def.in#13 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.map.in#15 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/build/Makefile.api#116 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/amdoclcl.def.in#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/legacy-lib/amdoclcl.map.in#2 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/amdoclcl.def.in#12 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/amdoclcl.map.in#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#70 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/build/Makefile.gpu#32 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/hsail_be.cpp#44 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/build/Makefile.complib#85 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#18 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#70 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/build/Makefile.aoc2#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#248 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#121 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#288 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#112 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#194 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuscsi.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#368 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/bin/test_driver.pl#12 edit
2015-07-14 17:08:54 -04:00
foreman 4285620de2 P4 to Git Change 1169722 by rili@rili_opencl_stg on 2015/07/10 17:39:21
EPR #419313 - Disable large allocaiton for Win 10.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#313 edit
2015-07-11 13:58:17 -04:00
foreman 4fd51c5805 P4 to Git Change 1168792 by rili@rili_opencl_stg on 2015/07/08 17:02:22
EPR #419313 - Fix the wrong check for string.find

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#287 edit
2015-07-08 17:07:41 -04:00
foreman 75b2321608 P4 to Git Change 1168292 by gandryey@gera-w8 on 2015/07/07 14:29:14
ECR #304775 - Fix a crash in memorybandwidth test
	-  Remove a pinned mem object from the list only if we need a free slot.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#367 edit
2015-07-07 14:34:52 -04:00
foreman 8ba12a7808 P4 to Git Change 1167979 by kzhuravl@linu-kzhuravl-oclhsa on 2015/07/06 16:55:44
EPR #333756 - AMD HSA Header Files Changes (cross branch checkin)

	- Switched AMD HSA Header Files from bitfields to bitmasks (amd_kernel_code_t and amd_queue_t)
	- Fixed minor typos/name inconsistencies
	- Made required changes in SC/Dev/SCDevUtil + device enqueue and runtime changes

	Testing: sc farm, precheckin (http://ocltc.amd.com:8111/viewModification.html?modId=54240&personal=true&tab=vcsModificationBuilds)
	ReviewBoardURL: http://ocltc.amd.com/reviews/r/7861/, http://ocltc.amd.com/reviews/r/7919/

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/ext/loader/executable.cpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/include/amd_hsa_common.h#5 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/include/amd_hsa_kernel_code.h#5 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/include/amd_hsa_queue.h#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/HSAIL/include/amd_hsa_signal.h#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/Src/Dev/Dev2010.vcxproj.filters#7 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/Src/Dev/Tools/R1000Memory.cxx#6 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/Src/NSrc/SCShaderInfo.cpp#6 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/Src/NSrc/SCShaderInfo.hpp#6 integrate
... //depot/stg/opencl/drivers/opencl/compiler/sc/Src/NSrc/SiHwShaders.cpp#6 integrate
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/devenq.h#10 edit
... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#286 edit
2015-07-06 17:14:23 -04:00
foreman 12491efc9b P4 to Git Change 1167421 by gandryey@gera-w8 on 2015/07/03 15:12:19
ECR #304775 - Mipmaps support.
	- Add a check for mipmap levels in the resource cache

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#222 edit
2015-07-03 15:18:55 -04:00
foreman 83ea36fe33 P4 to Git Change 1164406 by rili@rili_opencl_stg on 2015/06/23 17:15:31
EPR #419313 - Limit >4GB allocation support only for workstation board

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#312 edit
2015-06-23 17:27:26 -04:00
foreman 6f88046a9a P4 to Git Change 1163938 by gandryey@gera-w8 on 2015/06/22 18:23:52
ECR #304775 - Mipmaps support. Fix falures after CL#1163104
	- don't allocate extra view if level/layer was created already

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#124 edit
2015-06-22 18:35:54 -04:00
foreman 76b0dcb140 P4 to Git Change 1163104 by gandryey@gera-w8 on 2015/06/18 18:55:01
ECR #304775 - Mipmaps support
	- Revert unnecessary changes in gpuresource.cpp.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#221 edit
2015-06-18 19:01:25 -04:00
foreman 706ff60007 P4 to Git Change 1163090 by gandryey@gera-w8 on 2015/06/18 17:29:32
ECR #304775 - Mipmaps support
	- Fix the view creation for the host path transfers. GSL can ignore the original mipmap surface dimensions and apply the new settings

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#220 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#123 edit
2015-06-18 17:39:39 -04:00
foreman d0b51ccdbf P4 to Git Change 1161664 by rili@rili_opencl_stg on 2015/06/15 18:01:53
EPR #421679 - remove all source code related to open video

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/Makefile#3 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVDecode.cpp#15 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVDecode.def#5 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVDecode.h#7 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVDecode.rc#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVDecodeTypes.h#6 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVEncode.cpp#37 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVEncode.h#11 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OVEncodeTypes.h#16 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.cpp#6 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.def#3 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.h#5 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.rc#3 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.sln#5 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.vcxproj#11 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo.vcxproj.filters#3 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo_2008.sln#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/OpenVideo_2008.vcproj#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/atiumdva.cap#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/build.bat#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/build/Makefile#6 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/build/Makefile.ov#18 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/build/Makefile.ovd#27 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/buildOV/Makefile#5 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/OpenVideo/res/OpenVideo.rc2#2 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/build/Makefile.api#113 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_common.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_context.cpp#43 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#59 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_open_video.cpp#35 delete
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_open_video_amd.h#20 delete
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.cpp#273 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#91 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuvirtual.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpuvirtual.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#247 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#512 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#145 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#311 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.hpp#91 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#366 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#130 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#73 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.h#45 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.cpp#92 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.hpp#49 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsavirtual.hpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsavirtual.hpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#78 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/object.hpp#15 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/video_session.hpp#13 delete
... //depot/stg/opencl/drivers/opencl/runtime/top.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#235 edit
2015-06-15 18:09:54 -04:00
foreman 9423b93fb2 P4 to Git Change 1161661 by skudchad@skudchad_test_win_opencl2 on 2015/06/15 17:50:19
EPR #420701 - WDDM 2.0
	- Indicate lite initialization via a new parameter initLite. This is needed as OCL\OGL can query devices first and lot of  setup in IOL may not be really needed. On Win10, we create paging\fences for sync which dont get destroyed by the OS unless process is killed. This may prevent slave GPU powerdown in PX cases.
	- Disable WriteFence for OpenCL for now.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/7765/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLContext.cpp#72 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#122 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#48 edit
2015-06-15 18:01:55 -04:00
foreman e919691665 P4 to Git Change 1160575 by gandryey@gera-dev-w7 on 2015/06/11 18:31:59
ECR #304775 - Real time queue support
	- Add the new interface for the real time queue creation

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_command.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_device.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/cpu/cpudevice.hpp#90 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#246 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#511 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#144 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.cpp#91 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsadevice.hpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsadevice.hpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.hpp#15 edit
2015-06-11 19:17:47 -04:00
foreman 7b35af8cd4 P4 to Git Change 1159996 by weizhang@weizhang-lnx-opencl-tmp on 2015/06/10 16:03:37
EPR #410989 - [Project Brahma] - CL-GL Support

	   At present, multi-gpu is not supported on Brahma stack. Thus, glxGetContextMVPUINFOAMD hasn't been implemented by OGL.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#24 edit
2015-06-10 16:23:26 -04:00
foreman 7e30125446 P4 to Git Change 1159578 by bdhanase@bala_workpc_ocl on 2015/06/09 21:12:31
EPR #397491 - Disabling 32 bit generic address space temporarily because of  bug 10841.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#121 edit
2015-06-09 21:24:41 -04:00
foreman 7181d6bc62 P4 to Git Change 1159420 by bdhanase@bala_workpc_ocl on 2015/06/09 15:03:32
EPR #397491 - Back out changelist 1158497

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#120 edit
2015-06-09 15:25:21 -04:00
foreman ca7f6ab119 P4 to Git Change 1159349 by rili@rili_opencl_stg on 2015/06/09 12:52:32
EPR #419313 - Blit Kernel execution split
	                         Enabled flag GPU_ENABLE_LARGE_ALLOCATION
	                         Return error if global work size is larger than 32bits.

	Code Review#7664

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#285 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#111 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#365 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#234 edit
2015-06-09 12:58:27 -04:00
foreman 2a6b6b1c23 P4 to Git Change 1159286 by gandryey@gera-ubuntu14 on 2015/06/09 10:30:28
ECR #304775 - Mipmaps support
	- Fix clCopyImages failures under Linux. Unmap for READ was skipping the view destruction. Combine READ/WRITE map/unmap paths for mipmap textures

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#179 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#364 edit
2015-06-09 10:41:49 -04:00
foreman e08daae657 P4 to Git Change 1158497 by bdhanase@bala_workpc_ocl on 2015/06/05 18:25:55
EPR #397491 - Back out changelist 1158454

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#119 edit
2015-06-05 18:32:33 -04:00
foreman 4be12926b4 P4 to Git Change 1158454 by bdhanase@bala_workpc_ocl on 2015/06/05 16:42:38
EPR #397491 - Back out changelist 1152826

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#118 edit
2015-06-05 16:57:15 -04:00
foreman 279bd0757d P4 to Git Change 1158411 by bdhanase@bala_workpc_ocl on 2015/06/05 15:03:35
EPR #397491 - Disable 32 bit OpenCL2.0 on Linux alone

	http://ocltc.amd.com/reviews/r/7683/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpucompiler.cpp#151 edit
2015-06-05 15:09:54 -04:00
foreman a53e6a74a4 P4 to Git Change 1157927 by asalmanp@asalmanp-opencl-stg1 on 2015/06/04 13:19:47
EPR #420344 - Forum [180211]: enqueueNDRangeKernel crashes to execute device binary if it contains printf statements

	This is a temporary workaround to avoid app crash when a kernel has pritntf but the program object is built from a binary (i.e., the printf info is not propagated if the program object is built from a binary).

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/7676/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprintf.cpp#36 edit
2015-06-04 13:30:57 -04:00
foreman 79b758537c P4 to Git Change 1156441 by yaxunl@yaxunl_stg_win50 on 2015/06/01 13:40:44
EPR #420584 - [CQE OCL][ISV][QR][SI] FAHBenchmark application is crashing on all SI cards.

	Wave limiter causes FAH crash on SI. Disable wave limiter for SI as a workaround.

	Opened bug #10817 to track this issue.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#6 edit
2015-06-01 13:51:27 -04:00
foreman 097a9fdd56 P4 to Git Change 1156178 by kzhuravl@win-kzhuravl-stg-oclhsa on 2015/05/30 02:59:02
EPR #333756 - Integrate header file cleanup from stg hsa and do necessary updates (cross branch checkin)

	Reviewers: Nikolay, Evgeniy, Laurent
	Testing: p4precheckin(http://ocltc:8111/viewModification.html?modId=52194&personal=true&init=1&tab=vcsModificationBuilds)

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/amd_hsa_common.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/amd_hsa_elf.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/amd_hsa_kernel_code.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/amd_hsa_queue.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/amd_hsa_signal.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/hsa.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/hsa_ext_finalize.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/include/hsa_ext_image.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/SI/scCompileSI.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/build/Makefile.scwrapper#27 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/build/Makefile.gpu#63 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudebugger.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#110 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#363 edit
2015-05-30 03:12:34 -04:00
foreman 89ab6d41a0 P4 to Git Change 1155415 by gandryey@gera-w8 on 2015/05/28 11:39:24
ECR #304775 - Mipmaps support
	- Avoid a bogus error message for a mipmap unmap

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpumemory.cpp#121 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#219 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.hpp#81 edit
2015-05-28 11:55:23 -04:00
foreman 6d913bfc23 P4 to Git Change 1155068 by jatang@jatang-opencl-hsa-stg1 on 2015/05/27 17:48:24
EPR #419347 - Fix a d3d9 memory leak.

	According to https://msdn.microsoft.com/en-us/library/windows/desktop/bb174386(v=vs.85).aspx: “Calling IDirect3DDevice9::GetDirect3D will increase the internal reference count on the IDirect3D9 interface. Failure to call IUnknown::Release when finished using this IDirect3D9 interface results in a memory leak.”.

	Although p3d9devEx->Release() has the same effect as p3d9dev->Release(), for clarification we better use p3d9dev->Release() instead.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceD3D9.cpp#13 edit
2015-05-27 18:00:49 -04:00
foreman 5c0f3858ff P4 to Git Change 1154572 by gandryey@gera-w8 on 2015/05/26 16:21:11
ECR #304775 - Mipmaps support in OpenCL
	- Enable PAD2 bit for miplevel views

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#218 edit
2015-05-26 16:35:34 -04:00
foreman 4f338e5578 P4 to Git Change 1152849 by skudchad@skudchad_test_win_opencl2 on 2015/05/20 17:15:00
EPR #403782 - IOMMU2/SVM
	- Update the caching and hit logic for resource cache to reflect allocation attributes for SVM. Else it can give wrong hits leading to hangs if a regular surface is used for shader upload etc. IOMMUv2 strictly needs shader and command buffers to have EXECUTE attribute.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/7572/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#217 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.hpp#80 edit
2015-05-20 17:28:59 -04:00
foreman 24ac37876f P4 to Git Change 1152826 by bdhanase@bala_workpc_ocl on 2015/05/20 16:21:12
EPR #397491 - Disabling generic address space on 32 bit windows too for now.

	Back out revision 116 from //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#117 edit
2015-05-20 16:35:07 -04:00
foreman 9d37ac1fc8 P4 to Git Change 1152825 by bdhanase@bala_workpc_ocl on 2015/05/20 16:19:13
EPR #397491 - According to HSA-Finalizer-ADD, for GPUVM32 private_segment_aperture_base_hi and group_segment_aperture_base_hi should be equal to the 32 bits of the 32 bit private and group segment flat address aperture.

	Reviewed by: German

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#362 edit
2015-05-20 16:30:03 -04:00
foreman 4cffcd3997 P4 to Git Change 1152294 by nhaustov@spb-hlc-tonga2_hsa on 2015/05/19 12:34:06
ECR #333756 - HSA Finalizer: Make sure size of kernarg segment, alignment of kernarg, private and group segments are multiple of 16. Update ORCA runtime assert. [ OpenCL integration of CL 1151953]

	Change by Nikolay Haustov
	Testing: http://ocltc:8111/viewModification.html?modId=51851&personal=true&init=1&tab=vcsModificationBuilds

	Also fix uncovered problem in test.

	Testing: pre-checkin
	Reviewed by: German

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/hsail-fin/HSAILFinalizer.cpp#16 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/sc/HSAIL/tests/src/finalizer/features/structural_analysis/short_circuit/short_circuit06.hsail#4 integrate
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#284 edit
2015-05-19 12:40:37 -04:00
foreman 915592103f P4 to Git Change 1151968 by skudchad@skudchad_test_win_opencl2 on 2015/05/18 20:09:41
EPR #403782 - IOMMU2/SVM
	- Disable DX interop on SVM. This is a feature for SVM and may need more work.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/7555/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#310 edit
2015-05-18 20:15:12 -04:00
foreman 488ac00670 P4 to Git Change 1151655 by gandryey@gera-w8 on 2015/05/15 19:27:54
ECR #304775 - Mipmaps support
	- Following CL#1151650. Change the comparison condition to 1.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#216 edit
2015-05-15 19:38:50 -04:00
foreman b8c761b199 P4 to Git Change 1151650 by gandryey@gera-w8 on 2015/05/15 18:45:24
ECR #304775 - Mipmaps support
	- Enable miplevel flag even for the first mip level when runtime creates a view. Otherwise GSL may change the pitch alignment for the created view.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuresource.cpp#215 edit
2015-05-15 18:50:37 -04:00
foreman 5303c618b2 P4 to Git Change 1151521 by jatang@jatang-opencl-hsa-stg1 on 2015/05/15 15:16:15
EPR #419347 - Fix a d3d9 memory leak.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceD3D9.cpp#12 edit
2015-05-15 15:49:45 -04:00
foreman dd63bd7d18 P4 to Git Change 1151492 by bdhanase@bala_workpc_ocl on 2015/05/15 14:40:44
EPR #397491 - Disable generic address space only on 32 bit Linux

	Reviewed by: German

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#116 edit
2015-05-15 15:06:35 -04:00
foreman 1c5ee64101 P4 to Git Change 1151052 by emankov@em-hsa-amd on 2015/05/14 15:16:47
ECR #333753 - Compiler Lib/RT: libutils.h usage removal due to non-API interface

	Utils are to be used only by Compiler Lib itself.

	Testing: pre checkin

	Reviewers: German Andryeyev, Brian Sumner, Yaxun Liu

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#178 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#193 edit
2015-05-14 15:28:01 -04:00
foreman 22e298d645 P4 to Git Change 1150377 by gandryey@gera-w8 on 2015/05/13 11:39:13
EPR #419351 - clEnqueueNDRange crash if one doesn't create a device queue and use device enqueue in the kernel
	- add a check for defQueue is NULL in case the app didn't create one.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#510 edit
2015-05-13 12:20:57 -04:00
foreman 7a956a4aa7 P4 to Git Change 1149454 by yaxunl@yaxunl_stg_win50 on 2015/05/11 15:00:43
EPR #419065 - [CQE OCL][ISV][QR][G] FAHBenchmark application is crashing.

	Two issues:
	1. Remove clearing of profileEnabled_ since it may cause incorrect kernel execution time measurement.
	2. Blit kernels causes assertion in getWavesPerSH since they do not have wave limiters. Remove the assert. If a kernel has no wave limiter, returns 0 in getWavesPerSH.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#361 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#5 edit
2015-05-11 15:14:59 -04:00
foreman 51efa976bf P4 to Git Change 1147021 by yaxunl@yaxunl_stg_win50 on 2015/05/04 16:06:06
ECR #304775 - Wave limiter: Fix bug in adaptation.

	Dumped waves/simd value is incorrect.
	Should exit adptation only after the changed waves/simd value is applied.
	Added wave limiter manager to handle situation that one kernel is enqueued to more than one queues. Create wave limiter for each virtual device.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#245 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#283 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#109 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#360 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#70 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#76 edit
2015-05-04 16:20:24 -04:00
foreman 51f8154b23 P4 to Git Change 1146707 by bdhanase@bala_workpc_ocl on 2015/05/01 18:29:18
EPR #410821 - Reduced the maxMemAlloc and global memory size on APUs to 75% of uncachedRemoteRAM.

	http://ocltc.amd.com/reviews/r/7425/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#509 edit
2015-05-01 18:36:34 -04:00
foreman b5a5c65e53 P4 to Git Change 1144775 by yaxunl@yaxunl_stg_win50 on 2015/04/27 12:11:07
ECR #304775 - Wave limiter: Fix crash in CompuBenchCL video composition due to profiling data not collected correctly.

	Gpuvirtual.cpp only collects profiling data when all events have profiling enabled. Fixed it by adding a member to indicate at least one event has profiling enabled and collect profiling data.

	Improved adptation by changing waves/simd only when the last change has been enforced. Also detecting discontinuities in measured data and discard them.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#359 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#129 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#230 edit
2015-04-27 12:21:03 -04:00
foreman 8c00d76f96 P4 to Git Change 1144455 by gandryey@gera-dev-w7 on 2015/04/24 19:06:19
ECR #304775 - Fix offline compilation for Hawaii with -cl-fp32-correctly-rounded-divide-sqrt flag
	- check for cl-fp32-correctly-rounded-divide-sqrt support uses device info, but device info was never fully updated for offline devices. This change will update device info structure for offline devices as well.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#508 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#143 edit
2015-04-24 19:18:07 -04:00
foreman 6b46087492 P4 to Git Change 1143340 by skudchad@skudchad_test_win_opencl2 on 2015/04/22 13:03:52
EPR #403782 - IOMMU2/SVM on CZ Win10, GL Interop changes for SVM.
	- Pass flag to GL to disable forceRemoteAllocation during GLDissociate

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceGL.cpp#23 edit
2015-04-22 13:12:25 -04:00
foreman 5cb151c778 P4 to Git Change 1143317 by yaxunl@yaxunl_stg_win50 on 2015/04/22 11:23:22
ECR #304775 - Wave limiter: fix bug about some variables not initialized before being used.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#2 edit
2015-04-22 11:31:43 -04:00