Grafik Komit

731 Melakukan

Penulis SHA1 Pesan Tanggal
foreman 4ec6306997 P4 to Git Change 1181960 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/19 03:01:46
EPR #010002 - Change OpenCL version number from 1874 to 1875.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1621 edit


[ROCm/clr commit: bd71fc8775]
2015-08-19 03:12:52 -04:00
foreman e1382f2d12 P4 to Git Change 1181925 by marsenau@marsenau_home on 2015/08/19 00:14:53
ECR #304775 - Remove HLC_Unroll_* variables.

	HLC_Unroll_Scratch_Threshold was unused. The others have equivalent settings in the AMDLLVMContextHook, so consistently use that version. The patches to opt were already had different set of command line flags for the same options.

	This changes two of the defaults in compiler library and the equivalent flags in opt to match the values which were actually in use so this shoudn't change the current behavior. The unroll threshold default and allow partial unrolling defaults were changed to the actually used default values. Eventually all of these custom options should be removed, because in current LLVM these can be controlled per loop by the TargetTransformInfo, and all have equivalent cl::opts already.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#128 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/AMDLLVMContextHook.h#27 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/Transforms/IPO/AMDOptOptions.h#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/IPO/AMDOptOptions.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Transforms/IPO/AMDPassManagerBuilder.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/tools/opt/amdopt.inc#25 edit


[ROCm/clr commit: 5ad9c314c6]
2015-08-19 00:25:53 -04:00
foreman 504718d8e0 P4 to Git Change 1181419 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/18 03:00:10
EPR #010002 - Change OpenCL version number from 1873 to 1874.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1620 edit


[ROCm/clr commit: 87f0549195]
2015-08-18 03:08:20 -04:00
foreman ea4f515260 P4 to Git Change 1181309 by johtaylo@johtaylo_L7_stg on 2015/08/17 18:25:46
ECR #304775 - Preparation for kernel caching feature

	force line ending on files to one style. Precheckin does not capture this failure as it converts all text files to linux format so that both linux and windows can compile the code during precheckin.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/cache.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/cache.hpp#4 edit


[ROCm/clr commit: 02cf1c13a0]
2015-08-17 18:34:10 -04:00
foreman d28d824917 P4 to Git Change 1181226 by kebai@kebai-lnx1 on 2015/08/17 16:11:30
ECR #304775 - Preparation for kernel caching feature
	1. Each device have a separate cache directory
	2. It logs caching errors, so we can debug the cache and/or detect collisions
	3. Implementeded cache size tracking, so we can evict old data when cache files are too large
	4. Added file/path access permission control on both windows and linux
	5. Have read/write file lock protection
	6. -kcache-disable flag can be used to turn on/off the caching functionality
	7. AMD_FORCE_KCACHE_TEST env variable is used for internal testing
	8. For the stage we want to cache, call getCacheEntry() followed by makeCacheEntry() if the get fails; otherwise directly return cached data.
	- After the compiler library code is refactored, getCacheEntry() and makeCacheEntry() will be wrapped into one function call, so that only one call is needed at the place we want to cache

	TO DO:
	1. Use it in the compiler library code
	- Waiting for the decision on how many stages we want to cache, i) 1-stage caching: source->ISA; or ii) 3-stage caching: source->LLVM IR, LLVM IR->IL, IL->ISA
	2. Tracking of timestamps for cache entries
	- LRU eviction when cache grows too large
	- Suggestion from Laurent: Regarding tracking timestamps for LRU eviction: Random eviction would probably perform as well as LRU and does not require timestamps.
	3. Track cache entries per application

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/8194/

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/cache.cpp#3 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/cache.hpp#3 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#127 edit


[ROCm/clr commit: 7b52a80a63]
2015-08-17 16:27:05 -04:00
foreman 001be6d3ea P4 to Git Change 1181197 by bdhanase@opencl_intel5 on 2015/08/17 14:59:15
EPR #397491 - Re enable 32 bit OCL2.0 on Win 10

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#324 edit


[ROCm/clr commit: fe96d1dc70]
2015-08-17 15:21:17 -04:00
foreman fd050f9ce5 P4 to Git Change 1181122 by gandryey@gera-dev-w7 on 2015/08/17 13:35:16
ECR #304775 - Disable Pre-SI asics initialization in runtime. EG and NI are no longer supported from 15.30

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#125 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#519 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#135 edit


[ROCm/clr commit: ab074c69c7]
2015-08-17 13:47:02 -04:00
foreman 556aaadd13 P4 to Git Change 1181062 by rili@rili_opencl_stg on 2015/08/17 10:39:25
EPR #419313 - Check internal kernel in right palce

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#292 edit


[ROCm/clr commit: e11d0a712d]
2015-08-17 10:48:55 -04:00
foreman 267b183f37 P4 to Git Change 1180920 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/17 03:00:10
EPR #010002 - Change OpenCL version number from 1872 to 1873.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1619 edit


[ROCm/clr commit: 57e25aa29e]
2015-08-17 03:10:22 -04:00
foreman 96167db76d P4 to Git Change 1180775 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/15 03:00:28
EPR #010002 - Change OpenCL version number from 1871 to 1872.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1618 edit


[ROCm/clr commit: 5e6c8eef77]
2015-08-15 03:12:01 -04:00
foreman 8f0e25ef3e P4 to Git Change 1180763 by todli@todli-win-opencl-kv1 on 2015/08/14 22:54:27
EPR #399601 - OpenCL Stoney BU changes

	Reviewed by: Jeff, Brian, Saleel

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/devState.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/amuabi/amuABI.h#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/amuabi/amuABIMultiBinary.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings.h#32 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil.h#27 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_amdil64.h#24 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail.h#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/target_mappings_hsail64.h#18 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/AMDIL.h#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/AMDILSubtarget.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/Internal/AMDILInternalDeviceFlags.h#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/Internal/InternalProcessors.td#4 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/driver/driver.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#124 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#518 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#323 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#134 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/caltarget.h#2 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFEnumCheck.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/math/OCLMathFunc.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLDeviceQueries.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLPerfCounters.cpp#23 edit


[ROCm/clr commit: ae1fa5ae2f]
2015-08-14 23:09:39 -04:00
foreman a867069209 P4 to Git Change 1180729 by marsenau@marsenau_home on 2015/08/14 18:09:05
ECR #304775 - Move AMDExportKernelNature to compiler library.

	This is part of the effort to move all extra passes out of the LLVM directory and into compiler library so we can build with a pure copy of LLVM. This is far from all of the sources required to move, but I'm trying to do the migration in moderately sized pieces.

	Duplicates AMDSymbolNames.h utility header to avoid having a circular dependency between LLVM and compiler library. Eventually when all extra sources are moved into compiler library, there should only be one copy left.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/optimizer.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/include/AMDPasses.h#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/include/AMDSymbolName.h#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/AMDExportKernelNature.cpp#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/Analysis/Passes.h#56 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/InitializePasses.h#81 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Analysis/AMDExportKernelNature.cpp#16 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Analysis/Analysis.cpp#34 edit


[ROCm/clr commit: 9c9f5d42f9]
2015-08-14 18:21:40 -04:00
foreman d252efbe9f P4 to Git Change 1180506 by yaxunl@yaxunl_stg_win50 on 2015/08/14 11:18:00
ECR #354633 - SPIR-V: Add clCreateProgramWithIL API to CL 2.0.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.def.in#14 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/amdocl.map.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_icd_amd.h#16 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_program.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/headers/opencl2.0/CL/cl.h#20 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/icd/icd_dispatch.c#27 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/khronos/icd/icd_dispatch.h#22 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.hpp#31 edit


[ROCm/clr commit: b9d5051b72]
2015-08-14 11:36:12 -04:00
foreman fc1a977400 P4 to Git Change 1180360 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/14 03:00:17
EPR #010002 - Change OpenCL version number from 1870 to 1871.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1617 edit


[ROCm/clr commit: 977ba9ae12]
2015-08-14 03:12:19 -04:00
foreman c5cbf6e446 P4 to Git Change 1180135 by yaxunl@yaxunl_stg_win50 on 2015/08/13 13:24:11
ECR #354633 - SPIR-V: Fix build failure in mainline.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#132 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/build/Makefile.complib#89 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/Makefile#53 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Makefile#26 edit


[ROCm/clr commit: 97f6edabb3]
2015-08-13 13:32:33 -04:00
foreman 5aea79d7bf P4 to Git Change 1180133 by ashi1@ashi1_win50 on 2015/08/13 13:20:33
ECR #292434 - Add option to turn on/off SCOption_POST_RA_INSTRUCTION_SCHEDULING.

	Also fixed some default comments.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/scwrapper/scState.cpp#39 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/OPTIONS.def#126 edit


[ROCm/clr commit: c37750a8ac]
2015-08-13 13:28:01 -04:00
foreman 6dfaff3485 P4 to Git Change 1180048 by bsumner@bsumner-lin-opencl on 2015/08/13 09:37:01
ECR #304775 - back out 1179705 since it also affects AMDIL path

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#322 edit


[ROCm/clr commit: bde9db095c]
2015-08-13 09:47:56 -04:00
foreman 4fe0787df8 P4 to Git Change 1179933 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/13 03:00:13
EPR #010002 - Change OpenCL version number from 1869 to 1870.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1616 edit


[ROCm/clr commit: 6b37ecd4c5]
2015-08-13 03:08:22 -04:00
foreman e75f6223e2 P4 to Git Change 1179851 by bdhanase@opencl_intel5 on 2015/08/12 20:49:50
EPR #397491 - Disable 32 bit OCL2.0 on Win 10 for now.  isWindows10OrLater() call doesn't work.  So, passing a flag from GSL to runtime to disable 32 bit OCL2.0 on Win 10.

	http://ocltc.amd.com/reviews/r/8234/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#321 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#133 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#8 edit


[ROCm/clr commit: 68cf197c22]
2015-08-12 21:03:02 -04:00
foreman cd07dd6fad P4 to Git Change 1179705 by bsumner@bsumner-lin-opencl on 2015/08/12 14:06:12
ECR #304775 - switch on "denorm" support on VI and make HSAIL default to not generating ftz modifier

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILISelDAGToDAG.cpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#320 edit


[ROCm/clr commit: 473c92d0f4]
2015-08-12 14:33:51 -04:00
foreman 502cc124b5 P4 to Git Change 1179663 by gandryey@gera-dev-w7 on 2015/08/12 13:14:46
EPR #419072 - [OpenCL2.0] Enable 16MB large on device queues
	- Enable device queue creation up to 12MB. That should allow to run Intel SDK sample from the EPR that requires 6MB queue only.
	- Currently a queue with >12.5MB size has a significant performance degradation. Thus the current max possible is 12MB. In general it's preferable to use the queue size more suitable for the task, rather than max possible.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#10 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.cpp#115 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpublit.hpp#38 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudefs.hpp#123 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#517 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#372 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#131 edit


[ROCm/clr commit: 1386191b6c]
2015-08-12 13:37:08 -04:00
foreman 3fdb8d9147 P4 to Git Change 1179475 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/12 03:00:15
EPR #010002 - Change OpenCL version number from 1868 to 1869.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1615 edit


[ROCm/clr commit: 6d9138334f]
2015-08-12 03:19:21 -04:00
foreman 66c524df26 P4 to Git Change 1179091 by nhaustov@nhaustov_hsa on 2015/08/11 05:48:01
ECR #333756 - Merge oclutils from runtime.

	Also updates temp file handling code to use pid/atomic which should fix Bug 10793.
	Note, changed the functions to use different prefix to avoid clash with runtime code.

	Reviewed by: Laurent Morichetti
	Testing: smoke, pre-checkin

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/os/alloc.cpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/os/os.cpp#5 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/os/os.hpp#6 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/os/os_posix.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/os/os_win32.cpp#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/atomic.hpp#4 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/monitor.cpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/monitor.hpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/semaphore.cpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/semaphore.hpp#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/thread.cpp#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/thread/thread.hpp#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/utils/debug.hpp#2 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/utils/macros.hpp#3 integrate
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/utils/traits.hpp#2 delete
... //depot/stg/opencl/drivers/opencl/compiler/lib/promotions/oclutils/utils/util.hpp#2 integrate


[ROCm/clr commit: 3607d18592]
2015-08-11 06:09:15 -04:00
foreman 8f669340df P4 to Git Change 1179013 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/11 03:00:11
EPR #010002 - Change OpenCL version number from 1867 to 1868.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1614 edit


[ROCm/clr commit: e41da28605]
2015-08-11 03:15:52 -04:00
foreman ff6e211cf6 P4 to Git Change 1178942 by bdhanase@bala_workpc_ocl on 2015/08/10 20:14:46
EPR #397491 - Disable OpenCL2.0 on Win 10 for now

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#319 edit


[ROCm/clr commit: 249153c6e0]
2015-08-10 20:23:20 -04:00
foreman 6124269c06 P4 to Git Change 1178405 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/08 03:00:14
EPR #010002 - Change OpenCL version number from 1866 to 1867.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1613 edit


[ROCm/clr commit: 1ed8b37ed7]
2015-08-08 03:09:18 -04:00
foreman 5901f7047c P4 to Git Change 1178388 by asalmanp@asalmanp-opencl-stg on 2015/08/07 19:02:14
ECR #417452 - [OpenCL][HWS/CWSR] Adding support for Hardware Scheduler and Compute Wave Save Restore (CWSR) feature on ORCA. runtime changes
	ReviewBoardURL = http://ocltc.amd.com/reviews/r/8159/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#371 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#132 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#53 edit


[ROCm/clr commit: e09e1601dd]
2015-08-07 19:13:26 -04:00
foreman 18672e3df8 P4 to Git Change 1178314 by bdhanase@bala_workpc_ocl on 2015/08/07 16:30:49
EPR #397491 - enable 32 bit opencl 2.0

	http://ocltc.amd.com/reviews/r/8191/

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#318 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#131 edit


[ROCm/clr commit: f204610517]
2015-08-07 16:45:39 -04:00
foreman 4cc01dd9f4 P4 to Git Change 1178230 by gandryey@gera-w8 on 2015/08/07 14:23:53
ECR #304775 - Mipmaps support in OpenCL
	- Enable mipmaps in mainline

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#237 edit


[ROCm/clr commit: 4cbfee2d74]
2015-08-07 14:44:53 -04:00
foreman 7778c3b127 P4 to Git Change 1178228 by marsenau@marsenau_home on 2015/08/07 14:23:27
ECR #304775 - Fork llvm-link into new tool called opencl-link

	Most of what the patched llvm-link does has nothing to do with linking LLVM IR, and more to do with loading SPIR and specifically handling the builtin library.

	Forking this into a separate tool is the fastest way to fix dependencies on large LLVM patches for the OpenCL build to work. With this in place, it should now be possible to move the various linker and SPIR conversions out of the llvm directory and into compiler library.

	Ideally this would be fixed by:

	1. Not always lowering the library from SPIR
	2. Having a separate SPIR lowering tool for testing
	3. Using function attributes and stub libraries for library FP options instead of linker flags
	4. Structuring all of the SPIR conversions as passes and having a single PassManager handle all of the lowering / linking / optimization passes.

	But accomplishing all of these will be more time consuming.

	Branching

	//depot/stg/opencl/drivers/opencl/compiler/llvm/tools/llvm-link/...

	to //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/...

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/build/Makefile.api#118 edit
... //depot/stg/opencl/drivers/opencl/compiler/Makefile#60 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/Makefile#34 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/build/Makefile.common#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#131 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/build/Makefile.complib#88 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/compliblinkerlibs#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/Makefile#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/include/AMDFixupKernelModule.h#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/include/AMDPrelink.h#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/include/AMDResolveLinker.h#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/AMDFixupKernelModule.cpp#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/AMDPrelink.cpp#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/AMDResolveLinker.cpp#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/Makefile#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/build/Makefile#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/build/Makefile.linker#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/lib/clpVectorExpansion.cpp#1 move/add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/Android.mk#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/Makefile#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/build/Makefile#1 branch
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/build/Makefile.opencl-link#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/linker/tools/opencl-link/opencl-link.cpp#1 add
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/AMDFixupKernelModule.h#3 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/AMDPrelink.h#2 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/include/llvm/AMDResolveLinker.h#7 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Linker/AMDFixupKernelModule.cpp#9 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Linker/AMDPrelink.cpp#2 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Linker/AMDResolveLinker.cpp#11 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Linker/clpVectorExpansion.cpp#3 move/delete
... //depot/stg/opencl/drivers/opencl/compiler/llvm/llvmdefs#41 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/tools/llvm-link/llvm-link.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/openclrules#87 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/bin/hsa_dist.pl#2 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/bin/test_driver.pl#18 edit


[ROCm/clr commit: f5885934a0]
2015-08-07 14:34:06 -04:00
foreman b3135a37f8 P4 to Git Change 1178177 by rili@rili_opencl_stg on 2015/08/07 12:18:10
EPR #424426 - enable large allocation for Win 10

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusettings.cpp#317 edit


[ROCm/clr commit: 36b4f843f5]
2015-08-07 12:30:33 -04:00
foreman fb2d2a6358 P4 to Git Change 1178107 by emankov@em-hsa-amd on 2015/08/07 09:05:47
ECR #333753 - Compiler Lib: switch Bif Version to 3.1 by default for HSAIL

	It is needed due to the latest AMD HSA Code Object introduction in BIF.

	TODO (in separate changes):
	1. Analyze the changes in sections/symbols and remove (if needed) unused anymore (in BIF31), for example, symISAMeta, check backward compatibility.
	2. Move the bif versions/conversions code from libUtils to loader\Bif.
	3. Refactor the bif versions/conversions code in order to get rid of copy/paste (templates?).
	4. Drop aclBIFVersionCAL.

	Testing: pre check-in

	Reviewer: Brian Sumner, Nikolay Haustov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/api/v0_8/acl.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclEnums.h#20 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif20.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif20.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif21.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif21.hpp#7 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif30.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif30.hpp#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif31.cpp#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bif31.hpp#1 add
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bifbase.cpp#53 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/loaders/bif/bifbase.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/bif_section_labels.hpp#22 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#21 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#75 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFAssumptionCheck.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/binary/BIFAssumptionCheck.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLEnumCheck.cpp#45 edit


[ROCm/clr commit: e54e8da337]
2015-08-07 09:11:37 -04:00
foreman 80b479c16b P4 to Git Change 1178030 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/07 03:00:12
EPR #010002 - Change OpenCL version number from 1865 to 1866.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1612 edit


[ROCm/clr commit: 8fe643e7b6]
2015-08-07 03:15:07 -04:00
foreman d939bb5f12 P4 to Git Change 1177889 by weizhang@msdnweizhang-opencl on 2015/08/06 17:16:54
EPR #421017 - IOMMU2/SVM on CZ Win10, the bit INST_ATC of  COMPUTE_PGM_HI needs to be set for device enqueue.

Affected files ...

... //depot/stg/opencl/drivers/opencl/library/hsa/hsail/src/devenq/schedule.cl#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpusched.hpp#16 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#370 edit


[ROCm/clr commit: 7a54d367f3]
2015-08-06 17:39:55 -04:00
foreman 337d6861b4 P4 to Git Change 1177724 by jatang@jatang-opencl-hsa-stg1 on 2015/08/06 09:59:33
EPR #424562 - Add Averaging algorithm to Wave Limiter.

	1. Extract the algorithms to a sub-class of the wave limiter class.
	2. Add Averaging algorithm

	This averaging algorithm typically improves performance of BasemarkCL wave simulation by 8% on Tonga/Fiji than the current smooth algorithm. This change has not enable the averaging algorithm yet. Follow-up changes should be made to intelligently select which algorithm to use.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.cpp#7 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuwavelimiter.hpp#4 edit


[ROCm/clr commit: 8fe485de95]
2015-08-06 10:19:59 -04:00
foreman 8ab2e462d2 P4 to Git Change 1177633 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/06 03:00:10
EPR #010002 - Change OpenCL version number from 1864 to 1865.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1611 edit


[ROCm/clr commit: fd5d51f911]
2015-08-06 03:06:54 -04:00
foreman 0fc4f1da76 P4 to Git Change 1177536 by bdhanase@opencl_merge_emailpc on 2015/08/05 18:23:44
EPR #397491 - Back out changelist 1177450.  Disable 32 bit generic address space

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#130 edit


[ROCm/clr commit: bcbc9101d4]
2015-08-05 18:31:02 -04:00
foreman cb1988e4f5 P4 to Git Change 1177450 by bdhanase@bala_workpc_ocl on 2015/08/05 15:40:16
EPR #397491 - Enable generic address space for 32 bit on Windows

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#129 edit


[ROCm/clr commit: 65a624e904]
2015-08-05 16:07:37 -04:00
foreman 4355623d0b P4 to Git Change 1177220 by emankov@em-hsa-amd on 2015/08/05 06:08:39
ECR #333753 - ORCA RT/Compiler Lib/aoc2: AMD HSA Code Object Import feature (part II) - arbitrary hidden (extra) kernargs support

	Only HSAIL path is affected. It doesn't affect blit kernels.

	To use offline by aoc2:
	aoc2 -hsacodeobject=<importing_code_object_filename> -numhiddenkernargs=<num> -cl-std=CL2.0 -march=hsail(-64) -mdevice=Bonaire <source_cl_filename>

	To use online by setting env:
	AMD_DEBUG_HSA_NUM_HIDDEN_KERNARGS=<num>

	where num >= 0. If num == 0, then no additional arguments will be added on RT for every kernel. The default value is unchanged and equal to 6 for now.

	Misc:
	+ get rid of PRE & POST defines in Compiler Lib, as they started to conflict with ugl\gl\gs\hwl\ headers with the same defines.
	+ minor copy/paste eliminations & typo fixes
	+ ocltst complib tests update

	Testing: pre check-in, manually based on ocl sdk MatrixMultiplication

	Reviewers: Brian Sumner, German Andryeyev, Nikolay Haustov, Artem Tamazov

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/v0_8/if_acl.cpp#72 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/hsail_be.cpp#49 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/gpu/metadata.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclDefs.h#5 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclEnums.h#19 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/include/v0_8/aclStructs.h#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/bif_section_labels.hpp#21 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/compiler/lib/utils/v0_8/libUtils.h#20 edit
... //depot/stg/opencl/drivers/opencl/compiler/tools/aoc2/aoc2.cpp#74 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#181 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#249 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.cpp#291 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpukernel.hpp#113 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#199 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#369 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa/hsaprogram.cpp#38 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsakernel.cpp#8 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsakernel.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsaprogram.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/hsa_foundation/hsavirtual.cpp#43 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLAssumptionCheck.cpp#43 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/complib/CLEnumCheck.cpp#44 edit


[ROCm/clr commit: 81b331f4c5]
2015-08-05 06:18:33 -04:00
foreman fe82f3d59b P4 to Git Change 1177133 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/05 03:00:49
EPR #010002 - Change OpenCL version number from 1863 to 1864.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1610 edit


[ROCm/clr commit: 881eeab3b8]
2015-08-05 03:11:12 -04:00
foreman 5081502f55 P4 to Git Change 1176728 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/04 03:00:13
EPR #010002 - Change OpenCL version number from 1862 to 1863.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1609 edit


[ROCm/clr commit: 65d6919554]
2015-08-04 03:10:51 -04:00
foreman a222830411 P4 to Git Change 1176331 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/02 03:00:10
EPR #010002 - Change OpenCL version number from 1861 to 1862.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1608 edit


[ROCm/clr commit: c14f291dc2]
2015-08-02 03:09:10 -04:00
foreman 5db90bd1ad P4 to Git Change 1176310 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/08/01 03:00:10
EPR #010002 - Change OpenCL version number from 1860 to 1861.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1607 edit


[ROCm/clr commit: 108ae9395d]
2015-08-01 03:20:54 -04:00
foreman e318ca8aa1 P4 to Git Change 1176126 by smekhano@stas-nova-hsa on 2015/07/31 12:20:56
ECR #333753 - HSA HLC: disable nfma_f32, nrcp_f32 and nsrqrt_f32 if denorms are not flushed
	Testing: smoke, precheckin
	Reviewed by Brian Sumner

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/lib/backends/common/linker.cpp#130 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/BRIGAsmPrinter.cpp#122 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILFusion.td#17 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILISelDAGToDAG.cpp#47 edit
... //depot/stg/opencl/drivers/opencl/compiler/llvm/lib/Target/HSAIL/HSAILInstructions.td#12 edit
... //depot/stg/opencl/drivers/opencl/tests/hsa/tlst/llc_opt.tlst#41 edit


[ROCm/clr commit: a320fe1f95]
2015-07-31 12:39:52 -04:00
foreman 57078ac448 P4 to Git Change 1175956 by nhaustov@nhaustov_hsa on 2015/07/31 04:55:15
ECR #333756 - Bug 10951: HSA Loader: Handle HSA_EXT_SAMPLER_ADDRESSING_MODE_UNDEFINED addressing mode.

	This fixes Compubench failure.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#198 edit


[ROCm/clr commit: 9508f6b67f]
2015-07-31 05:04:50 -04:00
foreman 6a91542a18 P4 to Git Change 1175937 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/07/31 03:00:11
EPR #010002 - Change OpenCL version number from 1859 to 1860.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1606 edit


[ROCm/clr commit: 8fe89d4d5a]
2015-07-31 03:35:06 -04:00
foreman 1d21b8cfb2 P4 to Git Change 1175724 by gandryey@gera-ubuntu14 on 2015/07/30 15:50:59
ECR #304775 - Remove CALdevicestatus

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#516 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#146 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#128 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#52 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#7 edit


[ROCm/clr commit: 4b97729c2e]
2015-07-30 16:07:40 -04:00
foreman 4322990c61 P4 to Git Change 1175561 by mbareghe@mbareghe_stg_win30 on 2015/07/30 10:53:59
EPR #419362 - Forum [170348]: problem with printf for OpenCL 2.0 kernel build - fix for regression caused in Sanity debugger tests.

Affected files ...

... //depot/stg/opencl/drivers/opencl/compiler/llvm32/lib/Target/AMDIL/AMDILKernelManager.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#197 edit


[ROCm/clr commit: 6ee94556b7]
2015-07-30 11:08:43 -04:00
foreman 5bc370b9ab P4 to Git Change 1175426 by johtaylo@johtaylo-JTBUILDER03-increment on 2015/07/30 03:00:13
EPR #010002 - Change OpenCL version number from 1858 to 1859.

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#1605 edit


[ROCm/clr commit: 51ecd77d53]
2015-07-30 03:11:51 -04:00
foreman 6fb6013664 P4 to Git Change 1175317 by gandryey@gera-dev-w7 on 2015/07/29 17:42:25
ECR #304775 - Remove obsolete functions

Affected files ...

... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#515 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.cpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDevice.h#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceLinux.cpp#2 delete
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/GSLDeviceWin.cpp#5 delete
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.cpp#13 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/backend.h#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gslbe/src/rt/build/Makefile.backend#8 edit


[ROCm/clr commit: f48633c15c]
2015-07-29 17:52:26 -04:00