Граф коммитов

91 Коммитов

Автор SHA1 Сообщение Дата
Jay Cornwall 90ab72cd66 Implement optimized blit/fill kernels
Replace HSAIL kernels with SP3 shaders.
Support all alignment variations efficiently.

Change-Id: Icf7f5471f3ba68389f55484d82f2805dd9bc3827
2016-05-10 21:51:57 -05:00
James Edwards 023b302fae Add image and tools cmake files to the opensrc directory.
Change-Id: I9e95d391992fa6ad7d13b500cd28eb0fb93dda1d
2016-05-03 17:01:14 -05:00
bwicakso 6ea42ae333 hsa_amd_agent_memory_pool_get_info gives wrong results for gfx803. Root cause: missing break point when querying the num hop attribute. Other change: max the reported num hop to 1 since the runtime does not have enough information about each hop, also clarified the comment about HSA_AMD_AGENT_MEMORY_POOL_INFO_NUM_LINK_HOPS attribute in the header file
Change-Id: I5d868eb457666e1377d5308f6145e76176bbfaf7
2016-05-03 12:52:38 -04:00
James Edwards 24714cb769 Remove whitespace from comments in CMakeLists.txt
Change-Id: I9a94a6f224a5cbd5fb1f8b57ed0c369339e23228
2016-04-28 11:24:02 -05:00
Shi, Aaron (en ye) (xN/A) TO ad21f0606e HSA Finalizer: Promote SC PRM -> Finalizer (HSA tree) up to CL 1258514
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1259784]
2016-04-19 15:31:52 -05:00
Jay Cornwall (xN/A) UK 1d4a257225 Fix SDMA fill for >=4MB regions
max_single_fill_size_ overflowed the packet field size. Reduce by one dword.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1259263]
2016-04-18 16:05:13 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 5a584fa1ab Fix query HSA_AMD_AGENT_MEMORY_POOL_INFO_LINK_INFO
Querying HSA_AMD_AGENT_MEMORY_POOL_INFO_LINK_INFO between a gpu agent
and its own local memory pool returns a wrong information.
Fix: return link with 0 hop count.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1257544]
2016-04-13 12:39:25 -05:00
Hari Thangirala 0545761aa9 ROCR Build ID support
Fix dirty-tree status. Thanks to Fan for fixing the issue.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256716]
2016-04-11 18:48:29 -05:00
Besar Wicaksono (xN/A) TX [TEXT] ea67bb8374 Sdma wraparound optimization.
Remove mutex and just make the thread spin again if the queue is wrapping.
Remove the wait for the queue to finish wrapping, and just check if there is enough space to recycle when reserving queue space.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256713]
2016-04-11 18:31:45 -05:00
James Edwards (xN/A) TX 871412adff Remove ENV variables from CMakeLists.txt files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256687]
2016-04-11 17:18:01 -05:00
Hari Thangirala a148fd0b68 ROCR Build ID support
Build system/Package maintainer:
-    BUILDID is specified at cmake.
-    USAGE: cmake -DBUILDID=<ID> ../src

For developer builds the who typically don?t provide BUILDID, cmake will:
-    Determine the last git commit when this tree was syncd 
-    Deteremine the build date 
-    Check if tree is clean when built 

The idea of this embedded string is that later when you get a ROCR build, you can get some idea on the build origination by using: strings libhsa-runtime.so.1 | grep ?ROCR BUILD ID?

For eg:
-    If it?s a Jenkins build 25, it returns: ?ROCR  BUILD ID: 25?
-    If it?s a developer build sync'd @ 06f5f2a with modifications, it returns: ?ROCR BUILD ID: 06f5f2a-2016-04-11-0"

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256588]
2016-04-11 15:03:06 -05:00
Zhuravlyov, Konstantin (x21446) MA 503fd728dd Fail gracefully if memory allocation did not succeed
Testing: precheckin (http://ocltc.amd.com:8111/viewModification.html?modId=69427&personal=true&tab=vcsModificationBuilds)

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256179]
2016-04-09 16:40:24 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 2ebde5d2a7 Fix unit test build error due to CL#1256098
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256119]
2016-04-08 16:51:45 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 7760839934 Fix build error from CL#1256102 due to whitespace issue.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256108]
2016-04-08 16:40:05 -05:00
Besar Wicaksono (xN/A) TX [TEXT] a03c5148a7 Add AMD extension version
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256102]
2016-04-08 16:31:00 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 4ccc695b95 Add global memory clock and width info on the agent attribute list and deprecate the ones in the memory region attribute list.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256098]
2016-04-08 16:29:10 -05:00
Nikolay Haustov [TEXT] a795909bca Cherry-pick CL 1250286 from SC stg.
HSA Finalizer: Add dumping of code object, ISA and executable to loader.

This is controlled by loader options -dump-all, -dump-isa, -dump-code, -dump-exec

The options can now also be set with env variable LOADER_OPTIONS_APPEND.

Added tests to finalizer_offline

Testing: smoke, dumping on hardware

Reviewed by: Konstantin Zhuravlyov

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1255351]
2016-04-07 06:01:20 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 823c254d61 Cleanup TODO format
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1255182]
2016-04-06 16:50:50 -05:00
Ramesh Errabolu (xN/A) TX b93946790d Update Private Segment Size parameter of the dispatch packet
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1254638]
2016-04-05 14:03:33 -05:00
Besar Wicaksono (xN/A) TX [TEXT] c95f96a9e4 Add environment flag to enable sdma workaround that will wait for the sdma queue to be idle before updating the write pointer. Add class to manage environment flags.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1254004]
2016-04-01 17:13:45 -05:00
James Edwards (xN/A) TX e3670a2bef Branch Brig.h file into opensrc hsa-runtime directory.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251455]
2016-03-25 15:26:18 -05:00
Nikolay Haustov [TEXT] 46842a57e5 HSA Finalizer: Merge changes in libamdhsacode and loader from sc_prm into hsa/compiler/finalizer and hsa/runtime.
Testing: pre-checkin

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251389]
2016-03-25 08:36:20 -05:00
Zhuravlyov, Konstantin (x21446) MA f6565a2f70 Clean up extensions and provide public extension/API to query host address given device address:
- Partially remove 'amd_load_map' extension because it is not used and will not be used
- Remove 'hsa_amd_query_kernel_host_address' API
- Add 'hsa_ext_amd_loaded_code_object' extension
- Add 'hsa_ext_amd_loaded_code_object_query_host_address' API
	- Most likely to be used by debugger, profiler, and hcc (printf)
- Update affected sources
	- 'hsa_system_extension_supported'
	- 'hsa_system_get_extension_table'
	- SoftCP path
- Integrate CLs 1250699, 1251204, 1251214 from stg sc

ReviewBoardURL: http://ocltc.amd.com/reviews/r/10091/
Testing: smoke (ok), teamcity (ok), samples on fiji (AQL and SoftCP) (ok)

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251223]
2016-03-24 19:00:30 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 9fa0531950 Always wait queue wrap around to finish and dont return not enough resource.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251141]
2016-03-24 15:52:45 -05:00
Sean Keely (xN/A) TX 1c7142c129 Minor fix to hsa_amd_image_descriptor_t.
Change uint32_t data[0]; to uint32_t data[1];

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251050]
2016-03-24 13:24:22 -05:00
James Edwards (xN/A) TX 7d2bc9d113 Separate open source core runtime code from DK makefiles.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1250152]
2016-03-22 18:10:13 -05:00
James Edwards (xN/A) TX 7d1e6c3a57 Remove opensrc test files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249961]
2016-03-22 13:39:51 -05:00
James Edwards (xN/A) TX c9ffe0004e Check open source core runtime code into perforce. This includes license and README files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249136]
2016-03-20 15:39:40 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 73d43224e9 Add IOLink support
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1247220]
2016-03-14 18:42:31 -05:00
Ramesh Errabolu (xN/A) TX f7693cf777 Configure AQL packet header with System Scope for flush
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1240170]
2016-02-24 14:08:35 -05:00
Besar Wicaksono (xN/A) TX [TEXT] bbe0be05d4 Modify MatrixMultiplication sample to use memory pool API
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1237420]
2016-02-16 11:12:25 -05:00
Besar Wicaksono (xN/A) TX [TEXT] c494af9d49 Add sample application to use the new memory pool API.
Details:
- add HsaGetInfo program that prints out all available CPU, GPU and their respective memory pools.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1237219]
2016-02-15 18:11:44 -05:00
Ramesh Errabolu (xN/A) TX 2280190f70 Populate Cpu and Gpu nodes into different agent lists
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1236865]
2016-02-12 16:14:39 -05:00
Ding, Wei (xN/A) TX df99562905 Changes 5 hsail apps for supporting gfx803.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1235366]
2016-02-08 15:39:18 -05:00
Nikolay Haustov [TEXT] d8e67d962b Split libHSAIL and libHSAIL-AMD (HSA Changes)
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1223723]
2015-12-28 10:00:43 -05:00
Ramesh Errabolu (xN/A) TX 2f0425d354 Update Binarysearch and BlackScholes Hsa Sample to support FULL and BASE Profiles
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1206169]
2015-10-30 17:53:25 -05:00
Ding, Wei (xN/A) TX a32c2b9854 ECR #333755 - HSA samples changes for dGPU. All passed on gfx802.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1186398]
2015-08-31 14:41:50 -05:00
Andrey Kasaurov (xN/A) SP 3bbf3c6a8b Fix HSA Finalizer
including Cherry pick of CL#1166690 from SC Stg and update for RT samples. It contains:

Change 1166280 on 2015/06/30 by bolek@bolek-common2

EPR #092474 - Fix missing integrations

	Change 1164156 on 2015-06-23 by nhaustov
	ECR #010005 - HSA Finalizer: Add missing tests.

	Change 1164232 on 2015-06-23 by nhaustov
	ECR #333756 - HSA Finalizer: Implement reading of notes.

Change 1166268 on 2015/06/30 by bolek@bolek-laser

EPR #010001 - Promotion of the Shader Compiler (SC)

	Release SC Library version 0001.IL01-02.0339 Date: June 30, 2015
	Changelist (stg/sc): 1165197

	Change 1163976 on 2015-06-22 by sashao
	EPR #373149 - OpenGL ES 3.0 Development

	Change 1164122 on 2015-06-23 by nhaustov
	ECR #033756 - HSA Finalizer: Fix compilation warnings.

	Change 1164150 on 2015-06-23 by efinger
	EPR #092474 - bugzilla 10829 - optimize out useless V_PERM_B32 feeding packed math op, in early expansion, rather than late expansion, and do it regardless of whether the packed op will be split or not.

	Change 1164187 on 2015-06-23 by efinger
	EPR #092474 - Add and use GetUAVInfo() and GetNumUAVs()

	Change 1164194 on 2015-06-23 by rgottlie
	EPR #092474 - Fix Linux Build Issue for SC_OPEN_SOURCE

	Change 1164204 on 2015-06-23 by mbedy
	EPR #092474 - Update Open Source build - now working from SC stg.

	Change 1164216 on 2015-06-23 by rouellet
	EPR #092474 - Add directive to do what -il_interpreter does.

	Change 1164232 on 2015-06-23 by nhaustov
	ECR #333756 - HSA Finalizer: Implement reading of notes.

	Change 1164239 on 2015-06-23 by nhaustov
	ECR #333756 - HSA Finalizer: Fix OpenCL build problem.

	Change 1164275 on 2015-06-23 by nhaustov
	ECR #333756 - HSA Finalizer: Fix Linux build errors.

	Change 1164365 on 2015-06-23 by efinger
	EPR #092474 - Cleanup UAV Atomic handling

	Change 1164393 on 2015-06-23 by kzhuravl
	EPR #333756 - Finalizer/Loader fixes

	Change 1164654 on 2015-06-24 by dpreobra
	ECR #333753 - HSA HLC: SPB_ASM: TestGen improvements

	Change 1164727 on 2015-06-24 by bolek
	EPR #092474 - Enable function level linking, COMDAT folding and unused function removal optimizations in Dev release builds. This saves about 2.6MB in code size on 64-bit Dev.

	Change 1164760 on 2015-06-24 by rgottlie
	EPR #422210 - Fix problem with TransformScratch heuristics

	Change 1164761 on 2015-06-24 by rgottlie
	EPR #422181 - Fix handling of sub-dword load instructions in propagation of immediates from store to load in RefineMemory

	Change 1164764 on 2015-06-24 by bolek
	EPR #092474 - Add missing const

	Change 1164769 on 2015-06-24 by efinger
	EPR #092474 - Cleanup GDS atomics

	Change 1164776 on 2015-06-24 by efinger
	EPR #092474 - Fix linux build

	Change 1164799 on 2015-06-24 by mbedy
	EPR #092474 - Improve alignment for 2 DWORD instructions by more closely

	Change 1164803 on 2015-06-24 by efinger
	EPR #092474 - Open Source Cleanup

	Change 1164809 on 2015-06-24 by bfavela
	EPR #092474 - Escape an infinite loop in shader during the build of a DAG when a block is visited twice

	Change 1164814 on 2015-06-24 by bolek
	EPR #092474 - Add Dev command line option to disable individual peephole patterns (blame Chris for this one).

	Change 1164827 on 2015-06-24 by bfavela
	EPR #092474 - Adding small change to CL 1164809 as suggested by creeve to remove superfluous if()

	Change 1164842 on 2015-06-24 by gujin
	EPR #092474 - Prevent moving exit-loop checking to the end of loop if there is a branch in the loop that is optimized with a target replacement bypassing the loop end. This is to fix an OpenGL hull shader conformance test fail (bug 10859).

	Change 1164876 on 2015-06-24 by rgottlie
	EPR #092474 - Only allow memory merging if no memory scope or order is specified

	Change 1164883 on 2015-06-24 by kdintino
	EPR #092474 - Add HSAIL files to the AMD -> LLVM copyright replacement loop.

	Change 1165060 on 2015-06-25 by efinger
	EPR #092474 - Open Source Cleanup - Copyright

	Change 1165077 on 2015-06-25 by efinger
	EPR #092474 - Cleanup LDS atomics - part 1 (groundwork)

	Change 1165080 on 2015-06-25 by bfavela
	EPR #092474 - Extension to SUPPRESS_PI_REDUCE_F32 for TAN (TAN_F16 is already handled by expansion)

	Change 1165189 on 2015-06-25 by efinger
	EPR #092474 - Cleanup LDS atomics - part 2

	Change 1165196 on 2015-06-25 by bolek
	EPR #092474 - Add syntax to the peephole pattern language to specify SCInst flag values or wildcards.

	Change 1165197 on 2015-06-25 by bolek
	EPR #092474 - Allow the MulAddToMadF peephole pattern to modify instructcions marked as invariant (result should still be the same)

Change 1165438 on 2015/06/26 by bolek@bolek-common2

EPR #010001 - Promotion of the Shader Compiler (SC)

	Release SC Library version 0001.IL01-02.0338 Date: June 26, 2015
	Changelist (stg/sc): 1163954

	Change 1161629 on 2015-06-15 by efinger
	EPR #092474 - Move CFG:IL2IRProcessDeclare() to global scope

	Change 1161633 on 2015-06-15 by rouellet
	EPR #092474 - Bugzilla 10852 call ConverInstFields when translating COND_MOVE.

	Change 1161643 on 2015-06-15 by rgottlie
	EPR #092474 - Handle manually inserted wait state for SALU writing M0 followd by VINTERP

	Change 1161718 on 2015-06-15 by lifpan
	EPR #092474 - The "point size" in copy shader of GS

	Change 1161721 on 2015-06-15 by xlji
	EPR #092474 - Split DIV_F16 and DIV_PRECISE_F16 

	Change 1161850 on 2015-06-16 by kzhuravl
	EPR #333756 - Change a few function names, general cleanup (no functional change)

	Change 1161934 on 2015-06-16 by efinger
	EPR #092474 - Fix linux compile warnings

	Change 1161946 on 2015-06-16 by nhaustov
	ECR #333756 - HSA Finalizer: Fix Linux build warnings.

	Change 1161981 on 2015-06-16 by efinger
	EPR #092474 - Open Source Cleanup

	Change 1161991 on 2015-06-16 by efinger
	EPR #092474 - Move CFG::IL2IRProcessSpecial() to global scope

	Change 1161997 on 2015-06-16 by rgottlie
	EPR #092474 - Fix compile warnings under Linux

	Change 1162001 on 2015-06-16 by efinger
	EPR #092474 - Fix linux build

	Change 1162045 on 2015-06-16 by mherdeg
	EPR #092474 - Comment out unused functions to fix linux compiler warnings.

	Change 1162048 on 2015-06-16 by akasauro
	EPR #092474 - SC: Some AMD OCL SDK tests (including BinomialOption) assert in SCInst.cpp. [on behalf of Atrem Tamazov]

	Change 1162061 on 2015-06-16 by efinger
	EPR #092474 - Rename NewIRInst to MakeIRInst and drop last (unused) arg.

	Change 1162066 on 2015-06-16 by creeve
	EPR #092474 - Linux build fixes for open source.

	Change 1162067 on 2015-06-16 by creeve
	EPR #092474 - Improve hash table grow and sanitize.

	Change 1162072 on 2015-06-16 by creeve
	EPR #092474 - Peephole |x| * |x| => x*x

	Change 1162089 on 2015-06-16 by chfang
	EPR #092474 - Fix linux compiler warnings in SCStructureAnalyzer.cpp.

	Change 1162145 on 2015-06-16 by efinger
	EPR #092474 - Improve interface to MakeInstOp[123]

	Change 1162427 on 2015-06-17 by efinger
	EPR #092474 - bugzilla 10862 - Back out changelist 1161549

	Change 1162434 on 2015-06-17 by rgottlie
	EPR #092474 - Only dump individual functions in each pass of Refine Memory

	Change 1162436 on 2015-06-17 by kzhuravl
	EPR #333756 - Integrate runtime independent loader from stg hsa + update project files

	Change 1162442 on 2015-06-17 by efinger
	EPR #092474 - Add and use CreateRegTemp()

	Change 1162505 on 2015-06-17 by skolton
	ECR #333756 - HSA Finalizer: Doorbell signals support 

	Change 1162527 on 2015-06-17 by kzhuravl
	EPR #333756 - Always set dx10_clamp to true for hsa

	Change 1162531 on 2015-06-17 by efinger
	EPR #092474 - Fix linux compile warnings

	Change 1162568 on 2015-06-17 by mbedy
	EPR #092474 - Specify a newer DX9 SDK for SCDevUtil that correctly links with WDK n10136.

	Change 1162623 on 2015-06-17 by mherdeg
	EPR #092474 - Remove duplicate #include "SCHSAInterface.h". It confuses Intellisense in Visual Studio.

	Change 1162905 on 2015-06-18 by rgottlie
	EPR #092474 - Fix Linux Build Warnings

	Change 1162930 on 2015-06-18 by nhaustov
	ECR #333756 - HSA Finalizer: Cleanup amdhsafin command-line tool.

	Change 1162938 on 2015-06-18 by nhaustov
	ECR #333756 - HSA Finalizer: Fix build problem.

	Change 1162944 on 2015-06-18 by rgottlie
	EPR #092474 - Clean up bug descriptions as per Phil's suggestion

	Change 1162951 on 2015-06-18 by skolton
	ECR #333756 - HSA Finalizer:  Bug fix for 1DB query image

	Change 1163009 on 2015-06-18 by nhaustov
	ECR #333756 - HSA Finalizer: build amdhsafin with WITH_LIBBRIGDWARF when needed.

	Change 1163263 on 2015-06-19 by nhaustov
	ECR #092474 - Fix patgen VS build by quoting %TMPDIR%.

	Change 1163265 on 2015-06-19 by skolton
	ECR #333756 - HSA Finalizer: Fix for doorbell signal store.

	Change 1163310 on 2015-06-19 by nhaustov
	ECR #333756 - HSA Finalizer: Introduce separate amdhsacode library.

	Change 1163316 on 2015-06-19 by nhaustov
	ECR #333756 - HSA Finalizer: Fix OpenCL build problem.

	Change 1163320 on 2015-06-19 by nhaustov
	ECR #333756 - HSA Finalizer: Fix another OpenCL build problem.

	Change 1163331 on 2015-06-19 by mjared
	EPR #092474 - Replace asin/acos 5th order minimax polynomial with a 6th order double locked (at 0 and 1) minimax polynomial

	Change 1163353 on 2015-06-19 by efinger
	EPR #092474 - Use normal temps (not expansion temps) for expansion template T regs.

	Change 1163473 on 2015-06-19 by mjared
	EPR #092474 - Improve accuracy of ATAN instruction by replacing rational approximation with a 17th order double locked minimax polynomial. Also increase degree of ASIN/ACOS double locked minimax polynomial to 7.

	Change 1163475 on 2015-06-19 by creeve
	EPR #092474 - Avoid putting partial write on export instruction. This feature existed before but only occurred if the output was point sprite. This change removed that restriction. Also fixed the implementation of //EsMode and //LsMode shader directi

	Change 1163481 on 2015-06-19 by mjared
	EPR #092747 - Misc. python scripts for working with transcendental functions. Includes fast implementation of remez minimax algorithm for absolute error and slower optimization-based remez for weighted/custom error reduction.

	Change 1163528 on 2015-06-19 by creeve
	EPR #092474 - Fix build issue.

	Change 1163603 on 2015-06-21 by bolek
	EPR #092474 - patgen makefile cleanup

	Change 1163614 on 2015-06-21 by kzhuravl
	EPR #333756 - Integrate runtime independent loader changes from stg hsa

	Change 1163699 on 2015-06-22 by rouellet
	EPR #092474 - bugzilla 10854 Get cb0[1] initialized with group dimensions for compute shaders on r800 and newer.  Make IL and HW interpreter details and variable names more closely match. Flush denorms when doing cube mapped samples (the cb0[1] init 

	Change 1163713 on 2015-06-22 by efinger
	EPR #092474 - Convert all usage of expansion temps to regular temps

	Change 1163718 on 2015-06-22 by nhaustov
	ECR #333756 - HSA Finalizer: Add loader (-loader option) to amdhsafin and update tests.

	Change 1163732 on 2015-06-22 by nhaustov
	ECR #333756 - HSA Finalizer: Implement images in amdhsafin loader and update tests.

	Change 1163774 on 2015-06-22 by mbedy
	EPR #092474 - Strip _DEV macros from open source. Fix issue in ifdef stripping.

	Change 1163786 on 2015-06-22 by mbedy
	EPR #092474 - Revert unintentionally submitted change.

	Change 1163803 on 2015-06-22 by bolek
	EPR #092474 - Peephole compile-time performance improvements

	Change 1163832 on 2015-06-22 by efinger
	EPR #092474 - bugzilla 10849 - fix copy propagation bug with SDWA

	Change 1163916 on 2015-06-22 by efinger
	EPR #092474 - Nuke support for expansion temps

	Change 1163954 on 2015-06-22 by creeve
	EPR #092474 - More code sanitization.

Change 1164740 on 2015/06/24 by vpykhtin@vpykhtin-SC

ECR #333753 - Cherrypicking CL1164641 from stg/sc (that is cherrypick of CL1164640 form stg/opencl)

Testing: TC PSDB

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1167011]
2015-07-02 10:27:31 -05:00
Nikolay Haustov [TEXT] 5a8c84e012 ECR #010005 - Update HSA samples and test to use libHSAIL high-level tool interface.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1155310]
2015-05-28 05:38:18 -05:00
Zhuravlyov, Konstantin (x21446) MA 20bed7ce7f ECR #333756 - Add support for relocations/offline global support in finalizer/loader
Testing: precheckin (http://ocltc:8111/viewModification.html?modId=51121&personal=true&init=1&tab=vcsModificationBuilds)

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1147298]
2015-05-05 08:29:40 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 39a6a81461 ECR #333755 - Part 1. update hsa_Ext_amd.h according to HSART 1.0F spec.
Details:
- Refactor names of the type and interface in hsa_ext_amd.h according to the format specified by 1.0F spec.
- Update the core, ext, and tools implementation with the new name.
- Update oclhsa layer with the new interface.

Test: TC precheckin http://ocltc.amd.com:8111/viewLog.html?buildId=12461559&tab=buildResultsDiv&buildTypeId=PrecheckinTestsOpenCLSanity_PrecheckinSanitygfx700Linux64bitDebugHsaFoundation

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1145877]
2015-04-29 12:29:56 -05:00
Ding, Wei (xN/A) TX 0f6deb9cf9 ECR #333755 - Restructure code to make timing function as close as possible to the one of OpenCL SDK samples.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1142515]
2015-04-20 16:58:41 -05:00
Ding, Wei (xN/A) TX 13fbcac8d0 ECR #333755 - Add code for timing setup, kernel execution, and total time.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1141931]
2015-04-17 12:12:16 -05:00
Ding, Wei (xN/A) TX 73a921b926 ECR #333755 - Merging HSA samples from prm to stg to conform the 1.0F
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1138673]
2015-04-08 14:46:33 -05:00
Ding, Wei (xN/A) TX 8863254f76 ECR #333755 - Code clean up for releasing.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1138207]
2015-04-07 16:16:06 -05:00
Ding, Wei (xN/A) TX 3b6932dc39 ECR #333755 - Rename base class file name.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1137731]
2015-04-06 16:59:46 -05:00
Ding, Wei (xN/A) TX fc9f4cff44 ECR #333755 - Rename base class file name.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1137730]
2015-04-06 16:59:17 -05:00
Ding, Wei (xN/A) TX 31250a5f80 ECR #333755 - Rename hsa samples base class.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1137729]
2015-04-06 16:57:54 -05:00
Ramesh Errabolu (xN/A) TX 13d77a9c6b ECR #333755 - Updates to build Hsa Samples on Windows for Non-Prm branch
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1134033]
2015-03-24 19:17:26 -05:00
Ding, Wei (xN/A) TX 533bad8360 ECR #333755 - Fixed linux compilation of undefined functions.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1133906]
2015-03-24 14:28:03 -05:00