Commit Graph

2925 Commits

Author SHA1 Message Date
Konstantin Zhuravlyov bf46c9e840 Add support for dynamic relocations (code object v2.1)
Change-Id: Ic19be97d3ea78b53f5aa814787515b587d0be21b


[ROCm/ROCR-Runtime commit: 5a14d496ab]
2016-05-26 14:09:07 -04:00
Besar Wicaksono 76dea4613a Add profiling support to DMA copy function
Change-Id: Iadeefa2692f35d9305ac1b242284a6220d5830a7


[ROCm/ROCR-Runtime commit: a8b00680b6]
2016-05-26 11:29:29 -04:00
James Edwards 0a5966812c Correct minor issues in License text and sample code for hsa-rocr-dev package.
Change-Id: If1c4387794de3cb707a8ba8281a40a1123130c95


[ROCm/ROCR-Runtime commit: 50339c12f1]
2016-05-26 09:42:24 -04:00
Ramesh Errabolu a09c9bfb04 Refactor Scratch Memory Descriptor Initialization
Change-Id: Ib4a136c266646cc5d5f5afb98f4aaf9266d02072


[ROCm/ROCR-Runtime commit: 383ed6983f]
2016-05-25 22:17:43 -04:00
James Edwards 1c58bfd7fd Add hsa-rocr-dev packaging CMakeList.txt file.
Change-Id: I1f6a0d4ad44aa7f20f43d43942719f668b620c36


[ROCm/ROCR-Runtime commit: ec6478e693]
2016-05-25 17:04:27 -04:00
James Edwards 3d304b4f99 Add hsa-ext-rocr-dev automatic packaging.
Change-Id: Ieb0d179b4e1a398a9400bd80037a46d0513582bc


[ROCm/ROCR-Runtime commit: 72cb6dd33f]
2016-05-23 10:10:44 -04:00
Amber Lin de82e820c2 Add performance counters for gfx70x
Add performance counters for gfx70x. The reference is the gfx7 register spec.
The register being looked at is SQ_PERFCOUNTER0_SELECT.

Change-Id: I344bfb7452f6148f4dc268163d12c553c6be8424


[ROCm/ROCR-Runtime commit: 6d21c4e753]
2016-05-20 16:24:36 -04:00
Besar Wicaksono 520d33a875 Use lazy initialization to create Blit objects
Change-Id: I388865030dc2538c5c881c055e38af52a57f6d87


[ROCm/ROCR-Runtime commit: bc589048a9]
2016-05-20 14:26:06 -04:00
James Edwards e69fdb12c2 Update hsa-ext-image CMakeList.txt file to include static lib compiler options
Change-Id: I06cff984d3dc169cdb30832bf0115bc7d821eadf


[ROCm/ROCR-Runtime commit: ceab9a3eb0]
2016-05-19 15:48:42 -05:00
shaoyunl 62d4e557ed libhsakmt: Add new device id for virtualized function of gfx803
Signed-off-by: Shaoyun Liu <Shaoyun.liu@amd.com>
Change-Id: I90b0bdaeaed8e9e80375e5a7a142205f2a542289


[ROCm/ROCR-Runtime commit: 16d5aa0d83]
2016-05-12 13:25:01 -04:00
Jay Cornwall 02a67f61c4 Implement optimized blit/fill kernels
Replace HSAIL kernels with SP3 shaders.
Support all alignment variations efficiently.

Change-Id: Icf7f5471f3ba68389f55484d82f2805dd9bc3827


[ROCm/ROCR-Runtime commit: 90ab72cd66]
2016-05-10 21:51:57 -05:00
Felix Kuehling a6b5c17133 Report gfx70x engine ID as 7.0.1
Stepping 1 indicates higher double-precision float performance and
potentially other runtime workarounds needed for lack of PCIe atomics
on gfx70x.

Change-Id: I97185c1233e7d24caaf20a1eadea931d5a2bc664


[ROCm/ROCR-Runtime commit: fa102f3b8b]
2016-05-04 13:53:24 -04:00
James Edwards 127d8c357f Add image and tools cmake files to the opensrc directory.
Change-Id: I9e95d391992fa6ad7d13b500cd28eb0fb93dda1d


[ROCm/ROCR-Runtime commit: 023b302fae]
2016-05-03 17:01:14 -05:00
bwicakso 293a073c40 hsa_amd_agent_memory_pool_get_info gives wrong results for gfx803.
Root cause: a missing break when querying the num hop attribute.
Other changes: cap the reported num hop at 1, since the runtime does not have enough information about each hop, and clarify the comment about the HSA_AMD_AGENT_MEMORY_POOL_INFO_NUM_LINK_HOPS attribute in the header file.
Change-Id: I5d868eb457666e1377d5308f6145e76176bbfaf7


[ROCm/ROCR-Runtime commit: 6ea42ae333]
2016-05-03 12:52:38 -04:00
Amber Lin 19b4a16ead Correct NumCaches against the CPU node
In a NUMA system, topology should report NumCaches as the number of caches
within the node but current code reports the total caches in the system. This
patch fixes the error. This patch also uses cpuid to get cache information
instead of reading from sysfs files. See "Intel Corporation, Intel 64 and IA-32
Architectures Software Developer's Manual Volume 2 (2A, 2B & 2C): Instruction
Set Reference", 3-179, for the cpuid instruction features used in this patch.


Change-Id: I8ecece6c2b230741822620b44e66ddc201ff5112


[ROCm/ROCR-Runtime commit: 73ad0a1942]
2016-05-03 11:39:33 -04:00
James Edwards 555e9169d8 Remove whitespace from comments in CMakeLists.txt
Change-Id: I9a94a6f224a5cbd5fb1f8b57ed0c369339e23228


[ROCm/ROCR-Runtime commit: 24714cb769]
2016-04-28 11:24:02 -05:00
Felix Kuehling 87bd249ed5 Add gfx70x support
Change-Id: I400adb62b5225ef3a42da279d067fb0a62907089


[ROCm/ROCR-Runtime commit: 97e51ce33d]
2016-04-25 14:27:44 -04:00
Shi, Aaron (en ye) (xN/A) TO 93ce815152 HSA Finalizer: Promote SC PRM -> Finalizer (HSA tree) up to CL 1258514
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1259784]


[ROCm/ROCR-Runtime commit: ad21f0606e]
2016-04-19 15:31:52 -05:00
Jay Cornwall (xN/A) UK def38e14f3 Fix SDMA fill for >=4MB regions
max_single_fill_size_ overflowed the packet field size. Reduce by one dword.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1259263]


[ROCm/ROCR-Runtime commit: 1d4a257225]
2016-04-18 16:05:13 -05:00
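
The SDMA fill fix above can be illustrated with a minimal sketch. It assumes a hypothetical 22-bit byte-count field, chosen only because 2^22 bytes is 4MB, which matches the commit title; the actual packet layout is not given in the log. The names FIELD_BITS, MAX_SINGLE_FILL and split_fill are illustrative and are not taken from the runtime.

```python
# Hypothetical sketch: split a large fill into chunks that fit a fixed-width
# packet size field. FIELD_BITS is an assumption, not a value from the source.
FIELD_BITS = 22                          # assumed width of the byte-count field
DWORD = 4                                # bytes per dword
FIELD_LIMIT = 1 << FIELD_BITS            # 4 MiB: one more than the field can hold
MAX_SINGLE_FILL = FIELD_LIMIT - DWORD    # largest dword-aligned size that fits


def split_fill(total_bytes, max_chunk=MAX_SINGLE_FILL):
    """Return the list of chunk sizes needed to cover total_bytes."""
    if total_bytes < 0 or max_chunk <= 0:
        raise ValueError("total_bytes must be >= 0 and max_chunk must be > 0")
    chunks = []
    remaining = total_bytes
    while remaining > 0:
        chunk = min(remaining, max_chunk)  # never exceed what the field can encode
        chunks.append(chunk)
        remaining -= chunk
    return chunks
```

Under this assumption, a fill of exactly 4MB is issued as two packets rather than one whose size field would wrap.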
Andres Rodriguez e8d96eac7a package: rename to hsathk-rocm-dev
Since we include headers and not just a library anymore, we should be
considered a -dev package and not a lib package.

Change-Id: I220465ea4ffc8d66d8d76e6716e6c6c50cdacea1


[ROCm/ROCR-Runtime commit: 44572965f6]
2016-04-13 19:39:54 -04:00
Besar Wicaksono (xN/A) TX [TEXT] 4b44183941 Fix query HSA_AMD_AGENT_MEMORY_POOL_INFO_LINK_INFO
Querying HSA_AMD_AGENT_MEMORY_POOL_INFO_LINK_INFO between a gpu agent
and its own local memory pool returns wrong information.
Fix: return link with 0 hop count.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1257544]


[ROCm/ROCR-Runtime commit: 5a584fa1ab]
2016-04-13 12:39:25 -05:00
Hari Thangirala 56f5e65938 ROCR Build ID support
Fix dirty-tree status. Thanks to Fan for fixing the issue.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256716]


[ROCm/ROCR-Runtime commit: 0545761aa9]
2016-04-11 18:48:29 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 02b6c65740 Sdma wraparound optimization.
Remove mutex and just make the thread spin again if the queue is wrapping.
Remove the wait for the queue to finish wrapping, and just check if there is enough space to recycle when reserving queue space.

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256713]


[ROCm/ROCR-Runtime commit: ea67bb8374]
2016-04-11 18:31:45 -05:00
Andres Rodriguez ade12f4ec1 Adopt new ROCm packaging guidelines
All files should go into /opt/rocm/$component

For developer convenience, a single include directory is created through
symlinks, from the component include directory to /opt/rocm/include.

Similarly, a unified linked directory is present in /opt/rocm/lib

The component lib directory should not include linker names (library
names without version numbers).

This commit also makes 'make rpm' run correctly without the need for
sourcing build/envsetup.sh

Change-Id: I95a680f6d3e3bd1ae688d0694934a0577dbd007c


[ROCm/ROCR-Runtime commit: 9f355b78a0]
2016-04-11 18:30:54 -04:00
James Edwards (xN/A) TX f716a9ae63 Remove ENV variables from CMakeLists.txt files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256687]


[ROCm/ROCR-Runtime commit: 871412adff]
2016-04-11 17:18:01 -05:00
Hari Thangirala 6a4a5f7c71 ROCR Build ID support
Build system/Package maintainer:
-    BUILDID is specified at cmake.
-    USAGE: cmake -DBUILDID=<ID> ../src

For developer builds, which typically don't provide BUILDID, cmake will:
-    Determine the last git commit when this tree was synced
-    Determine the build date
-    Check if the tree is clean when built

The idea of this embedded string is that later, when you get a ROCR build, you can get some idea of the build's origin by using: strings libhsa-runtime.so.1 | grep "ROCR BUILD ID"

For example:
-    If it's Jenkins build 25, it returns: "ROCR BUILD ID: 25"
-    If it's a developer build synced @ 06f5f2a with modifications, it returns: "ROCR BUILD ID: 06f5f2a-2016-04-11-0"

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256588]


[ROCm/ROCR-Runtime commit: a148fd0b68]
2016-04-11 15:03:06 -05:00
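
The lookup described in the commit message above (strings piped into grep) could be sketched in Python roughly as follows. This is an illustration only: the function name find_build_ids and the minimum run length are assumptions, and the only detail taken from the log is the "ROCR BUILD ID" marker.

```python
import re

MARKER = b"ROCR BUILD ID"  # marker text quoted in the commit message


def find_build_ids(data, min_len=4):
    """Return printable-ASCII runs in `data` that contain the build ID marker.

    Roughly what `strings <file> | grep "ROCR BUILD ID"` does: collect runs of
    at least `min_len` printable characters, then keep those with the marker.
    """
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [
        match.group().decode("ascii")
        for match in pattern.finditer(data)
        if MARKER in match.group()
    ]
```

To scan a real library, read it in binary mode first, for example `find_build_ids(open("libhsa-runtime.so.1", "rb").read())`.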
Felix Kuehling f0af6eceed Fix 4GB and larger system memory allocations
The intermediate size was stored in a 32-bit variable. This caused
4GB allocations to fail in KFD because the size became 0. Larger allocations
would allocate the wrong amount of memory.

Change-Id: If19dedf64952f1d2edd813793241e12c0362d220


[ROCm/ROCR-Runtime commit: 82b3fad320]
2016-04-11 11:17:06 -04:00
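
The truncation described in the commit above can be reproduced with a small sketch. The runtime itself is not written in Python; here ctypes.c_uint32 merely stands in for the 32-bit intermediate variable, and the helper name truncated_size is hypothetical.

```python
import ctypes

GIB = 1 << 30  # bytes in one GiB


def truncated_size(size_bytes):
    """Return the value left after size_bytes passes through a 32-bit unsigned variable."""
    # c_uint32 does no overflow checking: it keeps only the low 32 bits.
    return ctypes.c_uint32(size_bytes).value
```

With this helper, a 4 GiB request collapses to 0 and a 5 GiB request to 1 GiB, which matches the two failure modes the commit message describes: a zero-size failure at exactly 4GB and a wrong allocation size above it.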
Zhuravlyov, Konstantin (x21446) MA de9132a710 Fail gracefully if memory allocation did not succeed
Testing: precheckin (http://ocltc.amd.com:8111/viewModification.html?modId=69427&personal=true&tab=vcsModificationBuilds)

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256179]


[ROCm/ROCR-Runtime commit: 503fd728dd]
2016-04-09 16:40:24 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 1063b89f2d Fix unit test build error due to CL#1256098
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256119]


[ROCm/ROCR-Runtime commit: 2ebde5d2a7]
2016-04-08 16:51:45 -05:00
Besar Wicaksono (xN/A) TX [TEXT] be9b46fdfd Fix build error from CL#1256102 due to whitespace issue.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256108]


[ROCm/ROCR-Runtime commit: 7760839934]
2016-04-08 16:40:05 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 869f361a0b Add AMD extension version
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256102]


[ROCm/ROCR-Runtime commit: a03c5148a7]
2016-04-08 16:31:00 -05:00
Besar Wicaksono (xN/A) TX [TEXT] c8f0efbee1 Add global memory clock and width info on the agent attribute list and deprecate the ones in the memory region attribute list.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1256098]


[ROCm/ROCR-Runtime commit: 4ccc695b95]
2016-04-08 16:29:10 -05:00
Andres Rodriguez e4f1d95ef2 package: change install directory to /opt/rocm
Align with the rest of the driver stack on the new installation path
/opt/rocm/*

This mechanism for generating packages should be changed for something
nicer and more standards compliant in the future.

Change-Id: Ic31409b0d0b8f6ee4b25296d2580982a76aab564


[ROCm/ROCR-Runtime commit: 31861c838e]
2016-04-08 11:41:49 -04:00
Nikolay Haustov [TEXT] 6b9e626507 Cherry-pick CL 1250286 from SC stg.
HSA Finalizer: Add dumping of code object, ISA and executable to loader.

This is controlled by loader options -dump-all, -dump-isa, -dump-code, -dump-exec

The options can now also be set with env variable LOADER_OPTIONS_APPEND.

Added tests to finalizer_offline

Testing: smoke, dumping on hardware

Reviewed by: Konstantin Zhuravlyov

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1255351]


[ROCm/ROCR-Runtime commit: a795909bca]
2016-04-07 06:01:20 -05:00
Besar Wicaksono (xN/A) TX [TEXT] b156bd28b9 Cleanup TODO format
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1255182]


[ROCm/ROCR-Runtime commit: 823c254d61]
2016-04-06 16:50:50 -05:00
Ramesh Errabolu (xN/A) TX cae56040bb Update Private Segment Size parameter of the dispatch packet
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1254638]


[ROCm/ROCR-Runtime commit: b93946790d]
2016-04-05 14:03:33 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 2caa405817 Add environment flag to enable sdma workaround that will wait for the sdma queue to be idle before updating the write pointer. Add class to manage environment flags.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1254004]


[ROCm/ROCR-Runtime commit: c95f96a9e4]
2016-04-01 17:13:45 -05:00
James Edwards (xN/A) TX fae50d66a8 Branch Brig.h file into opensrc hsa-runtime directory.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251455]


[ROCm/ROCR-Runtime commit: e3670a2bef]
2016-03-25 15:26:18 -05:00
Nikolay Haustov [TEXT] b69a730231 HSA Finalizer: Merge changes in libamdhsacode and loader from sc_prm into hsa/compiler/finalizer and hsa/runtime.
Testing: pre-checkin

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251389]


[ROCm/ROCR-Runtime commit: 46842a57e5]
2016-03-25 08:36:20 -05:00
Zhuravlyov, Konstantin (x21446) MA 92a990b426 Clean up extensions and provide public extension/API to query host address given device address:
- Partially remove 'amd_load_map' extension because it is not used and will not be used
- Remove 'hsa_amd_query_kernel_host_address' API
- Add 'hsa_ext_amd_loaded_code_object' extension
- Add 'hsa_ext_amd_loaded_code_object_query_host_address' API
	- Most likely to be used by debugger, profiler, and hcc (printf)
- Update affected sources
	- 'hsa_system_extension_supported'
	- 'hsa_system_get_extension_table'
	- SoftCP path
- Integrate CLs 1250699, 1251204, 1251214 from stg sc

ReviewBoardURL: http://ocltc.amd.com/reviews/r/10091/
Testing: smoke (ok), teamcity (ok), samples on fiji (AQL and SoftCP) (ok)

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251223]


[ROCm/ROCR-Runtime commit: f6565a2f70]
2016-03-24 19:00:30 -05:00
Besar Wicaksono (xN/A) TX [TEXT] 4d763ffe0e Always wait for queue wraparound to finish and don't return a not-enough-resources error.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251141]


[ROCm/ROCR-Runtime commit: 9fa0531950]
2016-03-24 15:52:45 -05:00
Sean Keely (xN/A) TX 9e27d8b95a Minor fix to hsa_amd_image_descriptor_t.
Change uint32_t data[0]; to uint32_t data[1];

[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1251050]


[ROCm/ROCR-Runtime commit: 1c7142c129]
2016-03-24 13:24:22 -05:00
David Ogbeide 9abf85c06b libhsakmt: get CPU model name from /proc/cpuinfo
HSA thunk is currently only aware of GPU node
model info; CPU names are NULL.

Signed-off-by: David Ogbeide <davidboyowa.ogbeide@amd.com>
Change-Id: I3c2adbb8566a5048b44c39fff4fd8228912468ff


[ROCm/ROCR-Runtime commit: 682776d89a]
2016-03-23 11:11:18 -04:00
James Edwards (xN/A) TX 8504fcb6ca Separate open source core runtime code from DK makefiles.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1250152]


[ROCm/ROCR-Runtime commit: 7d2bc9d113]
2016-03-22 18:10:13 -05:00
James Edwards (xN/A) TX 6da3325470 Remove opensrc test files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249961]


[ROCm/ROCR-Runtime commit: 7d1e6c3a57]
2016-03-22 13:39:51 -05:00
James Edwards (xN/A) TX 67e6064a6a Check open source core runtime code into perforce. This includes license and README files.
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249136]


[ROCm/ROCR-Runtime commit: c9ffe0004e]
2016-03-20 15:39:40 -05:00
Felix Kuehling a8a5960095 Add environment variable to disable GPU caching
This option may help debug synchronization or coherency issues
involving the GPU caches. It works only on dGPUs, by changing the
cache policy of the GPUVM default aperture to "coherent", which is
implemented as non-cached on current dGPU hardware.

Change-Id: I544ac9cc5c0cf1fa5c4e30f67aa42b3b5e44ae67


[ROCm/ROCR-Runtime commit: 06d391c6c9]
2016-03-17 18:51:47 -04:00
Harish Kasiviswanathan 718e3600b8 Add QPI or HT io_links
Create QPI or HT links among all NUMA nodes. For now, assume all the
NUMA nodes are interconnected with same Weight (=1).

Change-Id: Id48ba95b9d75515a186f7dc5006b19bd92743ae3


[ROCm/ROCR-Runtime commit: f1fbacca15]
2016-03-15 21:10:53 -04:00
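
The link layout described in the commit above (every NUMA node connected to every other, all with weight 1) might look like the sketch below. It assumes one link per direction between each pair of nodes; the log does not say whether links are directed, and the name numa_io_links and the tuple layout are illustrative only.

```python
from itertools import permutations


def numa_io_links(num_nodes, weight=1):
    """Return (from_node, to_node, weight) for every ordered pair of distinct nodes."""
    # permutations(..., 2) yields each ordered pair once, so a two-node system
    # gets two links: 0 -> 1 and 1 -> 0. Directed links are an assumption here.
    return [(src, dst, weight) for src, dst in permutations(range(num_nodes), 2)]
```

Under that assumption a system with n NUMA nodes gets n * (n - 1) links, so a single-node system gets none.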
Harish Kasiviswanathan 14e60b6ab3 Get processor vendor from /proc/cpuinfo
Change-Id: I9039385d268ef1693fab121cbf1caf442129a12e


[ROCm/ROCR-Runtime commit: ee1dd5d9c2]
2016-03-15 15:37:52 -04:00
Besar Wicaksono (xN/A) TX [TEXT] 2d78a85c21 Add IOLink support
[git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1247220]


[ROCm/ROCR-Runtime commit: 73d43224e9]
2016-03-14 18:42:31 -05:00