Evgeny
bb8eaf3ac8
aqlprofile API: _aqlprofile_start() returns required profile buffer sizes if undersized
...
Change-Id: Ib14b2cb2e7e2026c3af0b7bd2f08f51e48e598b2
2017-11-09 20:03:55 -06:00
Sean Keely
6455a69b03
Fix bad casts in tools.
...
Also virtualize queue profiling enable.
Change-Id: I761b41269be3df7eb64a5914ee9951ed6b51bb04
2017-11-08 15:50:02 -05:00
Sean Keely
a6d8a48cbc
Add callback exception forwarding.
...
Modified callbacks for intercept queue, queue error, iterate agent and
iterate region.
Change-Id: I8bdd67f2312510ea7eb9caec93babca244938b40
2017-11-08 15:50:02 -05:00
Sean Keely
f312a7386e
Exception support for Queue.
...
Remove "zombie" queue state and report queue creation failure via
exceptions. Make Shared object a final container and support array
objects with Shared. Add message printing to hsa_exception in
debug builds.
Change-Id: I459f38c80846018acbf45538874e95f91dd6b195
2017-11-08 15:50:02 -05:00
Sean Keely
0c7dde2d1f
Add queue intercept support to the runtime.
...
Queue intercept is exposed as two tools-only APIs via the API
intercept table.
Change-Id: Iac9602ed3143974d85c3569e9092295ad18037f8
2017-11-08 15:50:01 -05:00
Kent Russell
b29d3f63e2
Revert "aqlprofile API: _start() sets buffers sizes with NULL ptr; block counters reg number / block name info"
...
This reverts commit 3daa85fad8 .
Change-Id: Ie90b091df772bf9391494c773d63858aafbc1176
2017-11-08 06:59:33 -04:00
Evgeny
3daa85fad8
aqlprofile API: _start() sets buffers sizes with NULL ptr; block counters reg number / block name info
...
Change-Id: I3cb93453b683c55bf5ec26271648232306a5d140
2017-11-07 15:05:47 -05:00
Sean Keely
d93f92f42d
Make HostQueue::queue_count_ a portable atomic type.
...
Also make lint happy.
Change-Id: I0f965df6a76fd959df9eb411d1f1b11847159790
2017-10-31 02:38:25 -05:00
Qingchuan Shi
ce6aee01ed
Add APIs to support debugging vm fault
...
1. Add hsa ext api hsa_amd_register_vmfault_handler for debugger to register callback in case of VM fault.
2. Extend hsa_ven_amd_loader API to:
(1) iterate loaded code objects in executable:
hsa_ven_amd_loader_executable_iterate_loaded_code_objects
(2) get loaded code object info:
hsa_ven_amd_loader_loaded_code_object_get_info
3. Make the id of hsa_queue the same as the one used in communication with thunk (for amd_aql_queue)
Change-Id: I68910809e59e24297350d262606f00e96c14bcbd
2017-10-28 21:48:26 -04:00
Sean Keely
6ee2ccb08b
Fix error message description.
...
Change-Id: I32efed68e970a4882aca9decbbcda3fcd5c5cb43
2017-10-24 21:52:21 -05:00
Sean Keely
5a4ab91be1
Set doorbell kind code for gfx9+ device enqueue.
...
Change-Id: I93c4cea677ae51f97ac768614333743fb26b2f54
2017-10-21 11:08:44 -04:00
Sean Keely
a8d818a6bc
Improve build system handling of non-default directory layouts.
...
Adds the thunk include and lib paths to the cache, removes paths
to indicator files from the cache, uses the cached path directory
(if any) as a search hint for indicator files.
Change-Id: I0859faa8d229a97abfaacb408d2c831e317aed5f
2017-10-21 11:08:15 -04:00
Sean Keely
3cef9b1a04
Improve unhandled exception error reporting in debug builds.
...
Change-Id: Ia92d1a93163105d817a2147d96f2edd399e2b70d
2017-10-21 11:08:01 -04:00
Sean Keely
737966eb25
Fix memory leak in exception path.
...
Change-Id: Iad5f035cd1909be4a8f1a1f5dd7ca5abec0694b4
2017-10-21 11:08:00 -04:00
Ramesh Errabolu
dccbc9f2af
Changes to support surfacing of link weight as part of link info
...
Change-Id: I1c0705a9374af1245f0419c51beded0d7ee10639
2017-10-20 12:09:31 -04:00
Chris Freehill
a7cbe78366
Use Major/Minor/Step device numbers to differentiate gfx devices
...
Change-Id: I0901871971a5b33018917ada6c0e69ac7aa91944
2017-10-13 16:18:24 -04:00
Sean Keely
4f299a9909
Capture more memory allocation types with the 2MB allocator.
...
TensorFlow was running out of VRAM due to padding up allocations
from legacy memory APIs. These allocations have been added to
the fragment allocator to improve VRAM utilization.
Change-Id: Ic680fff576a0434b3b17a4c91746da44e09957fa
2017-10-12 23:22:10 -04:00
Evgeny
fd99e909ff
aqlprofile API: enabling privilege memory related counters
...
Change-Id: I28a24ad1a3ce78c5d8a6319635ae1ffd392ab690
2017-10-09 17:34:54 -05:00
Sean Keely
9ba83d83f7
Release cached memory blocks when memory allocation fails and retry.
...
Change-Id: I6d96e136e147d8ffe9ff7baec26b4b5a678b739f
2017-10-03 20:12:05 -04:00
Sean Keely
e9a6f2c3e6
Support hsa_amd_agents_allow_access on page fragments.
...
Since access may only be manipulated on whole pages, suballocator fragments must cooperate to set the page's access.
Since the KFD does not migrate memory on access changes this implementation makes agent access sticky across the requests in a fragmented page.
Change-Id: I88479ed45fb40e9782b704526a7b8ffb22e7bd76
2017-09-27 19:04:04 -05:00
Evgeny
0e88414f5c
removing graphics specific block RMI, ennabling memory related blocks
...
Change-Id: I477adc49b9ee3c8593c193bdc69c0deb4a9726e1
2017-09-25 10:49:22 -05:00
Sean Keely
476c8e36bf
Fix assert in simple_heap.
...
Also add comments to clarify pointer info constraints.
Change-Id: I8d07831a0e953d667c84c96fe53ed07c18ba115c
2017-09-21 00:47:18 -04:00
Evgeny
fcaecfee80
adding hsa_ven_amd_aqlprofile.h to the packaging
...
Change-Id: I3b69396e3cea129106d47be53218213e29de9843
2017-09-20 14:40:49 -04:00
Sean Keely
30fce248c6
Enable use of CLOCK_MONOTONIC_RAW for post 4.4 kernels.
...
Change-Id: I3c1f27c7e639df5128c36d81f715fa16e6c1cf13
2017-09-20 14:28:23 -04:00
Sean Keely
9dfdce5b3c
Improve branch elimination in ScopedAcquire.
...
GCC can't reasonably be told that the lock ptr isn't null. Adding a private bool
allows the branch to be eliminated, along with the bool.
Change-Id: I0605d69474d6a6e6951be93c0af1d8caf3f77124
2017-09-19 06:08:36 -04:00
Sean Keely
4275661682
Add env key to disable 2MB suballocation.
...
Change-Id: Icca3041c3578aa180a656c01aae62f2ad6e8b583
2017-09-19 06:08:36 -04:00
Sean Keely
117be0b55a
Add suballocator for ordinary VRAM allocations smaller than 2MB.
...
Track pointer info for sub 2MB fragment allocations in allocation_map_.
Add fragment support to IPC.
Change-Id: I00cfc2e2fa289aac90a4718c392f9bb056a61a87
2017-09-19 06:08:36 -04:00
hthangir
87d2df3da3
Use non-RAW version in clock_getres to workaround bug in older kernels
...
Change-Id: Ice0606a42cd7054f0804baf4af3521ffae3b7d50
2017-09-14 13:56:15 -05:00
Sean Keely
f1a661dedb
Report tools library load failures in debug builds.
...
Change-Id: Ie1ff313e929fc46134e58730a1d370c5d7ace8db
2017-08-31 21:32:48 -04:00
Laurent Morichetti
0f05ef73ac
Include <errno.h> for EBUSY
...
Change-Id: I9fa3417445866f3ce37af2169f623afa8e92e873
2017-08-31 07:32:51 -07:00
Evgeny
09c732c2f9
aqlprofile api: adding gfx8 mc counters
...
Change-Id: I84dc06c24b7961dfe665cf7e2ae6cc9ae3b7326b
2017-08-23 15:23:05 -05:00
Sean Keely
0cb1e8cb35
Correct vm_fault signal cleanup.
...
Change-Id: Id2f14b911e3991a76771425bc09f38a613280e6b
2017-08-18 22:12:38 -04:00
Evgeny
fb4afca8c3
AqlProfile API, commenting out hanging GFX9 blocks, ATC, GCEA, RPB
...
Change-Id: I411fb33f77c9538ca236b9b6b09c7dfe75220c02
2017-08-15 12:44:00 -05:00
Sean Keely
dec5c52e07
Simplify pointer info version check.
...
Change-Id: I0ed363f1261ffc041547f313970ca67298ace56c
2017-08-12 03:14:39 -04:00
Sean Keely
c9642cf7af
Initial IPC signal support.
...
Added an API for creating signals with attributes.
Added two APIs for IPC operations on signals.
Initial use of exceptions for error handling.
Add ref counting to signals.
Removed spin loops from signal destructors.
Signals are no longer to be destroyed with delete, use DeleteSignal instead.
Added delete safety to doorbells.
Added secondary hsa_signal_t -> Signal* translation path for IPC enabled signals.
Change-Id: Id59065d002f0c2566b0a9425694da2ed27cb7d7f
2017-08-11 18:41:34 -05:00
Sean Keely
2732b18092
Initial exception support for signals.
...
Also separate signal ABI block allocations from the runtime interface object.
Change-Id: If16763338db664f29163a1348f8f4c38cf0597b2
2017-08-11 18:41:34 -05:00
Evgeny
287afd3a52
adding aqlprofile member to HsaApiTable
...
Change-Id: Id674186dfa2e83295a51f770ccc0400f1cb51a98
2017-08-09 16:09:39 -05:00
Evgeny
4824a2db0b
Adding HSA_API macro to the API method declarations to be consistent with other HSA header files, TCS removing
...
Change-Id: Ic217d3b2bdbb22d3600c5ecaacb7ab53bf26096a
2017-08-08 10:46:12 -04:00
Evgeny
47322942b3
aqlprofile block list, explicit numbers assigned, IA removed
...
Change-Id: I9f9358f8e03e13eb81845de2e33dd5f3da27811a
2017-08-03 11:39:21 -04:00
Evgeny
c66f68041c
aql-profile api: reducing blocks list to compute only and new gfx9 blocks
...
Change-Id: Ib506b82ea407afec4f5d4bcad755d4d57b92e34b
2017-08-02 12:21:24 -04:00
hthangir
9ee0108e58
Fix compilation issue reported with GLIBC 2.12 (RHEL 6.9)
...
Change-Id: I770b72ba1d61475a76aa72d0c52ebfb380db6019
2017-07-28 11:11:01 -04:00
Evgeny
08d5efe29d
moving hsa-amd-aqlprofile to ssh://gerritgit/hsa/ec/aqlprofile
...
Change-Id: Ic42752ca41f877db02aa5a5d8d617cd67cce8956
2017-07-14 14:59:42 -05:00
Evgeny
ab67b8511b
hsa_ven_amd_aqlprofile.h: include <hsa.h> fix
...
Change-Id: Idfd2fdde112d502d4b4a3365512ec601f7e56a5b
2017-07-12 15:43:58 -05:00
Sean Keely
a0a3587345
Remove use of anonymous member in C builds.
...
Tools/CodeXL will retain older versions of structs if them need them.
Change-Id: I568d7b445778dd575ef71000b4b839300572288e
2017-07-12 16:40:00 -04:00
Sean Keely
bc0bd00746
Fix queue interception in tools.
...
1. Correct amd::AqlQueue::ExecutePM4 to support interception.
2. Minor fixes to AqlPacket and SoftCP.
3. Minimal change to disable interception of runtime internal queues.
Change-Id: I103fece2ebf9a188d27f01e61221c737405d7253
2017-07-12 16:39:43 -04:00
Sean Keely
29b5b5c029
Correct handling of slow clocks under linux.
...
Change-Id: I9a1b08d5457caa6739220603bbd37b00febc64d7
2017-07-12 12:49:49 -04:00
Sean Keely
3e50adc7ce
Properly order signal copy agent tagging with copy operation.
...
Change-Id: Ic428c958551279fbea1b2449afba930b82804ede
2017-07-11 13:10:00 -04:00
Sean Keely
c9f0427cb0
Decrement hsa_init ref counter when init fails.
...
Change-Id: If9376344d4b559e601932d070731132c8450104e
2017-07-07 21:21:03 -05:00
Evgeny
4174f07fd1
hsa-runtime integration
...
Change-Id: I48968966ffe164218ebff88d0e3a1268e96bf1dd
2017-07-05 10:55:30 -04:00
Evgeny
c533229bc1
Block list extending
...
Change-Id: Id17efde25fce287296e80f2b37c77b15aa59b561
2017-06-23 16:37:02 -04:00