Γράφημα Υποβολών

418 Υποβολές

Συγγραφέας SHA1 Μήνυμα Ημερομηνία
Sean Keely 1e0d690948 Use ptrinfo rather than apertures in hsa_memory_copy
Apertures now overlap with the change to 48bit addressing which
precludes using aperture checks to discover buffer ownership.
Switches to ptrinfo to decide which device a buffer owned by.

This corrects faults in the legacy hsa_memory_copy api.

Change-Id: I5c7ce0216e1cdc96f836fc6fec9c3defdf4b9d90
2018-10-11 13:34:53 -04:00
Konstantin Zhuravlyov 386874da55 Loader: Add support for v3 object code.
Change-Id: I7215bd0c1277c2036bf0fadf5b23cb57fdf7f665
2018-10-06 14:01:59 -04:00
Jay Cornwall f1ffbc3286 Revert "Extend SDMA disable list until firmware stability resolved"
This reverts commit 5e1ccdc4a9.

Change-Id: I17b379e4d0e49a79dc8d4a60f01ea424fda24f02
2018-10-05 15:17:27 -04:00
Kent Russell ed9baefd75 Only remove ldconf on uninstall
On update, the removal will occur AFTER the new package is installed,
due to some stupidity with how yum/rpm does things. Only remove it if
we're doing a pure uninstall

Change-Id: I4982610828d8bc1f2d8691b1e4ee1718c89413cc
2018-10-03 08:10:06 -04:00
Evgeny fdbe277f2a hsa_ven_amd_aqlprofile_pfn_t alias
Change-Id: Ia4a67ef0d2f8975f0e541e85c215afec76e9de5f
2018-09-26 14:10:21 -04:00
Scott Linder 47f0e6f7d3 Apply dynamic relocations for STT_FUNC symbols
Required to support function calls through GOT table.

Change-Id: I174a0269fdd67369d38fe41855b7bd01f350b839
2018-09-23 21:42:32 -04:00
Ramesh Errabolu 01eea21d6c Capture number of Numa Nodes present on system
Change-Id: Ic789a6b9da8e316cb483e50b0fe9faa03798f97c
2018-09-18 16:27:30 -05:00
Ramesh Errabolu f007870792 ROCr changes to enable small BAR P2P over xGMI
Change-Id: I6aaa3fe2565cdf7e15d58a7484d6bd5916ffff64
2018-09-17 22:54:40 -04:00
Evgeny 81532bb6f5 VERSION_MINOR macro typo fix
aqlprofile info ENABLE_CMD enum adding;

Change-Id: I7b19082144d2bd0bf7af7ddc282358168b225759
2018-09-17 20:49:47 -04:00
Sean Keely 3357cadeec Check fill addresses for alignment.
Check was documented but missing.

Change-Id: I97951635d794fd22e20c25d20e9d0e35035254af
2018-09-05 16:34:19 -04:00
Sean Keely 2843988dd7 Remove redundant initialization.
LinkInfo is already initialized to zero in its default constructor.

Change-Id: Ifa4fb886cce9b474c6879c9c82744044ab394082
2018-08-29 19:36:07 -04:00
Sean Keely 56ed5c8904 Refactor blocking sdma commands.
Remove fence pool and use two signals.  Two signals allows overlapped
submission and copy while reducing thread busy polling.

Change-Id: Idb5f8e4c7f482a596ffce9e7799191fdd785a216
2018-08-29 19:13:23 -04:00
Sean Keely e0839ab27e Implement SDMA copy rect for gfx9.
Fix pitch overflow due to small element detection.
Add wide pitch 2D copy handling.
Cleanup code duplication.

Change-Id: I93b1584aba8e5964957eb7ab3544df806ca3e2f9
2018-08-29 19:13:07 -04:00
Sean Keely aca00b7238 Add debug checking of time stamps validity.
Can only check that the signal has some time stamp, can't check if
the translating agent matches the last used agent or not.

Change-Id: I62943a864318808059c617280bb65a269dfadd1b
2018-08-26 12:36:35 -04:00
Sean Keely cd8e5c1da8 Expose ROCr build ID.
Adds HSA_AMD_SYSTEM_INFO_BUILD_VERSION=0x200 to hsa_system_info_t.
This returns a const char* pointing at the build string (git describe).

Change-Id: I73e6612482bf6ffc4037fd365808eb9211a650ad
2018-08-20 20:44:32 -05:00
Chris Freehill c1fbd8aa54 Use 64 suffix for rocm_smi lib name
Change-Id: Idab0f5576f830657afb6bf26e1d88b18244431cb
2018-08-20 08:05:31 -05:00
Sean Keely 6c47780620 Experimental flag to swap copy agent for async copy APIs.
Adds env flag HSA_REV_COPY_DIR.  If set to 1 async copy will
copy from dst device to src device rather than from src to dst.

Change-Id: I3095642066fa026dc112c2eac06db9393341cd7e
2018-08-09 10:58:14 -04:00
Jay Cornwall 5e1ccdc4a9 Extend SDMA disable list until firmware stability resolved
Change-Id: I5e21cb761ae970ba2b68edd97b1564b36ca1f0f4
2018-08-08 11:20:14 -05:00
James Edwards 4d7d50feba Add tools headers and library back to packaging.
Change-Id: If6c9befe50fc111eb154bd5b4eb5c7858f5d510b
2018-07-16 16:51:12 -04:00
Sean Keely 35a270ef7e Do not initialize runtime internal queues based on mapping memory to a GPU.
Conserves VMIDs when multiple processes are in use and memory operations
are not GPU specific.  For instance HIP API hipHostMalloc does not accept
a target GPU so when used with one process per GPU (ie GPU == MPI rank) we can
quickly exceed the available VMID slots if every process consumes a VMID on
every GPU.

Change-Id: Ib6fa051290089f71581029c09f9a44b9992237d1
2018-07-13 19:58:45 -04:00
Chris Freehill 65c3cf27f5 Use the new name of the rocm_smi library
Change-Id: I7358b7b819826f1d3d3b0ca99fc5fd1a4e6d9536
2018-07-13 11:46:49 -04:00
Chris Freehill 3cca09ccca Fix NUMA async copy test
Change-Id: I64b5bd1ac5bf9b58d86c3dfc170bcf06a39abee4
2018-07-11 19:20:13 -04:00
Sean Keely c6cf161125 Fix git describe command to retrieve version tags correctly.
Change-Id: I904f5ccdb88c1e28d5eeffd104174fcd57626ee7
2018-07-10 20:19:04 -05:00
Chris Freehill 06759fed5f Undo temporary work-around for RSMI change
Change-Id: I9bf144add951c95e4eebc8647bffb71d13f4f612
2018-07-09 08:46:57 -05:00
Chris Freehill ae0c852074 Temporary work-around for RSMI change
Change-Id: If4913d5d0cdb0415569c75ab312c39b4253cd4fa
2018-07-07 22:57:42 -05:00
Wilkin 170e2a142f OpenCL BLIT for Image library
- include support for gfx702

Change-Id: If681a4eef9bd076e25300e1c1bca55b4f7c92b46
2018-07-06 10:35:44 -04:00
James Edwards 58a411dd36 Change packaging for rocr-dev and rocr-ext.
Change-Id: Ia096a2d31ddd7bef2e05bb3d6c58e94d8c339598
2018-07-02 13:40:45 -05:00
Jay Cornwall e388a23344 Add hsa_amd_queue_set_priority extension function
Controls dispatch and wavefront scheduling arbitration across quees.

Change-Id: I498f4898b544f79b8fb8514bf7e789ca9da29462
2018-06-19 19:41:28 -05:00
rohit pathania 6df6ef778d Kernel group memory dynamic allocation, basic allocation and free test
Change-Id: I17fdb77f17567ac1b429d9a571cac70ac1e64dd4
2018-06-15 10:49:10 +05:30
Sean Keely 3e3aa37750 Enable SDMA use without platform atomic support.
SDMA will use atomic completion fences if KFD reports 64bit atomic support.
Otherwise it will fall back to store completion fences.

Change-Id: I12b76f8a74ec3ee96372c250f9824d846051536e
2018-06-12 15:38:44 -04:00
rohit pathania d8e47ba115 Modified memory atomics on non Large bar system and pool info test memcmp issue fix
Change-Id: I951fdb6c91508f43b1c51f7eb92870543fc58e53
2018-06-11 18:49:29 +05:30
Chris Freehill 12a81ae96f More emulator friendly tests/examples
Change-Id: I27ab26add14743dfb065238129c14b48913d9df8
2018-06-08 17:58:37 -04:00
Chris Freehill 8a6f0d6b24 Disable Signal tests
They are breaking Jenkins builds.

Change-Id: I1647049abee0ebc2a4751e66d9ceed56cadc4c3e
2018-06-08 15:47:09 -04:00
rohit pathania c2ddd11979 Build failure issue in rocmaster 8386
Change-Id: I413abe0c9fbe16ab2e722cf3f7567aa2853e585b
2018-06-07 13:05:39 +05:30
srinivas Charupally f0a1b310fd Adding Signal Kernel tests
Change-Id: Ie34de41741a7c4731a0ff3761e940971b6f08745
2018-06-06 16:25:18 +05:30
Felix Kuehling 5f25d024a8 Prepare for hsakmt build system cleanup
These fixes are needed to find the hsakmt headers and libraries with
an upcoming hsakmt build system cleanup. It should continue to work
with the original hsakmt build system.

Change-Id: I6b3fcea8f2588698c130c9ec50952c66712afa6c
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
2018-06-05 16:01:03 -04:00
rohit pathania d0f6da277d Concurrent testing for queue write index
Change-Id: If5b60b943a861d8f97d01b7fd8f757fdb36845c6
2018-06-05 11:57:38 +05:30
Chris Freehill 2c8cbf61c3 Make emulator friendly
Disable some tests that rely on features not typically available
in emulator and use smaller data and iteration sets

Change-Id: I587bf83162b114719e0361109ed44c6bf2adf34c
2018-06-04 18:51:09 -05:00
Chris Freehill 845a539478 Disable Aql_Barrier_* tests
Change-Id: Ibe08b88c101a60e4c6f0c61cda756e2cb5857d7d
2018-06-04 11:46:08 -04:00
srinivas Charupally a632bfddb2 Adding Aql tests
Change-Id: Id22dcafbf0ea0b346f3a03d4acef27350b706f36
2018-05-31 15:29:37 +05:30
srinivas Charupally 2c551d38b6 Adding Signal tests
Change-Id: I1815267a0e19614a84013e797bd3df3e77ee8179
2018-05-30 01:40:31 -04:00
Sean Keely f09eb2e8c7 Move SDMA dependencies back to hardware.
SDMA poll packet preemption has been fixed.

Change-Id: I3da878c433d4594a169e3bc8f173d3651448fd2d
2018-05-29 23:32:47 -04:00
Sean Keely c593dfc6bf Enable SDMA conditionally based on link atomic suport.
Avoids using non-atomic SDMA fences by default since that path can duplicate fences.
If HSA_ENABLE_SDMA is set this will override copy path selection and may use
non-atomic fences.

Change-Id: I4747e9a766f7f649d21ddf6bfded047ac26fd60e
2018-05-29 23:32:34 -04:00
rohit pathania 044fb8dc27 Different Atomic operation tests on GPU and system memory
Change-Id: I04154b588086d49142a64c8fe4826d041ded2991
2018-05-28 22:18:48 +05:30
rohit pathania 08a253684b Queue validation tests and memory alignment tests
Change-Id: I96d8c2898795240288517bdcbc2b48ff2cc04f66
2018-05-28 14:26:05 +05:30
srinivas Charupally 2c1919c681 Adding concurrent shutdown, reference count and max reference count tests
Change-Id: Ib6f40585bf1ab2b1d6f33bbb1675e13545a23a4e
2018-05-28 00:51:05 -04:00
Qingchuan Shi 3a46556dcc Add debug trap rocrtst.
Change-Id: I73682d7a2ad51eed9988075e012478a1afc76c7c
2018-05-22 13:31:45 -04:00
rohit pathania 47af1d673e Memory Concurrent tests for pool Memory allocate, Memory free and get pool info
Change-Id: I6a1343348e400fe466e041d651adaa67be561a21
2018-05-14 01:30:54 -04:00
Jay Cornwall 536823482b Handle llvm.trap only in gfx9 trap handler
llvm.debugtrap and other trap IDs are reserved and should not place
the queue into an error state.

Change-Id: I98193a35ac7da94c4a42ee75d87754ee552ebea0
2018-05-04 13:15:23 -05:00
Evgeny 0e0be791ec Tool load failure report changing to unconditional print bcos it's already is controlled with the env var
Change-Id: I91b400ba94575a32005e825e6b41bda05c55b357
2018-05-03 22:31:17 -05:00