Wykres commitów

4686 Commity

Autor SHA1 Wiadomość Data
Vlad Sytchenko 2cefbe38e1 Merge branch 'amd-master-next' into amd-npi-next
Change-Id: I75be5fbe5ddd6552a6a316ec99ca8833ee02cb6c


[ROCm/hip commit: e19c8e176f]
2020-05-21 19:17:10 -04:00
Laurent Morichetti 02d5a7c4b9 load functions when registered if lazy load is disabled
Change-Id: I08c9684ca4ab86a6294ed9e670f3125c1a0cc6c4


[ROCm/hip commit: 22edde84d5]
2020-05-21 09:19:20 -04:00
Sarbojit Sarkar ca8de0f6c8 added null check
Change-Id: I7631965286bad5684bacb7e38c63031b5c691dc0


[ROCm/hip commit: 87d1101a33]
2020-05-20 23:08:55 -04:00
Saleel Kudchadker 7836dfd322 Fix elapsed time calc for hipEventElapsedTime
If the start and stop events have same command internally
then measure command end to command start
Change-Id: Ie70cfa37c06c06573f0ed58dab2bbe4434c1724b


[ROCm/hip commit: 50be95e169]
2020-05-20 16:02:48 -04:00
German Andryeyev 20e64e1450 Add a test case for async double memset
Change-Id: I8a1df610c8d3d942651f258e7812e8697067c347


[ROCm/hip commit: 4503d44e12]
2020-05-20 12:52:29 -04:00
Dittakavi Satyanvesh e9783da3c1 enable occupancy sample on HIP-Clang
Change-Id: I236daad743bbbc7f8daa41e8157a4ff73058518b


[ROCm/hip commit: 700474db6f]
2020-05-19 03:08:19 -04:00
agodavar e5c6555eee Fix hipMemset for unaligned ptr, correct offset required by ROCclr per aligment and testcase to validate unaligned ptr
Change-Id: Ib1a0f9d1556d09cf72a12a90cad2e27c6d9c6a1c


[ROCm/hip commit: 458a966aa1]
2020-05-16 02:38:14 -04:00
Christophe Paquot f0bd04856a [hip] Skip lang guess if -x <lang> is specified.
Change-Id: If5cbdd1e5d36d2d8b83f3eadd0598162691167aa


[ROCm/hip commit: 96704ddd00]
2020-05-15 17:07:40 -04:00
Matt Arsenault 0becc2185e Include interface directories from rocclr target
This is overly painful because this is an object library.

Change-Id: I618843b810549876053212bf24efff18aece9470


[ROCm/hip commit: f55e253699]
2020-05-15 13:17:13 -04:00
Payam bfea574271 load functions when registered if lazy load is disabled
Change-Id: I6f10a72f7643e476407af03b04ef52e59741be45


[ROCm/hip commit: a5b4f5f02d]
2020-05-15 12:15:32 -04:00
Icarus Sparry 8fd47616be Merge "SWDEV-235875:Fix rocALUTION build error" into amd-master-next
[ROCm/hip commit: f7256d9bae]
2020-05-15 12:10:02 -04:00
Aaron En Ye Shi 04b6bae304 Merge "API tracing instrumentation" into amd-master-next
[ROCm/hip commit: cefc8e4b1f]
2020-05-15 11:18:14 -04:00
Vlad Sytchenko 13afb72279 Merge branch 'amd-master-next' into amd-npi-next
Change-Id: Ie692324c8cf6215bf6cf87de0d5e3aa1bdfd5ea1


[ROCm/hip commit: 38ce7d8ae7]
2020-05-15 11:15:27 -04:00
Aaron En Ye Shi b1284a6468 Merge "Fix missing hip_version.h install and package" into amd-master-next
[ROCm/hip commit: 1d717a05e0]
2020-05-15 10:38:09 -04:00
Aaron Enye Shi 1da231b380 Fix missing hip_version.h install and package
Change-Id: I7182518f8c0dea6e00ffd649d1a29027fdc78b24


[ROCm/hip commit: a573ad7204]
2020-05-14 21:54:28 +00:00
agodavar 8760f75135 SWDEV-235875:Fix rocALUTION build error
Change-Id: I2f2a15687e943a860ff6ca05f22422f791a36c0a


[ROCm/hip commit: 07419c7635]
2020-05-14 14:43:25 -04:00
Payam e5a32d5511 updating envar to HIP_ROCCLR_HOME
Change-Id: Ic0bbf40638538307377e6db1a7379fb23dec8306


[ROCm/hip commit: 0153027fd8]
2020-05-14 12:59:51 -04:00
Evgeny 73eb4c6330 API tracing instrumentation
Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70


[ROCm/hip commit: 5abb8e1a68]
2020-05-14 11:03:09 -05:00
Payam 514b242275 adding support to disable lazy loading, HIP_ENABLE_LAZY_KERNEL_LOADING
Change-Id: Iafe0f396570ee2bccf642b537cbf9aad967e1370


[ROCm/hip commit: 56509b0ffd]
2020-05-13 17:18:48 -04:00
Aaron En Ye Shi 19449c03f8 Merge "Fix missing hip_prof_str.h in hip-base package" into amd-master-next
[ROCm/hip commit: 405a051bc0]
2020-05-13 16:25:06 -04:00
German Andryeyev 9b69ab8053 Merge "Correct mgpu coop launch test" into amd-master-next
[ROCm/hip commit: d461c82dc7]
2020-05-13 16:20:24 -04:00
Aaron Enye Shi 161bfe2e37 Fix missing hip_prof_str.h in hip-base package
Change-Id: Icd163ff83fadc2ee0e68f5755c652a45d2e781e5


[ROCm/hip commit: 7c7c8a50e2]
2020-05-13 19:13:16 +00:00
German Andryeyev 80bd9632ad Correct mgpu coop launch test
When the original size is devided accross all GPUs rounding can
occur, causing incorrect validation. Readjust the final value
for comparison to the new size accordingly.

Change-Id: I9b42149e33dfcb328de7419e546a0202a69a8610


[ROCm/hip commit: 20f0e36041]
2020-05-13 14:39:11 -04:00
Vlad Sytchenko 969c3284fb Correctly check if env vars are set
Change-Id: I365da786c822e9395aec5c6b75753ba297da3f64


[ROCm/hip commit: a45085f9b9]
2020-05-13 14:08:03 -04:00
Christophe Paquot 56a91401a4 Make sure to remove the global var from amd::MemObjMap
We need this otherwise ROCr can give us a matching address
for another allocation and doing "insert" in ROCclr will not
update the map with the newest object. We would then end up
using stale objects (yikes)
SWDEV-234992

Change-Id: I3475adf9781a9309d64a024fae45181d7e5afb04


[ROCm/hip commit: a03fee04fe]
2020-05-13 00:26:19 -07:00
Christophe Paquot 2588c88679 Add lock to addFatBinary and removeFatBinary
In case hipModule(Un)Load is called from different thread as hipInit we need to grab the lock
as both are going to modify modules_
Also add some logging for __hipExtractCodeObjectFromFatBinary in case binary isn't found for GPU
SWDEV-236032

Change-Id: Icbd72b412502df80d5066cea42a4fbcd5b0b8a98


[ROCm/hip commit: f100ae3679]
2020-05-12 16:04:08 -07:00
Aaron En Ye Shi 967c7306da Merge "search path logic updated for extractkernel SWDEV-230929" into amd-master-next
[ROCm/hip commit: da01762f1a]
2020-05-12 17:59:47 -04:00
Saleel Kudchadker cd7c57c3ff Merge "SWDEV-235495 Fix elapsed time calculation" into amd-master-next
[ROCm/hip commit: 7f548c5aa9]
2020-05-12 15:23:23 -04:00
Saleel Kudchadker 9aa52425e6 SWDEV-235495 Fix elapsed time calculation
This issue happens because we getLastQueuedCommand when recording
the event and do end_ - start_ so it takes the ticks for the
completion of the last command before event record. This may not
happen if one records a marker command for hipEventRecord

Change-Id: I1d6b06a5befb3b93f16b67692c59dca25c982e0f


[ROCm/hip commit: 43986c6791]
2020-05-12 09:38:01 -07:00
Christophe Paquot 71c2e64024 Merge "HPC : Intermittent hangs are observed while running Gromacs benchmarks" into amd-master-next
[ROCm/hip commit: 538f18f06c]
2020-05-12 11:21:47 -04:00
Karthik Jayaprakash fe7cfd3bfe Merge "SWDEV-227602 - Fixing hipFuncGetAttribute parameters" into amd-master-next
[ROCm/hip commit: 54d264c418]
2020-05-12 10:52:25 -04:00
Matt Arsenault 5d0b99f693 Use find_package to find ROCclr
Maintain compatability with the old finding for now for the
convenience of commit order.

Change-Id: I99b236cbb3d61b00650e3da7fe5931d4c4b3fec6


[ROCm/hip commit: 024764c337]
2020-05-12 09:28:19 -04:00
kjayapra-amd 38587ea539 SWDEV-227602 - Fixing hipFuncGetAttribute parameters
Change-Id: I46bd079372f453cabfaa2c709e2b30c69400dd33


[ROCm/hip commit: 78e27488be]
2020-05-12 09:15:43 -04:00
Matt Arsenault 98787af268 Use project relative include dir for generated header paths
Change-Id: I8c0834d68e6dac00b41a7059a978cbd2ebb933f9


[ROCm/hip commit: ab285297e9]
2020-05-11 21:57:25 -04:00
Payam Ghafari 36f71739d9 Merge "updating hipDeviceProps_t-< arch flags according to SWDEV-234277" into amd-master-next
[ROCm/hip commit: e48fb337e3]
2020-05-11 21:28:43 -04:00
Christophe Paquot 9bfa6e5c99 HPC : Intermittent hangs are observed while running Gromacs benchmarks
SWDEV-235579
Move the lock before destroying the queue as there's a multithreaded race condition if the queue
is being destroy and right after we set queue_ to nullptr, another thread can call ihipWaitStreams
which will then call create on that same stream because queue is now nullptr.
Moving the lock on streamSet prevents this from happening because we would remove the stream from that
list and therefore ihipWait will not try to call asHostQueue which tries to create the queue if not created yet
since the stream won't be in the list anymore

Change-Id: I3108657ab403d39d4123e83294fcf1f0880e5563


[ROCm/hip commit: 6b361bc1a0]
2020-05-11 16:35:13 -07:00
Icarus Sparry a16fd740bd Merge "Fix runtime failure on windows" into amd-master-next
[ROCm/hip commit: 33fcc105ea]
2020-05-11 18:58:17 -04:00
Icarus Sparry f47f8ad6f6 Fix runtime failure on windows
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>
Change-Id: I6c991a943e04ef29eff2905becbed0038c3b8ab9
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>


[ROCm/hip commit: 64f8ff5e70]
2020-05-11 17:30:47 -04:00
Payam 565d802d8c updating hipDeviceProps_t-< arch flags according to SWDEV-234277
Change-Id: I6238edf9a2df15a5dfe420dbb8723e10a0725f98


[ROCm/hip commit: a477612a22]
2020-05-11 16:43:13 -04:00
Matt Arsenault 39c4ca378a Start cleaning up rocclr include paths
Use target_include_directories instead of global includes.

Change-Id: I3adadc905843f152a548c71b2f12701476065207


[ROCm/hip commit: 16dbce3776]
2020-05-11 15:24:19 -04:00
Matt Arsenault 3a07d0e785 Remove some asm declarations for intrinsics
This technique should never be used, and only accessed through
__builtins.

There's currently no builtin for groupstaticsize. I left ds_swizzle
since for some reason it switches to the builtin based on __HCC__ or
not.

Change-Id: If1e1394221dba83ea4add6db5e94d6b715552044


[ROCm/hip commit: d2dd307c7d]
2020-05-11 15:20:58 -04:00
Michael Hong Bin Liao 175c7ac634 Merge "[hip] Fix -Wduplicate-decl-specifier warning. NFC." into amd-master-next
[ROCm/hip commit: fbc03e94c1]
2020-05-11 15:18:57 -04:00
Icarus Sparry 1bf3ae1c4a Merge "Correct quoting of arguments" into amd-master-next
[ROCm/hip commit: 0936d9d64d]
2020-05-11 11:39:47 -04:00
Michael LIAO aa4d6212fa [hip] Fix -Wduplicate-decl-specifier warning. NFC.
Change-Id: Iae48bbb7805c39f1005c920df8e76504426f2d3b


[ROCm/hip commit: a2dbcc075c]
2020-05-11 10:12:33 -04:00
Dittakavi Satyanvesh adcb645ab2 search path logic updated for extractkernel SWDEV-230929
Change-Id: I48d6332502774485d7ced3fee065a74f15774500


[ROCm/hip commit: 83f0c4402e]
2020-05-11 02:27:40 -04:00
Icarus Sparry b59442a555 Correct quoting of arguments
The hipcc script takes arguments and uses this to build up a new
command. Characters which are special to the shell need to be quoted
to prevent them being interpreted.

In particular adding
     --Wl,--enable-new-dtags -Wl,--rpath,'$ORIGIN:$ORIGIN/../lib'
to the command should pass quoted dollar signs into the resulting
string so the shell passes them on, rather than substituting the
values.

The arguments are processed in a conventional loop, but can be altered
during the course of the loop, and also by linker response files.

Tested by running
  HIPCC_VERBOSE=7 HIP_COMPILER=clang hipcc --cxxflags \
      fred.c -Wl,,--rpath,'$ORIGIN:$ORIGIN:/../lib'
and observing "-Wl,--rpath,\$ORIGIN\:\$ORIGIN\:..\/lib" in the
displayed hipcc-cmd output (and ignoring the errors due to rocm not
being installed)

Change-Id: I26b62f09ff3518cceeb85fa8823bb12a95c1c78e
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>


[ROCm/hip commit: a4f01ffca6]
2020-05-10 18:16:48 -04:00
Vlad Sytchenko 27da694f1c Correct HIP_FUNC_ATTRIBUTE_NUM_REGS query
Change-Id: I526cc7871c690260df0fa8c1b3b4b15fbc5af219


[ROCm/hip commit: fec51e85b0]
2020-05-09 12:42:30 -04:00
Christophe Paquot 424571c360 Merge "Don't add a null command to waitList in hipMemcpy" into amd-master-next
[ROCm/hip commit: 507f6bb0c3]
2020-05-08 17:01:38 -04:00
Christophe Paquot 201e411e6f Don't add a null command to waitList in hipMemcpy
SWDEV-235345

Change-Id: Ib1abd0ba6414d081891a9f5209df083c45734aee


[ROCm/hip commit: 3ca2d1e6a9]
2020-05-08 12:43:41 -07:00
Vlad Sytchenko 802b60054d Correct HIP_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK query
We should be returning the max workgroup size calculated by the compiler.

Change-Id: If86590efbb9b291f470bdbe87e5df992e661c539


[ROCm/hip commit: 1b1c032e9f]
2020-05-08 14:36:47 -04:00