Wykres commitów

5380 Commity

Autor SHA1 Wiadomość Data
Aaron En Ye Shi cefc8e4b1f Merge "API tracing instrumentation" into amd-master-next 2020-05-15 11:18:14 -04:00
Aaron En Ye Shi 1d717a05e0 Merge "Fix missing hip_version.h install and package" into amd-master-next 2020-05-15 10:38:09 -04:00
Aaron Enye Shi a573ad7204 Fix missing hip_version.h install and package
Change-Id: I7182518f8c0dea6e00ffd649d1a29027fdc78b24
2020-05-14 21:54:28 +00:00
Payam 0153027fd8 updating envar to HIP_ROCCLR_HOME
Change-Id: Ic0bbf40638538307377e6db1a7379fb23dec8306
2020-05-14 12:59:51 -04:00
Evgeny 5abb8e1a68 API tracing instrumentation
Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70
2020-05-14 11:03:09 -05:00
Payam 56509b0ffd adding support to disable lazy loading, HIP_ENABLE_LAZY_KERNEL_LOADING
Change-Id: Iafe0f396570ee2bccf642b537cbf9aad967e1370
2020-05-13 17:18:48 -04:00
Aaron En Ye Shi 405a051bc0 Merge "Fix missing hip_prof_str.h in hip-base package" into amd-master-next 2020-05-13 16:25:06 -04:00
German Andryeyev d461c82dc7 Merge "Correct mgpu coop launch test" into amd-master-next 2020-05-13 16:20:24 -04:00
Aaron Enye Shi 7c7c8a50e2 Fix missing hip_prof_str.h in hip-base package
Change-Id: Icd163ff83fadc2ee0e68f5755c652a45d2e781e5
2020-05-13 19:13:16 +00:00
German Andryeyev 20f0e36041 Correct mgpu coop launch test
When the original size is devided accross all GPUs rounding can
occur, causing incorrect validation. Readjust the final value
for comparison to the new size accordingly.

Change-Id: I9b42149e33dfcb328de7419e546a0202a69a8610
2020-05-13 14:39:11 -04:00
Vlad Sytchenko a45085f9b9 Correctly check if env vars are set
Change-Id: I365da786c822e9395aec5c6b75753ba297da3f64
2020-05-13 14:08:03 -04:00
Christophe Paquot a03fee04fe Make sure to remove the global var from amd::MemObjMap
We need this otherwise ROCr can give us a matching address
for another allocation and doing "insert" in ROCclr will not
update the map with the newest object. We would then end up
using stale objects (yikes)
SWDEV-234992

Change-Id: I3475adf9781a9309d64a024fae45181d7e5afb04
2020-05-13 00:26:19 -07:00
Christophe Paquot f100ae3679 Add lock to addFatBinary and removeFatBinary
In case hipModule(Un)Load is called from different thread as hipInit we need to grab the lock
as both are going to modify modules_
Also add some logging for __hipExtractCodeObjectFromFatBinary in case binary isn't found for GPU
SWDEV-236032

Change-Id: Icbd72b412502df80d5066cea42a4fbcd5b0b8a98
2020-05-12 16:04:08 -07:00
Aaron En Ye Shi da01762f1a Merge "search path logic updated for extractkernel SWDEV-230929" into amd-master-next 2020-05-12 17:59:47 -04:00
Saleel Kudchadker 7f548c5aa9 Merge "SWDEV-235495 Fix elapsed time calculation" into amd-master-next 2020-05-12 15:23:23 -04:00
Saleel Kudchadker 43986c6791 SWDEV-235495 Fix elapsed time calculation
This issue happens because we getLastQueuedCommand when recording
the event and do end_ - start_ so it takes the ticks for the
completion of the last command before event record. This may not
happen if one records a marker command for hipEventRecord

Change-Id: I1d6b06a5befb3b93f16b67692c59dca25c982e0f
2020-05-12 09:38:01 -07:00
Christophe Paquot 538f18f06c Merge "HPC : Intermittent hangs are observed while running Gromacs benchmarks" into amd-master-next 2020-05-12 11:21:47 -04:00
Karthik Jayaprakash 54d264c418 Merge "SWDEV-227602 - Fixing hipFuncGetAttribute parameters" into amd-master-next 2020-05-12 10:52:25 -04:00
Matt Arsenault 024764c337 Use find_package to find ROCclr
Maintain compatability with the old finding for now for the
convenience of commit order.

Change-Id: I99b236cbb3d61b00650e3da7fe5931d4c4b3fec6
2020-05-12 09:28:19 -04:00
kjayapra-amd 78e27488be SWDEV-227602 - Fixing hipFuncGetAttribute parameters
Change-Id: I46bd079372f453cabfaa2c709e2b30c69400dd33
2020-05-12 09:15:43 -04:00
Matt Arsenault ab285297e9 Use project relative include dir for generated header paths
Change-Id: I8c0834d68e6dac00b41a7059a978cbd2ebb933f9
2020-05-11 21:57:25 -04:00
Payam Ghafari e48fb337e3 Merge "updating hipDeviceProps_t-< arch flags according to SWDEV-234277" into amd-master-next 2020-05-11 21:28:43 -04:00
Christophe Paquot 6b361bc1a0 HPC : Intermittent hangs are observed while running Gromacs benchmarks
SWDEV-235579
Move the lock before destroying the queue as there's a multithreaded race condition if the queue
is being destroy and right after we set queue_ to nullptr, another thread can call ihipWaitStreams
which will then call create on that same stream because queue is now nullptr.
Moving the lock on streamSet prevents this from happening because we would remove the stream from that
list and therefore ihipWait will not try to call asHostQueue which tries to create the queue if not created yet
since the stream won't be in the list anymore

Change-Id: I3108657ab403d39d4123e83294fcf1f0880e5563
2020-05-11 16:35:13 -07:00
Icarus Sparry 33fcc105ea Merge "Fix runtime failure on windows" into amd-master-next 2020-05-11 18:58:17 -04:00
Icarus Sparry 64f8ff5e70 Fix runtime failure on windows
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>
Change-Id: I6c991a943e04ef29eff2905becbed0038c3b8ab9
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>
2020-05-11 17:30:47 -04:00
Payam a477612a22 updating hipDeviceProps_t-< arch flags according to SWDEV-234277
Change-Id: I6238edf9a2df15a5dfe420dbb8723e10a0725f98
2020-05-11 16:43:13 -04:00
Matt Arsenault 16dbce3776 Start cleaning up rocclr include paths
Use target_include_directories instead of global includes.

Change-Id: I3adadc905843f152a548c71b2f12701476065207
2020-05-11 15:24:19 -04:00
Matt Arsenault d2dd307c7d Remove some asm declarations for intrinsics
This technique should never be used, and only accessed through
__builtins.

There's currently no builtin for groupstaticsize. I left ds_swizzle
since for some reason it switches to the builtin based on __HCC__ or
not.

Change-Id: If1e1394221dba83ea4add6db5e94d6b715552044
2020-05-11 15:20:58 -04:00
Michael Hong Bin Liao fbc03e94c1 Merge "[hip] Fix -Wduplicate-decl-specifier warning. NFC." into amd-master-next 2020-05-11 15:18:57 -04:00
Icarus Sparry 0936d9d64d Merge "Correct quoting of arguments" into amd-master-next 2020-05-11 11:39:47 -04:00
Michael LIAO a2dbcc075c [hip] Fix -Wduplicate-decl-specifier warning. NFC.
Change-Id: Iae48bbb7805c39f1005c920df8e76504426f2d3b
2020-05-11 10:12:33 -04:00
Dittakavi Satyanvesh 83f0c4402e search path logic updated for extractkernel SWDEV-230929
Change-Id: I48d6332502774485d7ced3fee065a74f15774500
2020-05-11 02:27:40 -04:00
Icarus Sparry a4f01ffca6 Correct quoting of arguments
The hipcc script takes arguments and uses this to build up a new
command. Characters which are special to the shell need to be quoted
to prevent them being interpreted.

In particular adding
     --Wl,--enable-new-dtags -Wl,--rpath,'$ORIGIN:$ORIGIN/../lib'
to the command should pass quoted dollar signs into the resulting
string so the shell passes them on, rather than substituting the
values.

The arguments are processed in a conventional loop, but can be altered
during the course of the loop, and also by linker response files.

Tested by running
  HIPCC_VERBOSE=7 HIP_COMPILER=clang hipcc --cxxflags \
      fred.c -Wl,,--rpath,'$ORIGIN:$ORIGIN:/../lib'
and observing "-Wl,--rpath,\$ORIGIN\:\$ORIGIN\:..\/lib" in the
displayed hipcc-cmd output (and ignoring the errors due to rocm not
being installed)

Change-Id: I26b62f09ff3518cceeb85fa8823bb12a95c1c78e
Signed-off-by: Icarus Sparry <icarus.sparry@amd.com>
2020-05-10 18:16:48 -04:00
Vlad Sytchenko fec51e85b0 Correct HIP_FUNC_ATTRIBUTE_NUM_REGS query
Change-Id: I526cc7871c690260df0fa8c1b3b4b15fbc5af219
2020-05-09 12:42:30 -04:00
Christophe Paquot 507f6bb0c3 Merge "Don't add a null command to waitList in hipMemcpy" into amd-master-next 2020-05-08 17:01:38 -04:00
Christophe Paquot 3ca2d1e6a9 Don't add a null command to waitList in hipMemcpy
SWDEV-235345

Change-Id: Ib1abd0ba6414d081891a9f5209df083c45734aee
2020-05-08 12:43:41 -07:00
Vlad Sytchenko 1b1c032e9f Correct HIP_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK query
We should be returning the max workgroup size calculated by the compiler.

Change-Id: If86590efbb9b291f470bdbe87e5df992e661c539
2020-05-08 14:36:47 -04:00
Vlad Sytchenko a373538d72 Fix confusion in hipFuncGetAttribute()
Cuda shared == OpenCL local

Cuda local == OpenCL private

Change-Id: I5a204945ecde35919b9e9def20bbb2662fffea2b
2020-05-08 14:36:36 -04:00
Matt Arsenault 6262ce482f Don't add llvm includes to rocclr build
There are no llvm includes here.

Change-Id: I0567c6e47e717580819f3854782ba08501bd069d
2020-05-08 12:24:49 -04:00
Anusha Godavarthy Surya 39e9205044 Merge "SWDEV-233950:link to /opt/rocm/hip/lib/cmake left behind during uninstall" into amd-master-next 2020-05-08 12:12:06 -04:00
agodavar a9bc94129b SWDEV-234447:Fix error Use of uninitialized value in concatenation (.)
Change-Id: Ia6f8d23f100826438f152c175d10ac3096ee024e
2020-05-08 12:07:24 -04:00
Matt Arsenault 73bc14ec49 Don't write generated version.h to source directory
Change-Id: Ib4e7c3171cd227725ae35fed66167df528a4c431
2020-05-08 10:59:28 -04:00
Aaron Enye Shi 5fbae827c2 SWDEV-234954 - Fix hipconfig on Windows
Before setting the HIP_RUNTIME and HIP_COMPILER variables, first check the environment if these are set. We should prioritize the environment settings. For windows, it will be set, and also explicitly call perl when invoking hipconfig.

Change-Id: I89ad267285239e6d8a897dc681c4af5906e7b9d8
2020-05-08 13:27:36 +00:00
agodavar 7a32fc43a6 SWDEV-233950:link to /opt/rocm/hip/lib/cmake left behind during uninstall
Change-Id: Ideb76b73916881469e39b7a0a4d68d7ec098b4a8
Signed-off-by: agodavar <Anusha.GodavarthySurya@amd.com>
2020-05-08 07:36:28 -04:00
Sarbojit Sarkar 3612851809 Enabling hipGetDeviceFlags required in [SWDEV-229170]
Change-Id: I998d37e5847f9651345554bada86df6fce86d1eb
2020-05-08 01:37:23 -04:00
Matt Arsenault af4692f838 Fix generated hip_prof_str.h install again
Change-Id: I92ea4bedbcdc92974998c00ca7f0c00098791cb5
2020-05-07 12:40:39 -04:00
Aaron En Ye Shi 73e759eab0 Merge "Fix missed VDI names in hip-on-rocclr" into amd-master-next 2020-05-07 12:30:51 -04:00
agodavar 0231de3164 Fix hip-config.cmake build error in rocBLAS
Change-Id: I097f0dac0f67bfc22e9991350bf63bc7bfa8b269
2020-05-07 12:24:20 -04:00
Aaron Enye Shi 1c1be71b63 Fix missed VDI names in hip-on-rocclr
Change-Id: I830feb37a043656136648e92a0c6f1eaae8402d7
2020-05-07 08:37:28 -04:00
Tao Sang 861d07fbd3 Merge "Support performance tests" into amd-master-next 2020-05-07 07:15:09 -04:00