Граф коммитов

179 Коммитов

Автор SHA1 Сообщение Дата
Maneesh Gupta 160c509e23 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Maneesh Gupta bcea027bf1 Merge pull request #748 from mkuron/getsymboladdress
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-21 10:32:01 +05:30
Michael Kuron 8610128c3e Merge branch 'master' into getsymboladdress 2018-11-20 12:03:22 +01:00
Rahul Garg 1a038879a9 Fix hipHostRegister 2018-11-17 05:38:35 +05:30
Evgeny e5ba097afd renaming HIP_INIT_CB_API to HIP_INIT_API 2018-11-13 15:33:26 +00:00
Evgeny b8b1637ef7 adding activity prof layer 2018-11-13 15:33:26 +00:00
Rahul Garg 11e7ab8879 Fixed hipMemcpyToSymbol doesn't work on GPU other than device 0 SWDEV-166881 2018-11-13 00:49:20 +05:30
Michael Kuron 6ebcc2922c Use correct trace macro in hipGetSymbolAddress/hipGetSymbolSize 2018-11-06 20:46:30 +01:00
Michael Kuron 31acf1c268 Introduce ihipModuleGetGlobal 2018-11-06 09:54:34 +01:00
Michael Kuron 73616582d6 Implement hipGetSymbolAddress and hipGetSymbolSize 2018-11-04 10:39:34 +01:00
Siu Chi Chan 0ff408a56c Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Anton Gorenko 21f044eac8 Fix allocation size of arrays with multiple and/or non-32-bit channels
hipMallocArray and hipMalloc3DArray must use sum of bits
of all components.
2018-10-29 18:12:00 +06:00
Rahul Garg 90f57d452a Return hipSuccess when sizeBytes=0 in hipMemset 2018-09-26 12:47:36 +05:30
Sarunya Pumma 8111fd3b8b Remove device mapping from shareWithAll memory
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device.  Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices.  Thus, it is skipped.

Note that earlier host memory is always mapped to device 0 and HIP
always performs peer checking for all kinds of hipMemcpy.  Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabs device 0's context lock.  Therefore, if there is
another thread holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy will have to wait for the
lock until it can actually perform memcpy.  This can significantly
deteriorate execution performance.

Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com>
2018-07-28 23:15:16 -07:00
Rahul Garg 7cd1d5e644 Revert "Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync" 2018-07-02 14:32:11 +05:30
Rahul Garg cd23905897 TEMP- fix memcpy2dAsync for trsm issue 2018-06-15 16:08:29 +05:30
Rahul Garg 069e2c34c9 Fix stream resolution in memcpy2dasync 2018-06-14 11:58:56 +05:30
Rahul Garg 00f8a36bc7 Fix retrieved locked ptr offset 2018-06-13 23:10:05 +05:30
Maneesh Gupta 9e9c039ee4 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
2018-06-06 14:44:02 +05:30
Rahul Garg a46ff2afd5 Fix hipMemcpy3D for fast path 2018-06-05 18:54:33 +05:30
Rahul Garg 276c948a16 Fix depth value for 3D allocations 2018-06-04 18:00:22 +05:30
Rahul Garg 8d6357669d Fix memcpy2D for malloc+ hostRegister 2018-05-31 13:14:27 +05:30
Rahul Garg d8cb47242b Use 64x4 grid dims 2018-05-24 23:51:52 +05:30
Rahul Garg 4ff059d641 Clean up and fix remaining bytes copy 2018-05-24 23:30:27 +05:30
Rahul Garg 981e56a68f Fix memcpy2d kernel dims 2018-05-24 17:00:12 +05:30
Rahul Garg dc179e0c33 Correct remaining bytes in copy 2d kernel 2018-05-24 08:27:24 +05:30
Rahul Garg 9a76d5b94c Optimize memcpy2D kernel use 2018-05-23 14:43:47 +05:30
Maneesh Gupta 323a6226b0 Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Rahul Garg f47a8236d7 Fixed memcpy2D for pinned memory case using 2D kernel 2018-05-21 22:14:45 +05:30
Maneesh Gupta 0180a82963 hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Rahul Garg afe62e7030 Fix for memcpy2DAsync for pinned host memory case 2018-05-18 21:09:50 +05:30
Maneesh Gupta 03ac8e6a92 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Rahul Garg 8f010ac68e Fixed hipMemcpy2D to handle 1D memcpy case 2018-05-16 11:07:10 +05:30
Rahul Garg da302c3e93 Added hipMemset3D 2018-05-07 10:24:30 +05:30
Lakhan Singh 6411ca1f6d Null checks added for hipmallocpitch and hipmemcpy apis 2018-05-03 09:27:50 +05:30
Rahul Garg 9de5f23d54 Fix texture 3D for HIP/NVCC 2018-05-02 11:56:37 +05:30
Lakhan Singh 1c2509dc04 SWDEV-141024 2018-04-20 17:40:00 +05:30
Rahul Garg 3cfb9c0d40 Added hipMemset2DAsync support 2018-04-17 18:27:27 +05:30
Rahul Garg 16c89d101a Correct missed ihipMemsetCopyDataType change 2018-04-12 10:27:19 +05:30
Rahul Garg 3d6eb75828 Changed ihipMemsetCopyDataType to ihipMemsetDataType 2018-04-12 09:29:22 +05:30
Rahul Garg 294bf50f68 Fix hipMemset stream resolution 2018-04-11 19:01:53 +05:30
Rahul Garg 412a35be20 hipMemset refactoring 2018-04-11 15:58:48 +05:30
Maneesh Gupta 03eca1c57e hipMemcpyAsync returns success when trying to copy 0 bytes
Change-Id: I4c0ee7ccc7563e2df657b50356cdd7fec9a1ef15
2018-04-09 12:39:44 +05:30
Maneesh Gupta 1ba06f63c4 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu dc7560ef22 Change directory name to match HIP lowercase style. 2018-02-22 13:15:10 +00:00
Maneesh Gupta 4b8ae78891 Merge pull request #321 from gargrahul/hipMemcpyArray_Functions
Added support for hipMemcpy Array functions-
2018-02-12 10:36:38 +05:30
Rahul Garg 24ab820a11 Fixed host allocated globals address lookup for host usage
Fixed texture driver APIs failure
2018-01-30 18:06:31 +05:30
Rahul Garg 487a430b5a Added support for -
- hipMemcpyFromArray
- hipMemcpyAtoH
- hipMemcpyHtoA
2018-01-16 11:44:19 +05:30
Rahul Garg 115c7f2b79 Added support for
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Ben Sander 9bba97fdcc Fix some cppcheck style issues. 2017-12-01 20:45:34 +00:00