Commit Graph

186 Commitit

Tekijä SHA1 Viesti Päivämäärä
Wen-Heng (Jack) Chung 7ebbbd3525 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fills memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantic
2019-03-04 17:00:33 +00:00
Wilkin Chau 8d92d1ebd7 Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.
2019-02-28 22:42:46 +00:00
Evgeny 0164464bcc fixing HSA_INIT_API cid args 2019-01-16 23:45:44 -06:00
Maneesh Gupta 56ce3e37d5 Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-12 10:16:07 +05:30
Maneesh Gupta 0dd26b4f63 Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
Added support for pinned 2D SDMA copy
2018-12-12 07:44:16 +05:30
Rahul Garg 5f12067708 Fixed hipPointerGetAttributes for hostmalloced ptr 2018-12-08 01:42:08 +05:30
Maneesh Gupta 160c509e23 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Maneesh Gupta bcea027bf1 Merge pull request #748 from mkuron/getsymboladdress
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-21 10:32:01 +05:30
Michael Kuron 8610128c3e Merge branch 'master' into getsymboladdress 2018-11-20 12:03:22 +01:00
Rahul Garg 1a038879a9 Fix hipHostRegister 2018-11-17 05:38:35 +05:30
Evgeny e5ba097afd renaming HIP_INIT_CB_API to HIP_INIT_API 2018-11-13 15:33:26 +00:00
Evgeny b8b1637ef7 adding activity prof layer 2018-11-13 15:33:26 +00:00
Rahul Garg 11e7ab8879 Fixed hipMemcpyToSymbol doesn't work on GPU other than device 0 SWDEV-166881 2018-11-13 00:49:20 +05:30
Michael Kuron 6ebcc2922c Use correct trace macro in hipGetSymbolAddress/hipGetSymbolSize 2018-11-06 20:46:30 +01:00
Michael Kuron 31acf1c268 Introduce ihipModuleGetGlobal 2018-11-06 09:54:34 +01:00
Michael Kuron 73616582d6 Implement hipGetSymbolAddress and hipGetSymbolSize 2018-11-04 10:39:34 +01:00
Siu Chi Chan 0ff408a56c Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Anton Gorenko 21f044eac8 Fix allocation size of arrays with multiple and/or non-32-bit channels
hipMallocArray and hipMalloc3DArray must use sum of bits
of all components.
2018-10-29 18:12:00 +06:00
Rahul Garg 90f57d452a Return hipSuccess when sizeBytes=0 in hipMemset 2018-09-26 12:47:36 +05:30
Rahul Garg 1e57764378 Added support for pinned 2D SDMA copy 2018-07-31 14:05:35 +05:30
Sarunya Pumma 8111fd3b8b Remove device mapping from shareWithAll memory
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device.  Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices.  Thus, it is skipped.

Note that earlier host memory is always mapped to device 0 and HIP
always performs peer checking for all kinds of hipMemcpy.  Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabs device 0's context lock.  Therefore, if there is
another thread holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy will have to wait for the
lock until it can actually perform memcpy.  This can significantly
deteriorate execution performance.

Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com>
2018-07-28 23:15:16 -07:00
Rahul Garg 7cd1d5e644 Revert "Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync" 2018-07-02 14:32:11 +05:30
Rahul Garg cd23905897 TEMP- fix memcpy2dAsync for trsm issue 2018-06-15 16:08:29 +05:30
Rahul Garg 069e2c34c9 Fix stream resolution in memcpy2dasync 2018-06-14 11:58:56 +05:30
Rahul Garg 00f8a36bc7 Fix retrieved locked ptr offset 2018-06-13 23:10:05 +05:30
Maneesh Gupta 9e9c039ee4 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
2018-06-06 14:44:02 +05:30
Rahul Garg a46ff2afd5 Fix hipMemcpy3D for fast path 2018-06-05 18:54:33 +05:30
Rahul Garg 276c948a16 Fix depth value for 3D allocations 2018-06-04 18:00:22 +05:30
Rahul Garg 8d6357669d Fix memcpy2D for malloc+ hostRegister 2018-05-31 13:14:27 +05:30
Rahul Garg d8cb47242b Use 64x4 grid dims 2018-05-24 23:51:52 +05:30
Rahul Garg 4ff059d641 Clean up and fix remaining bytes copy 2018-05-24 23:30:27 +05:30
Rahul Garg 981e56a68f Fix memcpy2d kernel dims 2018-05-24 17:00:12 +05:30
Rahul Garg dc179e0c33 Correct remaining bytes in copy 2d kernel 2018-05-24 08:27:24 +05:30
Rahul Garg 9a76d5b94c Optimize memcpy2D kernel use 2018-05-23 14:43:47 +05:30
Maneesh Gupta 323a6226b0 Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Rahul Garg f47a8236d7 Fixed memcpy2D for pinned memory case using 2D kernel 2018-05-21 22:14:45 +05:30
Maneesh Gupta 0180a82963 hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Rahul Garg afe62e7030 Fix for memcpy2DAsync for pinned host memory case 2018-05-18 21:09:50 +05:30
Maneesh Gupta 03ac8e6a92 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Rahul Garg 8f010ac68e Fixed hipMemcpy2D to handle 1D memcpy case 2018-05-16 11:07:10 +05:30
Rahul Garg da302c3e93 Added hipMemset3D 2018-05-07 10:24:30 +05:30
Lakhan Singh 6411ca1f6d Null checks added for hipmallocpitch and hipmemcpy apis 2018-05-03 09:27:50 +05:30
Rahul Garg 9de5f23d54 Fix texture 3D for HIP/NVCC 2018-05-02 11:56:37 +05:30
Lakhan Singh 1c2509dc04 SWDEV-141024 2018-04-20 17:40:00 +05:30
Rahul Garg 3cfb9c0d40 Added hipMemset2DAsync support 2018-04-17 18:27:27 +05:30
Rahul Garg 16c89d101a Correct missed ihipMemsetCopyDataType change 2018-04-12 10:27:19 +05:30
Rahul Garg 3d6eb75828 Changed ihipMemsetCopyDataType to ihipMemsetDataType 2018-04-12 09:29:22 +05:30
Rahul Garg 294bf50f68 Fix hipMemset stream resolution 2018-04-11 19:01:53 +05:30
Rahul Garg 412a35be20 hipMemset refactoring 2018-04-11 15:58:48 +05:30
Maneesh Gupta 03eca1c57e hipMemcpyAsync returns success when trying to copy 0 bytes
Change-Id: I4c0ee7ccc7563e2df657b50356cdd7fec9a1ef15
2018-04-09 12:39:44 +05:30