Maneesh Gupta
160c509e23
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Maneesh Gupta
bcea027bf1
Merge pull request #748 from mkuron/getsymboladdress
...
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-21 10:32:01 +05:30
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Rahul Garg
1a038879a9
Fix hipHostRegister
2018-11-17 05:38:35 +05:30
Evgeny
e5ba097afd
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
b8b1637ef7
adding activity prof layer
2018-11-13 15:33:26 +00:00
Rahul Garg
11e7ab8879
Fixed hipMemcpyToSymbol doesn't work on GPU other than device 0 SWDEV-166881
2018-11-13 00:49:20 +05:30
Michael Kuron
6ebcc2922c
Use correct trace macro in hipGetSymbolAddress/hipGetSymbolSize
2018-11-06 20:46:30 +01:00
Michael Kuron
31acf1c268
Introduce ihipModuleGetGlobal
2018-11-06 09:54:34 +01:00
Michael Kuron
73616582d6
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-04 10:39:34 +01:00
Siu Chi Chan
0ff408a56c
Move the global arrays for hip malloc/free
...
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Anton Gorenko
21f044eac8
Fix allocation size of arrays with multiple and/or non-32-bit channels
...
hipMallocArray and hipMalloc3DArray must use sum of bits
of all components.
2018-10-29 18:12:00 +06:00
Rahul Garg
90f57d452a
Return hipSuccess when sizeBytes=0 in hipMemset
2018-09-26 12:47:36 +05:30
Sarunya Pumma
8111fd3b8b
Remove device mapping from shareWithAll memory
...
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device. Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices. Thus, it is skipped.
Note that earlier host memory is always mapped to device 0 and HIP
always performs peer checking for all kinds of hipMemcpy. Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabs device 0's context lock. Therefore, if there is
another thread holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy will have to wait for the
lock until it can actually perform memcpy. This can significantly
deteriorate execution performance.
Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com >
2018-07-28 23:15:16 -07:00
Rahul Garg
7cd1d5e644
Revert "Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync"
2018-07-02 14:32:11 +05:30
Rahul Garg
cd23905897
TEMP- fix memcpy2dAsync for trsm issue
2018-06-15 16:08:29 +05:30
Rahul Garg
069e2c34c9
Fix stream resolution in memcpy2dasync
2018-06-14 11:58:56 +05:30
Rahul Garg
00f8a36bc7
Fix retrieved locked ptr offset
2018-06-13 23:10:05 +05:30
Maneesh Gupta
9e9c039ee4
Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
...
Fix hipMemcpy3D for fast path
2018-06-06 14:44:02 +05:30
Rahul Garg
a46ff2afd5
Fix hipMemcpy3D for fast path
2018-06-05 18:54:33 +05:30
Rahul Garg
276c948a16
Fix depth value for 3D allocations
2018-06-04 18:00:22 +05:30
Rahul Garg
8d6357669d
Fix memcpy2D for malloc+ hostRegister
2018-05-31 13:14:27 +05:30
Rahul Garg
d8cb47242b
Use 64x4 grid dims
2018-05-24 23:51:52 +05:30
Rahul Garg
4ff059d641
Clean up and fix remaining bytes copy
2018-05-24 23:30:27 +05:30
Rahul Garg
981e56a68f
Fix memcpy2d kernel dims
2018-05-24 17:00:12 +05:30
Rahul Garg
dc179e0c33
Correct remaining bytes in copy 2d kernel
2018-05-24 08:27:24 +05:30
Rahul Garg
9a76d5b94c
Optimize memcpy2D kernel use
2018-05-23 14:43:47 +05:30
Maneesh Gupta
323a6226b0
Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
...
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Rahul Garg
f47a8236d7
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-21 22:14:45 +05:30
Maneesh Gupta
0180a82963
hipMemcpy returns success if sizeBytes is 0.
...
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Rahul Garg
afe62e7030
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 21:09:50 +05:30
Maneesh Gupta
03ac8e6a92
Merge pull request #433 from gargrahul/add_hipmemset3d
...
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Rahul Garg
8f010ac68e
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-16 11:07:10 +05:30
Rahul Garg
da302c3e93
Added hipMemset3D
2018-05-07 10:24:30 +05:30
Lakhan Singh
6411ca1f6d
Null checks added for hipmallocpitch and hipmemcpy apis
2018-05-03 09:27:50 +05:30
Rahul Garg
9de5f23d54
Fix texture 3D for HIP/NVCC
2018-05-02 11:56:37 +05:30
Lakhan Singh
1c2509dc04
SWDEV-141024
2018-04-20 17:40:00 +05:30
Rahul Garg
3cfb9c0d40
Added hipMemset2DAsync support
2018-04-17 18:27:27 +05:30
Rahul Garg
16c89d101a
Correct missed ihipMemsetCopyDataType change
2018-04-12 10:27:19 +05:30
Rahul Garg
3d6eb75828
Changed ihipMemsetCopyDataType to ihipMemsetDataType
2018-04-12 09:29:22 +05:30
Rahul Garg
294bf50f68
Fix hipMemset stream resolution
2018-04-11 19:01:53 +05:30
Rahul Garg
412a35be20
hipMemset refactoring
2018-04-11 15:58:48 +05:30
Maneesh Gupta
03eca1c57e
hipMemcpyAsync returns success when trying to copy 0 bytes
...
Change-Id: I4c0ee7ccc7563e2df657b50356cdd7fec9a1ef15
2018-04-09 12:39:44 +05:30
Maneesh Gupta
1ba06f63c4
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu
dc7560ef22
Change directory name to match HIP lowercase style.
2018-02-22 13:15:10 +00:00
Maneesh Gupta
4b8ae78891
Merge pull request #321 from gargrahul/hipMemcpyArray_Functions
...
Added support for hipMemcpy Array functions-
2018-02-12 10:36:38 +05:30
Rahul Garg
24ab820a11
Fixed host allocated globals address lookup for host usage
...
Fixed texture driver APIs failure
2018-01-30 18:06:31 +05:30
Rahul Garg
487a430b5a
Added support for -
...
- hipMemcpyFromArray
- hipMemcpyAtoH
- hipMemcpyHtoA
2018-01-16 11:44:19 +05:30
Rahul Garg
115c7f2b79
Added support for
...
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Ben Sander
9bba97fdcc
Fix some cppcheck style issues.
2017-12-01 20:45:34 +00:00