Alex Voicu
0c16497abd
dlopen() fixes (#929)
...
* Initial attempt to switch over to internally linked state.
* Add missing CMake update.
* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.
* Ensure global retrieval uses internally linked state.
* Hide HC in the implementation. Minimise ADL woes.
* Strange software exists, and must be catered to.
* Use a less spammy mechanism for ensuring internal linkage / non-export.
* Remove leftover internal detail.
[ROCm/hip commit: ea0fcf3e61 ]
2019-03-06 17:31:44 +05:30
Wen-Heng (Jack) Chung
da589e38ed
Address code review comments to use hipDeviceptr_t
...
[ROCm/hip commit: 5cbd28f29b ]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung
0b7f38d100
Add hipMemsetD32 and hipMemsetD32Async
...
Add 2 extra memset functions which fill memory with integer-typed data.
Also change the parameters of ihipMemset to better explain the semantics.
[ROCm/hip commit: 7ebbbd3525 ]
2019-03-04 17:00:33 +00:00
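To illustrate what separates an integer-typed memset from a byte-wise one, here is a host-only sketch. It is not the HIP implementation: `fillBytes` and `fillWords32` are hypothetical helper names, and treating `count` as a number of 32-bit elements is an assumption modelled on the convention of the CUDA driver API's `cuMemsetD32`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical host-side stand-in for a byte-wise memset: every byte of the
// destination receives the low 8 bits of `value`.
inline void fillBytes(void* dst, int value, std::size_t sizeBytes) {
    std::memset(dst, value, sizeBytes);
}

// Hypothetical host-side stand-in for a 32-bit memset: `count` is a number of
// 32-bit elements (an assumption), and each element receives the full value.
inline void fillWords32(std::uint32_t* dst, std::uint32_t value, std::size_t count) {
    assert(dst != nullptr || count == 0);
    for (std::size_t i = 0; i < count; ++i) {
        dst[i] = value;
    }
}
```

With a byte-wise fill, a value such as 0x01 is replicated into every byte, so each 32-bit word reads back as 0x01010101. The 32-bit fill stores the whole word unchanged.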
Wilkin Chau
4a0d68ba3f
Fix hipMemset3D test
...
Calculate the allocated size based on the width, height and depth.
[ROCm/hip commit: 8d92d1ebd7 ]
2019-02-28 22:42:46 +00:00
Evgeny
3f7ff3450e
fixing HSA_INIT_API cid args
...
[ROCm/hip commit: 0164464bcc ]
2019-01-16 23:45:44 -06:00
Maneesh Gupta
3cf96f31d0
Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
...
Fixed hipPointerGetAttributes for hostmalloced ptr
[ROCm/hip commit: 56ce3e37d5 ]
2018-12-12 10:16:07 +05:30
Maneesh Gupta
07dcdff9e5
Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
...
Added support for pinned 2D SDMA copy
[ROCm/hip commit: 0dd26b4f63 ]
2018-12-12 07:44:16 +05:30
Rahul Garg
b304ff5210
Fixed hipPointerGetAttributes for hostmalloced ptr
...
[ROCm/hip commit: 5f12067708 ]
2018-12-08 01:42:08 +05:30
Maneesh Gupta
05e09614be
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
[ROCm/hip commit: 160c509e23 ]
2018-11-23 11:08:25 +05:30
Maneesh Gupta
e2bc3e49a5
Merge pull request #748 from mkuron/getsymboladdress
...
Implement hipGetSymbolAddress and hipGetSymbolSize
[ROCm/hip commit: bcea027bf1 ]
2018-11-21 10:32:01 +05:30
Michael Kuron
c35dfb71d5
Merge branch 'master' into getsymboladdress
...
[ROCm/hip commit: 8610128c3e ]
2018-11-20 12:03:22 +01:00
Rahul Garg
89efed29d7
Fix hipHostRegister
...
[ROCm/hip commit: 1a038879a9 ]
2018-11-17 05:38:35 +05:30
Evgeny
73e3c4ec42
renaming HIP_INIT_CB_API to HIP_INIT_API
...
[ROCm/hip commit: e5ba097afd ]
2018-11-13 15:33:26 +00:00
Evgeny
0a58dc9b7b
adding activity prof layer
...
[ROCm/hip commit: b8b1637ef7 ]
2018-11-13 15:33:26 +00:00
Rahul Garg
ecea878072
Fixed hipMemcpyToSymbol not working on GPUs other than device 0 (SWDEV-166881)
...
[ROCm/hip commit: 11e7ab8879 ]
2018-11-13 00:49:20 +05:30
Michael Kuron
f69866eecc
Use correct trace macro in hipGetSymbolAddress/hipGetSymbolSize
...
[ROCm/hip commit: 6ebcc2922c ]
2018-11-06 20:46:30 +01:00
Michael Kuron
cbba8221ee
Introduce ihipModuleGetGlobal
...
[ROCm/hip commit: 31acf1c268 ]
2018-11-06 09:54:34 +01:00
Michael Kuron
bc455ccf50
Implement hipGetSymbolAddress and hipGetSymbolSize
...
[ROCm/hip commit: 73616582d6 ]
2018-11-04 10:39:34 +01:00
Siu Chi Chan
1159b4aa05
Move the global arrays for hip malloc/free
...
from a header into a source file so that
there is only one copy in an executable,
which avoids wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
[ROCm/hip commit: 0ff408a56c ]
2018-11-01 16:20:35 -04:00
Anton Gorenko
f2ce51bdf5
Fix allocation size of arrays with multiple and/or non-32-bit channels
...
hipMallocArray and hipMalloc3DArray must use the sum of the bits
of all components.
[ROCm/hip commit: 21f044eac8 ]
2018-10-29 18:12:00 +06:00
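The sizing rule in the commit above (element size is the sum of the bits of all components) can be sketched as follows. This is an illustration under stated assumptions, not the code from the commit: `ChannelBits`, `bytesPerElement` and `arrayAllocationBytes` are hypothetical names, the struct only mirrors the per-component bit counts of a channel descriptor, the total is assumed to be a multiple of 8, and a height or depth of 0 is assumed to mean a lower-dimensional array.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a channel descriptor: bits per component,
// with 0 meaning the component is unused.
struct ChannelBits {
    int x, y, z, w;
};

// Element size is derived from the sum of all component bit widths,
// not from a single component or a fixed 32 bits.
inline std::size_t bytesPerElement(const ChannelBits& c) {
    assert(c.x >= 0 && c.y >= 0 && c.z >= 0 && c.w >= 0);
    const int totalBits = c.x + c.y + c.z + c.w;
    return static_cast<std::size_t>(totalBits) / 8;  // assumes a multiple of 8
}

// Total allocation size; a height or depth of 0 is treated as 1 (assumption).
inline std::size_t arrayAllocationBytes(const ChannelBits& c, std::size_t width,
                                        std::size_t height, std::size_t depth) {
    const std::size_t h = height ? height : 1;
    const std::size_t d = depth ? depth : 1;
    return width * h * d * bytesPerElement(c);
}
```

For example, a four-component 32-bit format yields 16 bytes per element, while a single 8-bit channel yields 1 byte, so sizing every element as 4 bytes would be wrong in both cases.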
Rahul Garg
6d53af5a60
Return hipSuccess when sizeBytes=0 in hipMemset
...
[ROCm/hip commit: 90f57d452a ]
2018-09-26 12:47:36 +05:30
Rahul Garg
81074364c8
Added support for pinned 2D SDMA copy
...
[ROCm/hip commit: 1e57764378 ]
2018-07-31 14:05:35 +05:30
Sarunya Pumma
a68ea730c2
Remove device mapping from shareWithAll memory
...
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device. Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices. Thus, it is skipped.
Note that previously host memory was always mapped to device 0 and HIP
always performed peer checking for all kinds of hipMemcpy. Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabbed device 0's context lock. Therefore, if another
thread was holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy had to wait for the
lock before it could actually perform the memcpy. This could
significantly degrade performance.
Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com>
[ROCm/hip commit: 8111fd3b8b ]
2018-07-28 23:15:16 -07:00
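The idea described in the commit above (mark shared memory with an appId of -1 and skip the peer check, and with it the context lock) can be sketched in simplified form. Everything here is a hypothetical stand-in: `PointerInfo` keeps only an `appId` field, `DeviceContext` counts lock acquisitions instead of taking a real lock, and `canSeeMemory` is a free function rather than the actual `ihipStream_t::canSeeMemory` member.

```cpp
#include <cassert>

// Hypothetical, simplified pointer-tracking record: the owning device index,
// or -1 when the memory is shared with all devices and mapped to none.
struct PointerInfo {
    int appId;
};

constexpr int kSharedWithAll = -1;

// Hypothetical stand-in for a device context. A real peer check would take
// the context lock; here the acquisition is only counted.
struct DeviceContext {
    int deviceId;
    int lockAcquisitions;

    explicit DeviceContext(int id) : deviceId(id), lockAcquisitions(0) {}

    bool isPeerOf(int otherDevice) {
        ++lockAcquisitions;  // models grabbing the context lock
        return otherDevice == deviceId;
    }
};

// Memory shared with all devices is visible without any peer check,
// so no context lock is touched on that path.
inline bool canSeeMemory(DeviceContext& ctx, const PointerInfo& info) {
    if (info.appId == kSharedWithAll) {
        return true;
    }
    return ctx.isPeerOf(info.appId);
}
```

In this sketch a copy involving shared host memory never increments `lockAcquisitions`, which models why such a copy no longer waits on another thread that holds device 0's context lock.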
Rahul Garg
c957c42c20
Revert "Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync"
...
[ROCm/hip commit: 7cd1d5e644 ]
2018-07-02 14:32:11 +05:30
Rahul Garg
388679efc8
TEMP- fix memcpy2dAsync for trsm issue
...
[ROCm/hip commit: cd23905897 ]
2018-06-15 16:08:29 +05:30
Rahul Garg
312999de41
Fix stream resolution in memcpy2dasync
...
[ROCm/hip commit: 069e2c34c9 ]
2018-06-14 11:58:56 +05:30
Rahul Garg
1d6396dfb9
Fix retrieved locked ptr offset
...
[ROCm/hip commit: 00f8a36bc7 ]
2018-06-13 23:10:05 +05:30
Maneesh Gupta
ac027e4092
Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
...
Fix hipMemcpy3D for fast path
[ROCm/hip commit: 9e9c039ee4 ]
2018-06-06 14:44:02 +05:30
Rahul Garg
e7bc68d347
Fix hipMemcpy3D for fast path
...
[ROCm/hip commit: a46ff2afd5 ]
2018-06-05 18:54:33 +05:30
Rahul Garg
6592b35c39
Fix depth value for 3D allocations
...
[ROCm/hip commit: 276c948a16 ]
2018-06-04 18:00:22 +05:30
Rahul Garg
46e623fb31
Fix memcpy2D for malloc + hostRegister
...
[ROCm/hip commit: 8d6357669d ]
2018-05-31 13:14:27 +05:30
Rahul Garg
4021f68f64
Use 64x4 grid dims
...
[ROCm/hip commit: d8cb47242b ]
2018-05-24 23:51:52 +05:30
Rahul Garg
35169c5191
Clean up and fix remaining bytes copy
...
[ROCm/hip commit: 4ff059d641 ]
2018-05-24 23:30:27 +05:30
Rahul Garg
fb745baa7e
Fix memcpy2d kernel dims
...
[ROCm/hip commit: 981e56a68f ]
2018-05-24 17:00:12 +05:30
Rahul Garg
fb1425959e
Correct remaining bytes in copy 2d kernel
...
[ROCm/hip commit: dc179e0c33 ]
2018-05-24 08:27:24 +05:30
Rahul Garg
08f750571d
Optimize memcpy2D kernel use
...
[ROCm/hip commit: 9a76d5b94c ]
2018-05-23 14:43:47 +05:30
Maneesh Gupta
06db862856
Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
...
Fixed memcpy2D for pinned memory case using 2D kernel
[ROCm/hip commit: 323a6226b0 ]
2018-05-22 10:42:28 +05:30
Rahul Garg
f02803c527
Fixed memcpy2D for pinned memory case using 2D kernel
...
[ROCm/hip commit: f47a8236d7 ]
2018-05-21 22:14:45 +05:30
Maneesh Gupta
182f8ff28f
hipMemcpy returns success if sizeBytes is 0.
...
Fixes SWDEV-153754 & SWDEV-154178.
[ROCm/hip commit: 0180a82963 ]
2018-05-21 15:38:44 +05:30
Rahul Garg
14030c3f17
Fix for memcpy2DAsync for pinned host memory case
...
[ROCm/hip commit: afe62e7030 ]
2018-05-18 21:09:50 +05:30
Maneesh Gupta
3d1d7ccf30
Merge pull request #433 from gargrahul/add_hipmemset3d
...
Added hipMemset3D
[ROCm/hip commit: 03ac8e6a92 ]
2018-05-18 14:54:15 +05:30
Rahul Garg
4c44cd4a88
Fixed hipMemcpy2D to handle 1D memcpy case
...
[ROCm/hip commit: 8f010ac68e ]
2018-05-16 11:07:10 +05:30
Rahul Garg
e2a2b5bdcf
Added hipMemset3D
...
[ROCm/hip commit: da302c3e93 ]
2018-05-07 10:24:30 +05:30
Lakhan Singh
51dbf4f5ca
Null checks added for hipMallocPitch and hipMemcpy APIs
...
[ROCm/hip commit: 6411ca1f6d ]
2018-05-03 09:27:50 +05:30
Rahul Garg
ab1dabe61b
Fix texture 3D for HIP/NVCC
...
[ROCm/hip commit: 9de5f23d54 ]
2018-05-02 11:56:37 +05:30
Lakhan Singh
701de3092b
SWDEV-141024
...
[ROCm/hip commit: 1c2509dc04 ]
2018-04-20 17:40:00 +05:30
Rahul Garg
e1e88f3bff
Added hipMemset2DAsync support
...
[ROCm/hip commit: 3cfb9c0d40 ]
2018-04-17 18:27:27 +05:30
Rahul Garg
89511823f0
Correct missed ihipMemsetCopyDataType change
...
[ROCm/hip commit: 16c89d101a ]
2018-04-12 10:27:19 +05:30
Rahul Garg
abe14442a5
Changed ihipMemsetCopyDataType to ihipMemsetDataType
...
[ROCm/hip commit: 3d6eb75828 ]
2018-04-12 09:29:22 +05:30
Rahul Garg
6c4236dfb6
Fix hipMemset stream resolution
...
[ROCm/hip commit: 294bf50f68 ]
2018-04-11 19:01:53 +05:30