Commit-Graf

188 Incheckningar

Upphovsman SHA1 Meddelande Datum
Alex Voicu 0c16497abd dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.


[ROCm/hip commit: ea0fcf3e61]
2019-03-06 17:31:44 +05:30
Wen-Heng (Jack) Chung da589e38ed Address code review comments to use hipDeviceptr_t
[ROCm/hip commit: 5cbd28f29b]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung 0b7f38d100 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fills memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantic


[ROCm/hip commit: 7ebbbd3525]
2019-03-04 17:00:33 +00:00
Wilkin Chau 4a0d68ba3f Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.


[ROCm/hip commit: 8d92d1ebd7]
2019-02-28 22:42:46 +00:00
Evgeny 3f7ff3450e fixing HSA_INIT_API cid args
[ROCm/hip commit: 0164464bcc]
2019-01-16 23:45:44 -06:00
Maneesh Gupta 3cf96f31d0 Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
Fixed hipPointerGetAttributes for hostmalloced ptr

[ROCm/hip commit: 56ce3e37d5]
2018-12-12 10:16:07 +05:30
Maneesh Gupta 07dcdff9e5 Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
Added support for pinned 2D SDMA copy

[ROCm/hip commit: 0dd26b4f63]
2018-12-12 07:44:16 +05:30
Rahul Garg b304ff5210 Fixed hipPointerGetAttributes for hostmalloced ptr
[ROCm/hip commit: 5f12067708]
2018-12-08 01:42:08 +05:30
Maneesh Gupta 05e09614be Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113

[ROCm/hip commit: 160c509e23]
2018-11-23 11:08:25 +05:30
Maneesh Gupta e2bc3e49a5 Merge pull request #748 from mkuron/getsymboladdress
Implement hipGetSymbolAddress and hipGetSymbolSize

[ROCm/hip commit: bcea027bf1]
2018-11-21 10:32:01 +05:30
Michael Kuron c35dfb71d5 Merge branch 'master' into getsymboladdress
[ROCm/hip commit: 8610128c3e]
2018-11-20 12:03:22 +01:00
Rahul Garg 89efed29d7 Fix hipHostRegister
[ROCm/hip commit: 1a038879a9]
2018-11-17 05:38:35 +05:30
Evgeny 73e3c4ec42 renaming HIP_INIT_CB_API to HIP_INIT_API
[ROCm/hip commit: e5ba097afd]
2018-11-13 15:33:26 +00:00
Evgeny 0a58dc9b7b adding activity prof layer
[ROCm/hip commit: b8b1637ef7]
2018-11-13 15:33:26 +00:00
Rahul Garg ecea878072 Fixed hipMemcpyToSymbol doesn't work on GPU other than device 0 SWDEV-166881
[ROCm/hip commit: 11e7ab8879]
2018-11-13 00:49:20 +05:30
Michael Kuron f69866eecc Use correct trace macro in hipGetSymbolAddress/hipGetSymbolSize
[ROCm/hip commit: 6ebcc2922c]
2018-11-06 20:46:30 +01:00
Michael Kuron cbba8221ee Introduce ihipModuleGetGlobal
[ROCm/hip commit: 31acf1c268]
2018-11-06 09:54:34 +01:00
Michael Kuron bc455ccf50 Implement hipGetSymbolAddress and hipGetSymbolSize
[ROCm/hip commit: 73616582d6]
2018-11-04 10:39:34 +01:00
Siu Chi Chan 1159b4aa05 Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9


[ROCm/hip commit: 0ff408a56c]
2018-11-01 16:20:35 -04:00
Anton Gorenko f2ce51bdf5 Fix allocation size of arrays with multiple and/or non-32-bit channels
hipMallocArray and hipMalloc3DArray must use sum of bits
of all components.


[ROCm/hip commit: 21f044eac8]
2018-10-29 18:12:00 +06:00
Rahul Garg 6d53af5a60 Return hipSuccess when sizeBytes=0 in hipMemset
[ROCm/hip commit: 90f57d452a]
2018-09-26 12:47:36 +05:30
Rahul Garg 81074364c8 Added support for pinned 2D SDMA copy
[ROCm/hip commit: 1e57764378]
2018-07-31 14:05:35 +05:30
Sarunya Pumma a68ea730c2 Remove device mapping from shareWithAll memory
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device.  Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices.  Thus, it is skipped.

Note that earlier host memory is always mapped to device 0 and HIP
always performs peer checking for all kinds of hipMemcpy.  Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabs device 0's context lock.  Therefore, if there is
another thread holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy will have to wait for the
lock until it can actually perform memcpy.  This can significantly
deteriorate execution performance.

Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com>


[ROCm/hip commit: 8111fd3b8b]
2018-07-28 23:15:16 -07:00
Rahul Garg c957c42c20 Revert "Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync"
[ROCm/hip commit: 7cd1d5e644]
2018-07-02 14:32:11 +05:30
Rahul Garg 388679efc8 TEMP- fix memcpy2dAsync for trsm issue
[ROCm/hip commit: cd23905897]
2018-06-15 16:08:29 +05:30
Rahul Garg 312999de41 Fix stream resolution in memcpy2dasync
[ROCm/hip commit: 069e2c34c9]
2018-06-14 11:58:56 +05:30
Rahul Garg 1d6396dfb9 Fix retrieved locked ptr offset
[ROCm/hip commit: 00f8a36bc7]
2018-06-13 23:10:05 +05:30
Maneesh Gupta ac027e4092 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path

[ROCm/hip commit: 9e9c039ee4]
2018-06-06 14:44:02 +05:30
Rahul Garg e7bc68d347 Fix hipMemcpy3D for fast path
[ROCm/hip commit: a46ff2afd5]
2018-06-05 18:54:33 +05:30
Rahul Garg 6592b35c39 Fix depth value for 3D allocations
[ROCm/hip commit: 276c948a16]
2018-06-04 18:00:22 +05:30
Rahul Garg 46e623fb31 Fix memcpy2D for malloc+ hostRegister
[ROCm/hip commit: 8d6357669d]
2018-05-31 13:14:27 +05:30
Rahul Garg 4021f68f64 Use 64x4 grid dims
[ROCm/hip commit: d8cb47242b]
2018-05-24 23:51:52 +05:30
Rahul Garg 35169c5191 Clean up and fix remaining bytes copy
[ROCm/hip commit: 4ff059d641]
2018-05-24 23:30:27 +05:30
Rahul Garg fb745baa7e Fix memcpy2d kernel dims
[ROCm/hip commit: 981e56a68f]
2018-05-24 17:00:12 +05:30
Rahul Garg fb1425959e Correct remaining bytes in copy 2d kernel
[ROCm/hip commit: dc179e0c33]
2018-05-24 08:27:24 +05:30
Rahul Garg 08f750571d Optimize memcpy2D kernel use
[ROCm/hip commit: 9a76d5b94c]
2018-05-23 14:43:47 +05:30
Maneesh Gupta 06db862856 Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel

[ROCm/hip commit: 323a6226b0]
2018-05-22 10:42:28 +05:30
Rahul Garg f02803c527 Fixed memcpy2D for pinned memory case using 2D kernel
[ROCm/hip commit: f47a8236d7]
2018-05-21 22:14:45 +05:30
Maneesh Gupta 182f8ff28f hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.

[ROCm/hip commit: 0180a82963]
2018-05-21 15:38:44 +05:30
Rahul Garg 14030c3f17 Fix for memcpy2DAsync for pinned host memory case
[ROCm/hip commit: afe62e7030]
2018-05-18 21:09:50 +05:30
Maneesh Gupta 3d1d7ccf30 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D

[ROCm/hip commit: 03ac8e6a92]
2018-05-18 14:54:15 +05:30
Rahul Garg 4c44cd4a88 Fixed hipMemcpy2D to handle 1D memcpy case
[ROCm/hip commit: 8f010ac68e]
2018-05-16 11:07:10 +05:30
Rahul Garg e2a2b5bdcf Added hipMemset3D
[ROCm/hip commit: da302c3e93]
2018-05-07 10:24:30 +05:30
Lakhan Singh 51dbf4f5ca Null checks added for hipmallocpitch and hipmemcpy apis
[ROCm/hip commit: 6411ca1f6d]
2018-05-03 09:27:50 +05:30
Rahul Garg ab1dabe61b Fix texture 3D for HIP/NVCC
[ROCm/hip commit: 9de5f23d54]
2018-05-02 11:56:37 +05:30
Lakhan Singh 701de3092b SWDEV-141024
[ROCm/hip commit: 1c2509dc04]
2018-04-20 17:40:00 +05:30
Rahul Garg e1e88f3bff Added hipMemset2DAsync support
[ROCm/hip commit: 3cfb9c0d40]
2018-04-17 18:27:27 +05:30
Rahul Garg 89511823f0 Correct missed ihipMemsetCopyDataType change
[ROCm/hip commit: 16c89d101a]
2018-04-12 10:27:19 +05:30
Rahul Garg abe14442a5 Changed ihipMemsetCopyDataType to ihipMemsetDataType
[ROCm/hip commit: 3d6eb75828]
2018-04-12 09:29:22 +05:30
Rahul Garg 6c4236dfb6 Fix hipMemset stream resolution
[ROCm/hip commit: 294bf50f68]
2018-04-11 19:01:53 +05:30