Sarbojit Sarkar
5bd8f8dbe6
SWDEV-341174 - fixed warning on cuda
...
Change-Id: I5d017bcc154a2985e23893c0d933438bf0fc1958
2022-06-27 04:49:53 -04:00
Sarbojit Sarkar
1074b22c48
SWDEV-333073 - HIP_INIT_API to handle no return case
...
Change-Id: I78d43e88cc196047a4cb5d6fcf756339c0e46e80
2022-06-24 12:53:18 -04:00
Sarbojit Sarkar
7f92a98a3c
SWDEV-331066 - Added hipDeviceSetLimit
...
Change-Id: I4de099387ad34ca7c195588274b546649038e2aa
2022-06-21 06:06:05 +00:00
Maneesh Gupta
2bd0cb523e
SWDEV-340918 - make hip_runtime.h work with g++
...
Change-Id: Ief3e6dc075837dfccae3defad7da696130ed05bd
2022-06-20 23:32:58 -04:00
jaypatel
8d464a2e83
SWDEV-322688 - Added hip math constants header.
...
Change-Id: I9363f82159a53c6822e943c4950fc7f6c350b326
2022-06-16 23:56:45 -04:00
Satyanvesh Dittakavi
f7d0850eb8
SWDEV-336532 - Convert Managed Memory in hipPointerGetAttributes on Nvidia platform
...
Change-Id: I3bc6180040b6ad48f06ffdc7d01ca6fd9fe32c0e
2022-06-16 00:43:15 -04:00
Jaydeep Patel
ab8583b4d5
SWDEV-338376 - Added hipMemGetAllocationGranularity, hipMemCreate and hipMemRelease for CUDA
...
Change-Id: If8c6a00958bb39752eb202c30a7960d13de225ae
2022-06-15 23:59:56 -04:00
sdashmiz
e8194dca76
SWDEV-325711 - add Graph memory APIs skeleton
...
- hipDeviceGet/SetGraphMemAttr
- hipDeviceGraphMemTrim
- there is no memory pool for graphs currently
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I11db76ea7ea1c7732175fc93264448052357e8dc
2022-06-15 14:47:16 -04:00
sdashmiz
13c875eaf0
SWDEV-321698 - remove Memcpy1D type
...
- Memcpy1D node type is not complying with cuda
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: If8113f5e699de0c62d98effc4580a2e0fee9a950
2022-06-13 08:31:02 -04:00
sdashmiz
efd2d55e9d
SWDEV-318833 - Get and Set attribute for kernel nodes
...
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Ie51aa4b56661cbb8c5b4eb1dbaad327377084ffb
2022-06-09 09:47:05 -04:00
Satyanvesh Dittakavi
8dfa7e7f98
SWDEV-334908 - Guard the __noinline__ macro as it is also supported as keyword by clang
...
Change-Id: I314b3ac463ea63132a5107f4f24ce769d8112b9b
2022-05-18 05:51:32 -04:00
Tao Sang
a19b767e8a
SWDEV-319818 - Release restriction on hipResourceViewDesc
...
Let more types support hipResourceViewDesc.
Add missing symbols to fix cuda build failing.
Change-Id: Ife694cc6491427093863252e257e820b0bb4fa0f
2022-05-10 15:57:31 -04:00
Joseph Greathouse
f68149eafd
SWDEV-332811 - Clean up and extend HIP unsafe atomic add
...
Update HIP's unsafeAtomicAdd to:
- Compile properly even when not compiling for gfx90a
- Fall back to safe atomic add on non-gfx90a architectures
- Use flat atomic add for FP64 on gfx90a, instead of dynamically
checking memory spaces.
In addition, when the compiler is passed -munsafe-fp-atomics, it
will define __AMDGCN_UNSAFE_FP_ATOMICS__. When this happens, the
compiler is requesting that the HIP headers force all HIP
atomicAdd() calls on floats or doubles to use their unsafe versions.
This patch thus causes unsafeAtomicAdd() calls when that define
is seen. This call to unsafeAtomicAdd() is also done for atomicSub(),
since that calls atomicAdd underneath. This is not done for
system-scope atomicAdd because, on gfx90a, system-scope atomic FP
add instructions would need to target fine-grained memory, which is
always unsafe.
This patch also creates safeAtomicAdd() functions for float and double.
These functions will create a standalone safe atomic, even when the
application is compiled with -munsafe-fp-atomics.
Finally, this patch adds wrappers in the Nvidia path of HIP so that
these HIP functions call through to atomicAdd there as well.
Change-Id: I8af0621d3d28ea30c9278bfeea7393d03bbdac6d
2022-04-29 07:19:58 -04:00
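The dispatch described in f68149eafd can be pictured with a small host-side sketch. This is plain C++ using std::atomic, not the HIP header code: safe_atomic_add, unsafe_atomic_add, atomic_add and atomic_sub are hypothetical stand-ins for safeAtomicAdd(), unsafeAtomicAdd(), atomicAdd() and atomicSub(), and the "unsafe" path only reuses the CAS loop because a host CPU has no equivalent hardware FP atomic. Only the macro name __AMDGCN_UNSAFE_FP_ATOMICS__ comes from the commit message.

```cpp
#include <atomic>

// Hypothetical stand-in for safeAtomicAdd(): a compare-and-swap loop that
// always works, whatever flags the application was compiled with.
inline float safe_atomic_add(std::atomic<float>& target, float value) {
    float observed = target.load(std::memory_order_relaxed);
    while (!target.compare_exchange_weak(observed, observed + value,
                                         std::memory_order_relaxed)) {
        // On failure, observed now holds the current value; retry.
    }
    return observed;  // previous value, as atomicAdd returns
}

// Hypothetical stand-in for unsafeAtomicAdd(). On a GPU this would be a
// hardware FP atomic; on the host this sketch just reuses the safe loop.
inline float unsafe_atomic_add(std::atomic<float>& target, float value) {
    return safe_atomic_add(target, value);
}

// Stand-in for atomicAdd(): picks the unsafe version only when the
// compiler defines __AMDGCN_UNSAFE_FP_ATOMICS__ (-munsafe-fp-atomics).
inline float atomic_add(std::atomic<float>& target, float value) {
#if defined(__AMDGCN_UNSAFE_FP_ATOMICS__)
    return unsafe_atomic_add(target, value);
#else
    return safe_atomic_add(target, value);
#endif
}

// Stand-in for atomicSub(): implemented on top of atomic_add, so it
// follows the same dispatch automatically.
inline float atomic_sub(std::atomic<float>& target, float value) {
    return atomic_add(target, -value);
}
```

Because atomic_sub forwards to atomic_add, the macro check lives in one place, which mirrors the commit's note that atomicSub() picks up the unsafe path through atomicAdd.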
Sarbojit Sarkar
fc1f02bbed
SWDEV-323441 - support for default stream per thread
...
Change-Id: I0032da0357f5cffbf5e4ec4a02435d2a128a262b
2022-04-18 23:19:55 -04:00
Christophe Paquot
cd447a5aaa
SWDEV-322620 - Virtual Memory Management APIs
...
Adding skeletons.
Implemented:
- hipMemGetAllocationGranularity with basic granularity
- hipMemGetAllocationPropertiesFromHandle
- hipMemCreate
Change-Id: I4bd2c75fae45d6a057efcad19d1d3c9715e9ae67
2022-04-08 11:38:51 -04:00
Rakesh Roy
5f0f7ffe92
SWDEV-326789 - Mark texture reference APIs as deprecated
...
Change-Id: I7a39cc1f036cd185ac3aab6c5f44730f16225f69
2022-04-05 00:04:34 -04:00
German Andryeyev
539d264eb0
SWDEV-311271 - Initial mempool implementation
...
HIP_MEM_POOL_SUPPORT controls memory pool support in runtime.
Currently it's disabled by default. The initial change doesn't
include: IPC, MGPU, virtual memory alloc, suballoc, defragmentation,
internal dependencies.
Change-Id: Ibed8528ebec698b045ebb247e49c0ecd6e587ed7
2022-03-31 09:21:37 -04:00
Rakesh Roy
1e77357c7f
SWDEV-326789 - Add Missing HIP API Functions For Nvidia Platform
...
Change-Id: I828028c5d893ca1487f33017a6902c3b7f786762
2022-03-25 01:34:57 -04:00
Satyanvesh Dittakavi
2fd90ec7a4
SWDEV-317716 - Add hipDeviceGetUuid API
...
Change-Id: I320c7bc11ddd7617e0246f6faf19135ad7363e73
2022-03-16 00:23:44 -04:00
Satyanvesh Dittakavi
3cc0a3fcaf
SWDEV-321656 - Address incorrect warp size with hiprtc
...
Change-Id: I97051d4e2cf27cb073c39663c62e43d5c19b1a12
2022-03-15 06:52:21 -04:00
Sarbojit Sarkar
2b86f24fda
SWDEV-323441 - New header for per-thread-default-stream
...
Change-Id: If74ea1464117abd2bfeadd543b94d26f9e934cbb
2022-03-14 01:13:01 -04:00
haoyuan2
bed3995147
SWDEV-316128 - HIP surface API support
...
align with CUDA to support surf1DLayeredread/write,
surf2DLayeredread/write, surfCubemapread/write functions.
Change-Id: Ie3f6ca75e23accd97cb35cdc8362d88a1e5dbd54
2022-03-09 21:19:48 -05:00
Satyanvesh Dittakavi
1524c9515e
SWDEV-315118 - Add version check for hipGraphInstantiateWithFlags on nvidia path
...
Change-Id: I672b4a5287348bddf6eb8955decc9cce786275a2
2022-03-09 02:10:08 -05:00
Shadi Dashmiz
1965dd5795
Revert "SWDEV-321698 - hipGraphNodeTypeMemcpy1D NVIDIA build failure"
...
This reverts commit 74beb4583c.
Reason for revert: hip changes not in yet
Change-Id: I89e35d171cf01fb82d361279998efcc90e54ef33
2022-03-08 15:08:31 -05:00
sdashmiz
74beb4583c
SWDEV-321698 - hipGraphNodeTypeMemcpy1D NVIDIA build failure
...
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I7aea11bf14adbeb7fea68eb862df74fcf6f13d75
2022-03-07 16:26:11 -05:00
Christophe Paquot
27f102b167
SWDEV-323472 - Add hipThreadExchangeStreamCaptureMode
...
Also fix an issue in hip_prof_gen.py with GLenum.
Change-Id: Ib03fe9dae87b850c53b7d8aeb78b8dbdd01f794c
2022-02-24 15:24:22 -05:00
Satyanvesh Dittakavi
f3f11f4c7d
EXSWCPHIPT-13 - Fix Segfault on Nvidia Platform
...
Change-Id: Ib85bb0b0e91b7703afc4858c4a5ada0115ad68f4
2022-02-19 12:53:46 +00:00
Satyanvesh Dittakavi
aee5c48848
SWDEV-315981 - make nvidia hiprtc functions static
...
Change-Id: I945426c155ed9ce21c308745f384bbf8669d73c4
2022-02-15 23:38:19 -05:00
Satyanvesh Dittakavi
e5496b3159
SWDEV-314485 - Remove the older implementation of device-side malloc/free
...
Change-Id: I9ba3c6030a6dd7a5e064073d5c21223041282f61
2022-02-15 07:49:56 -05:00
Satyanvesh Dittakavi
087c0c25b2
SWDEV-315118 - Add version checks for few graph APIs on Nvidia path
...
HIP apps running on CUDA 11.0 are failing due to some graph APIs/enums
added only in CUDA 11.1 or CUDA 11.3
Change-Id: I0d32b412cb76c42c7b3a9c612d750990f9e89908
2022-02-15 02:03:49 -05:00
Satyanvesh Dittakavi
f371300933
SWDEV-315017 - Add missing graph error strings
...
Change-Id: I11b5a623756c5bef88cbc93e49c124d0caf62bd1
2022-02-15 01:41:44 -05:00
Satyanvesh Dittakavi
9c31a738c2
SWDEV-317135 - Fix mapping of hipStreamUpdateCaptureDependencies on cuda path
...
Change-Id: Ibbcf8136fa527a20684a377311bc76182345d17b
2022-02-14 09:10:46 -05:00
Sourabh Betigeri
7d8c63cc34
SWDEV-294375 - Declares fns() functions static
...
Change-Id: Ife0150b8d6b9531522e61a82df6b88af3282f437
2022-02-13 23:35:46 -05:00
haoyuan2
d7beddbfee
SWDEV-316128 - HIP surface API support
...
Change-Id: I5b98f86168e64b36588f5d7efb82191d895e27d7
2022-02-11 11:07:04 -05:00
Satyanvesh Dittakavi
0759f4ff81
SWDEV-314661 - Add hipGraphInstantiateWithFlags API mapping in nvidia path
...
Change-Id: Ibbf54ef18978448e92571f0535124844154726b6
2022-02-09 00:02:55 -05:00
Sourabh Betigeri
523e5a007c
SWDEV-294375 - Adding coalesced_groups to ROCm CG
...
Change-Id: I4098e1970ae2c388171fa73deb5bc023ed1b7a68
2022-02-07 11:29:46 -05:00
Satyanvesh Dittakavi
760f275861
SWDEV-292714 - Added support for hipPointerGetAttribute and
...
hipDrvPointerGetAttributes
Change-Id: I44a0b5c5d9eb5a9a63406b11e8de83e5cb245a59
2022-02-02 05:33:07 -05:00
pghafari
44e0c246f8
SWDEV-245531 - GLInterop adding register image
...
Change-Id: I075bec49b7ad5f78336586b9d55e259717c4c000
2022-01-23 14:59:18 -05:00
Payam
5f9d779324
SWDEV-245531 - GLInterop adding register image
...
Change-Id: Ib16b647e2a1b8f3c578a9689d3987121c984250b
2022-01-14 15:57:24 -05:00
Nick Curtis
c8103470d5
Fix missing system-scope for CAS in atomic[Min|Max]_system
...
Change-Id: Ieb0ca79eaeba3a0ae56739246e08135a25a8d6dd
2022-01-12 22:53:11 -05:00
haoyuan2
0f94653447
SWDEV-315924 - align with CUDA to define vector3 data type
...
Change-Id: I7654fb757b4c387a762f384ccdd3f2c8555d149f
2022-01-06 17:35:56 -05:00
Tao Sang
78de8cd672
SWDEV-315590 - Fix texture vector issue
...
Apply mapFrom() to related texture device functions.
Change-Id: Ifed918c87373b784780b2180320ccb746845daff
2022-01-04 19:26:14 -05:00
Tao Sang
7de480760b
SWDEV-315590 - Fix texture fetch issue
...
Add mapFrom() to map the "float __attribute__((ext_vector_type(4)))"
output of the low-level device function to the correct vector type.
Change-Id: Ifed915c87273b784780b2180320ccb746845dafb
2022-01-04 13:38:57 -05:00
haoyuan2
8a8581edf5
SWDEV-316052 - fix HIP_Directed tests build error
...
Change-Id: I782c68859f77934f2cf672ee53378e51b1cb1ddd
2021-12-21 03:21:53 -05:00
haoyuan2
a8b30291c3
SWDEV-303956 - add make_int2/3/4 and make_float2/3/4 function support
...
Change-Id: I0260d1a9d2ac66e84dc31a80b0995616ef78b0fe
2021-12-15 14:57:36 -08:00
haoyuan2
937415141c
SWDEV-303956 - align with CUDA to define float4 data type
...
Change-Id: I9ad1315e5899a8c0cf1812302b27010d88015f88
2021-12-14 15:22:43 -05:00
Satyanvesh Dittakavi
f3c5f12d26
SWDEV-311744 - Update floating point max/min atomic implementation
...
Clang builtins don't yet support max/min atomics with float/double
parameters. Hence, modify atomicMax/Min to use a compare-and-swap
implementation.
Change-Id: Ia351f9b136129fe25f0eddcbbb17549553247182
2021-12-07 13:29:44 -05:00
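The compare-and-swap approach named in f3c5f12d26 can be illustrated with a minimal host-side sketch. It is plain C++ on std::atomic<float>, not the HIP device implementation, and atomic_max_float is a hypothetical name chosen for this example.

```cpp
#include <atomic>

// Hypothetical helper: atomically stores max(current, value) into target
// and returns the value that was there before the call.
inline float atomic_max_float(std::atomic<float>& target, float value) {
    float observed = target.load(std::memory_order_relaxed);
    // Stop as soon as the stored value is already >= value; otherwise try
    // to install value, retrying if another thread changed target first.
    while (observed < value &&
           !target.compare_exchange_weak(observed, value,
                                         std::memory_order_relaxed)) {
        // compare_exchange_weak refreshed observed with the current value.
    }
    return observed;
}
```

An atomic minimum follows the same pattern with the comparison reversed. The sketch does not treat NaN inputs specially.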
Tao Sang
5acc8b5102
SWDEV-303204 - Disable texture/image Apis on some devices
...
If the target device doesn't support images,
the compiler emits an "unavailable" error when device texture APIs
are called, and host texture APIs return hipErrorNotSupported.
Remove useless include/hip/amd_detail/texture_functions.h.
Add the hipDeviceAttributeImageSupport attribute to check image
support at runtime.
Add the __HIP_NO_IMAGE_SUPPORT(=1) macro in device code to indicate
that texture APIs are not supported.
Change-Id: Ia891493e681a61980a2444859a90e046a9ebea39
2021-12-04 08:14:08 -05:00
haoyuan2
69117705c8
SWDEV-309292 - add hipPos definition in CUDA path
...
Change-Id: Ia81d5db77d7403cafb583fda8083d0f4dbe33835
2021-12-04 04:40:34 -05:00
cjatin
5b08779b25
SWDEV-307407 - Add hiprtc's type traits function to match implementation of libcxx
...
Change-Id: I0a10eca329f817bbe31bdc47869b2de104d12d38
2021-12-02 23:32:23 -05:00