Aryan Salmanpour
d9a335bccf
Add support for hipExtStreamGetCUMask API
...
Change-Id: I0fa67ad581dd75556f17c7410af2c1d5cb6ae99a
2020-11-20 13:41:38 -05:00
Vladislav Sytchenko
fb04bdc4f2
Report maxTexture1DLinear property
...
Change-Id: If562bbee80d8b98712806cb8258b5a7482d33bf6
2020-11-20 10:18:20 -05:00
Julia Jiang
3ccce6a642
SWDEV-249486-re-organize doxygen HIP API
...
Change-Id: I3d107de526e428ae16c6dffd97d9388bf54a6358
2020-11-11 13:31:56 -05:00
Sourabh Betigeri
81ca27783b
SWDEV-255694: Include attributes - hipFuncAttributePreferredSharedMemoryCarveout and hipFuncAttributeMaxDynamicSharedMemorySize
...
Change-Id: I3569859e6db0c59a10a76c8ea356d4bc0a0bb906
2020-11-02 19:39:39 -05:00
German Andryeyev
2bf7282bd5
Update documentation
...
Add managed memory section for the doxygen documentation.
No code changes.
Change-Id: I2e9db4ec13422159e265193916f0a3cdb721732d
2020-10-23 18:28:43 -04:00
Aaron En Ye Shi
2005c3437e
Fix header for OpenMP AMDGCN support
...
Add ifndef for __OPENMP_AMDGCN__ target, which shares
these header files with HIP.
Change-Id: I720693cfbcfb7836cd2f361d48530fb55ee3557a
2020-10-20 14:04:48 -04:00
Aaron En Ye Shi
20999c719d
Fix header invalid sqrt fwd decl and ifdef
...
The __ocml_sqrt_* functions only accept a single arg,
not two args. Also, change ifdef for if because the flag
__HCC_OR_HIP_CLANG__ is always defined.
Change-Id: Id9dc4eaf0f25df1df27a1bf643cb545ac23653f8
2020-10-19 18:01:14 +00:00
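
The `#ifdef`-to-`#if` change in 20999c719d rests on a general preprocessor rule: `#ifdef` tests only whether a macro is defined, while `#if` tests its value. A minimal sketch of the difference follows; `FEATURE_FLAG` and the two constants are hypothetical names for illustration and are not taken from the HIP headers.

```cpp
#include <cassert>

// Hypothetical flag: always defined, but its value may be 0 or 1.
#define FEATURE_FLAG 0

// #ifdef only asks "is the macro defined?", so this branch is taken
// even though the value is 0.
#ifdef FEATURE_FLAG
constexpr bool seen_by_ifdef = true;
#else
constexpr bool seen_by_ifdef = false;
#endif

// #if evaluates the value, so a flag defined as 0 disables the branch.
#if FEATURE_FLAG
constexpr bool seen_by_if = true;
#else
constexpr bool seen_by_if = false;
#endif

// The two tests disagree for a flag that is defined as 0.
static_assert(seen_by_ifdef && !seen_by_if, "ifdef and if differ for a zero-valued flag");
```

For a flag that is always defined, `#ifdef` can never select the `#else` branch, which is why testing the value with `#if` is the safer choice.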
Julia Jiang
4e7f420722
SWDEV-249486-Update HIP doxygen
...
Change-Id: Id32772b8f4d495e0f1bd3cb5cd1ec211a06298c3
2020-10-14 13:59:58 -04:00
Sarbojit Sarkar
86b1600ca7
SWDEV-255975: fix for gcc + hip_fp16.h compilation error
...
Change-Id: Ia152bad4af71a5b53a16e2f9fd9aeeacc61b39ac
2020-10-14 06:57:25 -04:00
German Andryeyev
d9f655f50e
Report managed memory capabilities
...
Change-Id: I241a42e26ee49fb47dcb3ba87c4bd8fb691dffdf
2020-10-13 16:09:43 -04:00
Julia Jiang
c2a3d19acb
SWDEV-249487-remove two HCC related APIs
...
Change-Id: Icf90ece0808643e0c5f0cc00b6644d5171b553d8
2020-10-13 11:08:37 -04:00
agodavar
e0be2ca8dd
SWDEV-255853: CUDA device P2P attributes
...
Change-Id: I7deffd04101b565f68a72a179f9faeaa22cab97d
2020-10-13 05:32:29 -04:00
agodavar
d96e810ccd
SWDEV-254909:Support for missing texture/array cuda driver types
...
Change-Id: I9c769f6335ec3e7feacaa8929719c666bff91934
2020-10-13 05:30:31 -04:00
Sameer Sahasrabuddhe
c36c0bad22
Do not inline the body of __assert_fail()
...
The device side assertion calls printf to write out a message. In the
device compiler, printf is expanded into a series of hostcalls that
transmit the printf payload to the host. This expansion increases the
length of the kernel, resulting in sub-optimal compilation. The
solution is to ensure that the assert() implementation is not inlined
into the kernel.
Change-Id: Ia3a075461a755cf007218f262b0863e1926c76aa
2020-10-10 01:03:04 -04:00
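
The idea behind c36c0bad22 can be sketched on the host side: keep the bulky failure path in a function marked `noinline`, so each call site contains only a test and a call. This is a sketch under assumptions, not the HIP implementation; `report_failure`, `CHECK` and `failure_count` are hypothetical names, and `__attribute__((noinline))` is the GCC/Clang spelling.

```cpp
#include <cassert>
#include <cstdio>

// Counts failed checks so the behaviour can be observed without aborting.
static int failure_count = 0;

// Hypothetical stand-in for the failure handler. The noinline attribute
// keeps this body (including the printing code) out of every call site.
__attribute__((noinline)) void report_failure(const char* expr, const char* file, int line) {
    std::fprintf(stderr, "%s:%d: check `%s` failed\n", file, line, expr);
    ++failure_count;
}

// Each use expands to a cheap test plus a call, not a copy of the handler.
#define CHECK(cond) ((cond) ? (void)0 : report_failure(#cond, __FILE__, __LINE__))
```

A passing `CHECK(1 + 1 == 2)` leaves `failure_count` unchanged, and a failing one increments it through the single out-of-line handler.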
Yaxun (Sam) Liu
dc20086cbb
Disable wave64 support on GFX10 for HIP
...
Change-Id: Ie3d4fe91516167ea26ce704689b6b6f1951a7521
2020-10-09 17:02:56 -04:00
Yaxun (Sam) Liu
c7ac5531d3
Fix warpSize for gfx10
...
Change-Id: I8029ebeb91db1efa8e166ad349aaae6364e8069d
2020-10-05 20:45:07 -04:00
Vladislav Sytchenko
ad2d55c144
Revert "Revert "Merge branch 'amd-master-next' into amd-npi-next""
...
This reverts commit 374ead1d19.
Reason for revert: <INSERT REASONING HERE>
Change-Id: I92ceb171e31026ed1864704cef2fc1497b883ef9
2020-10-05 13:20:58 -04:00
Vladislav Sytchenko
374ead1d19
Revert "Merge branch 'amd-master-next' into amd-npi-next"
...
This reverts commit 73558e3363.
Reason for revert: <INSERT REASONING HERE>
Change-Id: I53322718dadde2c98f96140b8e260ec7ee9ef721
2020-10-05 13:02:39 -04:00
Aryan Salmanpour
3c72e7beea
rename abort in nvcc path
...
[background] There is currently a compilation issue when both hip/hip_runtime.h and hip/hip_cooperative_groups.h are included
in a file and hipcc is used to compile it on the NV platform. The issue is that abort is defined in hip/nvcc_detail/hip_runtime.h
and also in the CUDA cooperative groups header (/cuda/include/cooperative_groups/details/helpers.h).
The duplicate definition breaks compilation with hipcc on the NV platform.
Change-Id: I2ab6982ac4103822a1a4a0ced942cd604d6c19c1
2020-10-01 14:28:51 -04:00
mshivama
b8640f1f0d
Add missing definitions for class static members
...
The static members __HIP_Coordinates::x, ::y and ::z must be defined
outside the class. Otherwise, the linker reports an `undefined reference`
error when these definitions are needed in the HIP application.
Change-Id: Iabc09744b478c22e4b13cf9824877ec9cfdd4f7a
2020-09-30 03:38:24 -04:00
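
The rule behind b8640f1f0d can be shown with a small stand-alone sketch. The `Coordinates` struct and the helper below are hypothetical and much simpler than `__HIP_Coordinates`; they only show that an in-class initializer on a static data member is a declaration, and that binding a reference to the member (an odr-use) needs a definition at namespace scope.

```cpp
#include <cassert>

// Hypothetical type for illustration; not the HIP __HIP_Coordinates class.
struct Coordinates {
    static const int x = 3;  // declaration with initializer, not a definition
    static const int y = 5;
};

// Out-of-class definitions. Without these, odr-using the members
// (for example binding them to a reference) can fail at link time.
const int Coordinates::x;
const int Coordinates::y;

// Taking the argument by reference odr-uses whatever is passed in.
int read_through_reference(const int& v) { return v; }
```

Calling `read_through_reference(Coordinates::x)` links only because the definitions exist outside the class.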
Sarbojit Sarkar
bf20337fc1
SWDEV-253247: add ulong and ulonglong versions of __shfl*
...
Change-Id: I40ab6cfa12175f334e8392b71f567054d8256e2a
2020-09-30 01:58:22 -04:00
mshivama
4b6d92798f
Layout the cooperative group related headers to target Nvidia
...
Change-Id: I530ffc72d6fae1e273b5322d508629221e11ea6f
2020-09-29 21:32:24 -04:00
Sarbojit Sarkar
0be1f7843a
SWDEV-252337: Fix for V100 compilation issue
...
Change-Id: Ib57ff3ac28f0a581f395f32181974e64eca6d858
2020-09-25 21:55:22 -04:00
agodavar
288fbe5af3
SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
...
Change-Id: Iec989787e546ff2eb68c4b146dc540655698b569
2020-09-24 12:18:06 -04:00
Aryan Salmanpour
5831687985
use only hip compiler to compile hip cooperative groups
...
SWDEV-252377
Change-Id: I6cf749dad2bf05c2e49fed495ab2833c574c2215
2020-09-22 14:34:05 -04:00
Aryan Salmanpour
ddceeac8eb
Fix a warning with cooperative groups header
...
SWDEV-252413
Change-Id: Ia098464b670f26416b79cc67805b8accb6bec903
2020-09-18 13:38:47 -04:00
Rahul Garg
188278b38a
grid size optimization
...
Change-Id: I68b197548b961e1c0aa63119eecc8d9a25350cf2
2020-09-10 15:10:55 -04:00
agodavar
9e2fa6e215
SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
...
Change-Id: Ibcb365ce2ff27c4c2379609964078da42e1226b1
2020-09-09 13:44:13 -04:00
Siu Chi Chan
bf0d8f37e4
fix no matching push_macro warnings
...
Change-Id: I352fe1170cfc9e8de5307536a21c86f0b483c68e
2020-09-09 11:41:07 -04:00
Jatin Chaudhary
fef8df6e2e
Adding dim3 conversion operator to __hip_builtin_threadIdx_t
...
Change-Id: Ib7f9e45872ec6d48c0e5b7fc93ccecaafaeac754
2020-08-31 21:30:14 -04:00
mshivama
15aff89ba1
Support public apis for cooperative group types.
...
Change-Id: I01346afde485e82c34b7868b9241b34c542d0cf9
2020-08-25 10:50:38 +05:30
Todd tiantuo Li
99eb486937
SWDEV-240803 - add hipFuncSetSharedMemConfig
...
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Jatin Chaudhary
1340b3f07f
Adding Anyorder flag to HIP
...
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Todd tiantuo Li
9dfe15a843
SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
...
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
Jatin Chaudhary
1e886f432d
Cuda 11 changes: https://docs.nvidia.com/cuda/cuda-runtime-api/structcudaPointerAttributes.html\#structcudaPointerAttributes
...
Change-Id: I8a5389b88df286043c365a734983a4c5de352102
2020-08-07 05:33:33 -04:00
kjayapra-amd
4d42a969fd
SWDEV-240800 - Adding initial support for hipDeviceGetP2PAttribute. Some attr support still pending.
...
Change-Id: I0611aed136270db497dfa374144f6f5e35352a8f
2020-07-22 14:54:34 -04:00
mshivama
27e237ef48
Add support for a few CUDA-compatible reduction operations like __syncthreads_count()
...
Change-Id: I0917820058f0edad5c0bdd42dea37f06353fc63c
2020-07-14 12:03:14 -04:00
Yaxun (Sam) Liu
13316e2919
Add pow(float/double/_Float16,int)
...
Change-Id: Ie65d15cd3df9853a3bbd613d8c7188ae39c327c7
2020-07-06 07:38:57 -04:00
Ronak Chauhan
affe9ab9b5
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
This is different from 961717879d.
We try to accommodate the case when a kernel template has multiple
type parameters.
Change-Id: I87577d402c92b0f3b51e298f8293f4065e1f6de8
2020-06-30 10:44:55 -04:00
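
The variadic-macro approach described in affe9ab9b5 can be sketched without the HIP runtime. All names below (`LAUNCH`, `launch_internal`, `kernel_stub`, `TWO_ARGS`) are hypothetical stand-ins, not the actual `hipLaunchKernelGGL` definition. The sketch shows two things: a template-id with several type parameters contains a comma, which a fixed-arity macro would treat as an argument separator, and macro arguments are expanded before they are forwarded.

```cpp
#include <cassert>

// Hypothetical "kernel" with two type parameters; the template-id
// kernel_stub<int, float> contains a comma.
template <typename A, typename B>
int kernel_stub(int a, int b) { return a + b; }

// Hypothetical internal launcher: a plain function template that
// simply calls what it is given.
template <typename F, typename... Args>
int launch_internal(F fn, Args... args) { return fn(args...); }

// Variadic wrapper: the preprocessor splits the arguments at every comma,
// expands any macros among them, and __VA_ARGS__ rejoins them with commas.
#define LAUNCH(...) launch_internal(__VA_ARGS__)

// A macro that itself expands to several arguments.
#define TWO_ARGS 2, 3
```

With these definitions, `LAUNCH(kernel_stub<int, float>, 2, 3)` and `LAUNCH(kernel_stub<int, float>, TWO_ARGS)` both expand to `launch_internal(kernel_stub<int, float>, 2, 3)`.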
Daniil Fukalov
63e44d16a3
Add __attribute__((const)) to grid related functions declarations
...
This is a cherry-pick of Daniil Fukalov's PR https://github.com/ROCm-Developer-Tools/HIP/pull/2110
which has been committed to master branch.
Make declarations consistent with https://github.com/RadeonOpenCompute/ROCm-Device-Libs/blob/amd-stg-open/ockl/src/workitem.cl
Without the attribute these functions don't have the "readnone" LLVM IR attribute. Without it some optimizations fail, e.g. Loop Invariant Code Motion doesn't hoist these calls out of a loop.
Change-Id: Idb599570d142152cc4f6a3c8986384ad7f0c4729
2020-06-29 13:33:18 -04:00
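
The effect described in 63e44d16a3 can be illustrated with a host-side sketch. `block_size_stub` and `sum_over_loop` are hypothetical names, not the grid functions from the HIP headers. `__attribute__((const))` is a GCC/Clang attribute promising that the result depends only on the arguments, which lets the optimizer treat repeated calls with the same argument as one value and hoist the call out of a loop. Whether hoisting actually happens depends on the compiler and optimization level; the computed result is the same either way.

```cpp
#include <cassert>

// Hypothetical stand-in for a grid query. The attribute on the declaration
// tells the compiler the function reads no memory and has no side effects.
__attribute__((const)) int block_size_stub(int dim);

int block_size_stub(int dim) { return 64 * (dim + 1); }

// The call is loop-invariant: with the const attribute the optimizer is
// allowed to evaluate it once before the loop instead of every iteration.
int sum_over_loop(int dim, int iterations) {
    int total = 0;
    for (int i = 0; i < iterations; ++i) {
        total += block_size_stub(dim);
    }
    return total;
}
```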
Ronak Nilesh Chauhan
b7101af203
Revert "Support passing macros to hipLaunchKernelGGL"
...
This reverts commit 961717879d.
Reason for revert: This patch breaks ROCPrim tests
Change-Id: Ib2235f719861c9f4317c33e86b6c1f8bc669cfd4
2020-06-24 04:28:46 -04:00
Michael LIAO
cea6b99a28
[hip] Disable assert workaround for HCC on HIP-Clang.
...
- HIP-Clang follows the standard assert definition by providing
`__assert_fail`. The `assert` macro, however, was added as an HCC-specific
workaround for the missing implementation. Enable it only for
HCC compilation to avoid unexpected behavior in HIP-Clang
compilation.
Change-Id: I1c9a707baff9b85c30faef58c52ebfe07e3fc3fc
2020-06-22 10:33:56 -04:00
Ronak Chauhan
961717879d
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
Change-Id: Id76e2bf91acd5d68f56a24fc39f219f2eeb06d33
2020-06-22 04:35:29 -04:00
Tao Sang
63051ca2e1
Support numa policy set by user
...
Add hipHostMallocNumaUser flag to hipHostMalloc() in order to support
numa policy set by user.
Change-Id: I6d70ed539a5f97f27187f2242b68849c0e27e4d6
2020-06-19 21:23:58 -04:00
Yaxun (Sam) Liu
b907505d55
Fix missing ldexp(float,int)
...
Change-Id: I2c1553407dfc26948d3ab7aa532eef42a0f6b204
2020-06-18 15:16:59 -04:00
Jason Tang
38cd2b96c7
Add asicRevision
...
Change-Id: I59f3ad20b9bdadf77bd1e0725f7a401d7ad423a3
2020-06-16 17:54:20 -04:00
German Andryeyev
f4211c3905
Initial support for HIP managed memory
...
- Call the new ROCclr interfaces for HMM
Change-Id: I2cd1bf438f712a9e9e328340e7d0c025257ca6c1
2020-06-15 18:10:41 -04:00
Rahul Garg
00301b1665
Add back __mbcnt_lo and __mbcnt_hi
...
Change-Id: Ic3facba2e2245461515799f6a17842da0f5d9933
2020-06-11 21:21:36 -04:00
Dittakavi Satyanvesh
6ed1868203
SWDEV-236670 Address Eigen unit test failure by adding __host__ attribute to half2 functions
...
Change-Id: Ifdc852c30a1b3704871e0ee58cb7a55d3d37fc6e
2020-06-10 03:01:42 -04:00
Yaxun (Sam) Liu
087c579625
Fix include path and wrapper header
...
Currently std::complex and some other std functions require users to
include hip_runtime.h before any other headers to work, which is not
reliable.
Changes are made in clang to fix this issue:
https://reviews.llvm.org/D81176
which requires hipcc and HIP headers to make corresponding changes.
This patch will make sure the clang change will not break
HIP/ROCclr during this transition.
After the transition is done, we can remove explicitly setting
include path for HIP-Clang and HIP header in hipcc and hip config
cmake files and rely on clang driver to set it automatically.
Change-Id: I5d226861c2560ffa6c5ab17343a43cc378048061
2020-06-09 17:37:20 -04:00