Yaxun (Sam) Liu
c7ac5531d3
Fix warpSize for gfx10
...
Change-Id: I8029ebeb91db1efa8e166ad349aaae6364e8069d
2020-10-05 20:45:07 -04:00
Vladislav Sytchenko
ad2d55c144
Revert "Revert "Merge branch 'amd-master-next' into amd-npi-next""
...
This reverts commit 374ead1d19 .
Reason for revert: <INSERT REASONING HERE>
Change-Id: I92ceb171e31026ed1864704cef2fc1497b883ef9
2020-10-05 13:20:58 -04:00
Vladislav Sytchenko
374ead1d19
Revert "Merge branch 'amd-master-next' into amd-npi-next"
...
This reverts commit 73558e3363 .
Reason for revert: <INSERT REASONING HERE>
Change-Id: I53322718dadde2c98f96140b8e260ec7ee9ef721
2020-10-05 13:02:39 -04:00
Aryan Salmanpour
3c72e7beea
rename abort in nvcc path
...
[background] There is currently a compilation issue when both hip/hip_runtime.h and hip/hip_cooperative_groups.h included
in a file and hipcc used to compile it on NV platform. The issue is that an abort is defined in hip/nvcc_detail/hip_runtime.h
and it is also defined in the CUDA cooperative groups header (/cuda/include/cooperative_groups/details/helpers.h).
this is problematic and leads to a compilation issue in hipcc on NV platform.
Change-Id: I2ab6982ac4103822a1a4a0ced942cd604d6c19c1
2020-10-01 14:28:51 -04:00
mshivama
b8640f1f0d
Add missing definitions for class static members
...
The static members __HIP_Coordinates::x, ::y and ::z must be defined
outside the class. Otherwise, linker throws `undefined reference error`
when these definitions are needed in the HIP application.
Change-Id: Iabc09744b478c22e4b13cf9824877ec9cfdd4f7a
2020-09-30 03:38:24 -04:00
Sarbojit Sarkar
bf20337fc1
SWDEV-253247: add ulong and ulonglong version of__shfl*
...
Change-Id: I40ab6cfa12175f334e8392b71f567054d8256e2a
2020-09-30 01:58:22 -04:00
mshivama
4b6d92798f
Layout the cooperative group related headers to target Nvidia
...
Change-Id: I530ffc72d6fae1e273b5322d508629221e11ea6f
2020-09-29 21:32:24 -04:00
Sarbojit Sarkar
0be1f7843a
SWDEV-252337: Fix for V100 compilation issue
...
Change-Id: Ib57ff3ac28f0a581f395f32181974e64eca6d858
2020-09-25 21:55:22 -04:00
agodavar
288fbe5af3
SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
...
Change-Id: Iec989787e546ff2eb68c4b146dc540655698b569
2020-09-24 12:18:06 -04:00
Aryan Salmanpour
5831687985
use only hip compiler to compile hip cooperative groups
...
SWDEV-252377
Change-Id: I6cf749dad2bf05c2e49fed495ab2833c574c2215
2020-09-22 14:34:05 -04:00
Aryan Salmanpour
ddceeac8eb
Fix a warning with cooperative groups header
...
SWDEV-252413
Change-Id: Ia098464b670f26416b79cc67805b8accb6bec903
2020-09-18 13:38:47 -04:00
Rahul Garg
188278b38a
grid size optimization
...
Change-Id: I68b197548b961e1c0aa63119eecc8d9a25350cf2
2020-09-10 15:10:55 -04:00
agodavar
9e2fa6e215
SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
...
Change-Id: Ibcb365ce2ff27c4c2379609964078da42e1226b1
2020-09-09 13:44:13 -04:00
Siu Chi Chan
bf0d8f37e4
fix no matching push_macro warmnings
...
Change-Id: I352fe1170cfc9e8de5307536a21c86f0b483c68e
2020-09-09 11:41:07 -04:00
Jatin Chaudhary
fef8df6e2e
Adding dim3 conversion operator to __hip_builtin_threadIdx_t
...
Change-Id: Ib7f9e45872ec6d48c0e5b7fc93ccecaafaeac754
2020-08-31 21:30:14 -04:00
mshivama
15aff89ba1
Support public apis for cooperative group types.
...
Change-Id: I01346afde485e82c34b7868b9241b34c542d0cf9
2020-08-25 10:50:38 +05:30
Todd tiantuo Li
99eb486937
SWDEV-240803 - add hipFuncSetSharedMemConfig
...
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Jatin Chaudhary
1340b3f07f
Adding Anyorder flag to HIP
...
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Todd tiantuo Li
9dfe15a843
SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
...
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
Jatin Chaudhary
1e886f432d
Cuda 11 changes: https://docs.nvidia.com/cuda/cuda-runtime-api/structcudaPointerAttributes.html\#structcudaPointerAttributes
...
Change-Id: I8a5389b88df286043c365a734983a4c5de352102
2020-08-07 05:33:33 -04:00
kjayapra-amd
4d42a969fd
SWDEV-240800 - Adding initial support for hipDeviceGetP2PAttribute. Some attr support still pending.
...
Change-Id: I0611aed136270db497dfa374144f6f5e35352a8f
2020-07-22 14:54:34 -04:00
mshivama
27e237ef48
Add support for few CUDA compatible reduction operations like __syncthreads_count()
...
Change-Id: I0917820058f0edad5c0bdd42dea37f06353fc63c
2020-07-14 12:03:14 -04:00
Yaxun (Sam) Liu
13316e2919
Add pow(float/double/_Float16,int)
...
Change-Id: Ie65d15cd3df9853a3bbd613d8c7188ae39c327c7
2020-07-06 07:38:57 -04:00
Ronak Chauhan
affe9ab9b5
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
This is different from 961717879d .
We try to accomodate the case when a kernel template has multiple
type parameters.
Change-Id: I87577d402c92b0f3b51e298f8293f4065e1f6de8
2020-06-30 10:44:55 -04:00
Daniil Fukalov
63e44d16a3
Add __attribute__((const)) to grid related functions declarations
...
This is cherrypick of Daniil Fukalov's PR https://github.com/ROCm-Developer-Tools/HIP/pull/2110
which has been committed to master branch.
Make declarations consistent with https://github.com/RadeonOpenCompute/ROCm-Device-Libs/blob/amd-stg-open/ockl/src/workitem.cl
Without the attribute these functions don't have "readnone" LLVM IR attribute. Without it some optimizations fails, e.g. Loop Invariant Code Motion doesn't hoist these calls out of a loop.
Change-Id: Idb599570d142152cc4f6a3c8986384ad7f0c4729
2020-06-29 13:33:18 -04:00
Ronak Nilesh Chauhan
b7101af203
Revert "Support passing macros to hipLaunchKernelGGL"
...
This reverts commit 961717879d .
Reason for revert: This patch breaks ROCPrim tests
Change-Id: Ib2235f719861c9f4317c33e86b6c1f8bc669cfd4
2020-06-24 04:28:46 -04:00
Michael LIAO
cea6b99a28
[hip] Disable assert workaround for HCC on HIP-Clang.
...
- HIP-Clang follows the standard assert definition by providing
`__assert_fail`. But, `assert` macro is added as an HCC-specific
workaround due to the missing implementation. Only enable that on the
HCC compilation to avoid unexpected behaviors on HIP-Clang
compilation.
Change-Id: I1c9a707baff9b85c30faef58c52ebfe07e3fc3fc
2020-06-22 10:33:56 -04:00
Ronak Chauhan
961717879d
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
Change-Id: Id76e2bf91acd5d68f56a24fc39f219f2eeb06d33
2020-06-22 04:35:29 -04:00
Tao Sang
63051ca2e1
Support numa policy set by user
...
Add hipHostMallocNumaUser flag to hipHostMalloc() in order to support
numa policy set by user.
Change-Id: I6d70ed539a5f97f27187f2242b68849c0e27e4d6
2020-06-19 21:23:58 -04:00
Yaxun (Sam) Liu
b907505d55
Fix missing ldexp(float,int)
...
Change-Id: I2c1553407dfc26948d3ab7aa532eef42a0f6b204
2020-06-18 15:16:59 -04:00
Jason Tang
38cd2b96c7
Add asicRevision
...
Change-Id: I59f3ad20b9bdadf77bd1e0725f7a401d7ad423a3
2020-06-16 17:54:20 -04:00
German Andryeyev
f4211c3905
Initial support for HIP managed memory
...
- Call the new ROCclr interfaces for HMM
Change-Id: I2cd1bf438f712a9e9e328340e7d0c025257ca6c1
2020-06-15 18:10:41 -04:00
Rahul Garg
00301b1665
Addback __mbcnt_lo and __mbcnt_hi
...
Change-Id: Ic3facba2e2245461515799f6a17842da0f5d9933
2020-06-11 21:21:36 -04:00
Dittakavi Satyanvesh
6ed1868203
SWDEV-236670 Address Eigen unit test failure by adding __host__ attribute to half2 functions
...
Change-Id: Ifdc852c30a1b3704871e0ee58cb7a55d3d37fc6e
2020-06-10 03:01:42 -04:00
Yaxun (Sam) Liu
087c579625
Fix include path and wrapper header
...
Currently std::complex and some other std functions require uses to
include hip_runtime.h before any other headers to work, which is not
reliable.
changes are made in clang to fix this issue:
https://reviews.llvm.org/D81176
which requires hipcc and HIP headers to make corresponding changes.
This patch will make sure the clang change will not break
HIP/ROCclr during this transition.
After the transition is done, we can remove explicitly setting
include path for HIP-Clang and HIP header in hipcc and hip config
cmake files and rely on clang driver to set it automatically.
Change-Id: I5d226861c2560ffa6c5ab17343a43cc378048061
2020-06-09 17:37:20 -04:00
Jason Tang
1c0d737e1f
SWDEV-227909 - Add gcnArchName
...
Change-Id: Iea6d16b5d693dd0d900fa424d7a321c39315430e
2020-06-05 15:33:55 -04:00
Siu Chi Chan
784ca6f43c
add constexpr constructor for vector types
...
Change-Id: I45bb0537d6a24ee50b548c2fd8b4f20518764813
2020-06-04 01:57:03 -04:00
Evgeny
cad3f805c0
adding hipGetStreamDeviceId() profiling API
...
Change-Id: I5ccf88ddac123260d7c17defefcf20ff3b2504e2
2020-06-03 18:57:49 -04:00
Jatin
2d517fdcc6
Adding changes for hipExtLaunchKernel for rocCLR
...
Change-Id: Iba52bc3bde7c37f3fb375a55ba0947e87b3cdc9b
2020-06-02 14:16:41 -04:00
Evgeny
ef7ff69ff0
adding hipKernelNameRefByPtr function
...
Change-Id: Iefc18967b10394b85a207ffdb5bbfe5e3601474d
2020-05-28 10:59:48 -04:00
Michael LIAO
f6addba699
[hip] Those texture interfaces are C interfaces should be always exposed.
...
Change-Id: Ie34f1420839b17486346149b1672e70ec0088b54
2020-05-27 15:03:59 -04:00
Sarbojit Sarkar
83b11f9a61
[doc]shfl*sync update
...
1. Updated FAQ with shft*sync not supported hip_faq.md
2. Corrected some of input parameter description in hcc_details/hip_runtime_api.h
3. Redirect shfl*() to shfl_*_sync() for nvcc path where CUDA > 9.0
Change-Id: I3d8184db5fcc622852c9bad96b706348e8dfc16c
2020-05-27 02:17:40 -04:00
Mahesha Shivamallappa
01dae52d64
Add support for cooperative group type - thread_block
...
Change-Id: If3770b6d6718a638b70f527ae2533d9ef3267ff4
2020-05-22 23:08:42 -04:00
Aryan Salmanpour
7dd5b19290
Add support for hipExtStreamCreateWithCUMask API
...
Change-Id: I369d0eaca493821c4badc6b18ac02daa2fddc95f
2020-05-22 11:34:06 -04:00
Evgeny
5abb8e1a68
API tracing instrumentation
...
Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70
2020-05-14 11:03:09 -05:00
Matt Arsenault
d2dd307c7d
Remove some asm declarations for intrinsics
...
This technique should never be used, and only accessed through
__builtins.
There's currently no builtin for groupstaticsize. I left ds_swizzle
since for some reason it switches to the builtin based on __HCC__ or
not.
Change-Id: If1e1394221dba83ea4add6db5e94d6b715552044
2020-05-11 15:20:58 -04:00
Michael LIAO
a2dbcc075c
[hip] Fix -Wduplicate-decl-specifier warning. NFC.
...
Change-Id: Iae48bbb7805c39f1005c920df8e76504426f2d3b
2020-05-11 10:12:33 -04:00
Sarbojit Sarkar
3612851809
Enabling hipGetDeviceFlags required in [SWDEV-229170]
...
Change-Id: I998d37e5847f9651345554bada86df6fce86d1eb
2020-05-08 01:37:23 -04:00
Payam
c5f76c3de3
name change vdi to rocclr
...
Change-Id: I06d198bbb4a499e153b290b73a92afed3553b252
2020-05-06 09:14:30 -04:00
Rahul Garg
60c34fbd4d
Make HIP C compliant
...
Change-Id: Ic2fa650675e68200c841ce3db622da836b169f33
2020-05-05 12:49:40 -04:00