Rahul Garg
188278b38a
grid size optimization
...
Change-Id: I68b197548b961e1c0aa63119eecc8d9a25350cf2
2020-09-10 15:10:55 -04:00
agodavar
9e2fa6e215
SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
...
Change-Id: Ibcb365ce2ff27c4c2379609964078da42e1226b1
2020-09-09 13:44:13 -04:00
Siu Chi Chan
bf0d8f37e4
fix no matching push_macro warnings
...
Change-Id: I352fe1170cfc9e8de5307536a21c86f0b483c68e
2020-09-09 11:41:07 -04:00
Jatin Chaudhary
fef8df6e2e
Adding dim3 conversion operator to __hip_builtin_threadIdx_t
...
Change-Id: Ib7f9e45872ec6d48c0e5b7fc93ccecaafaeac754
2020-08-31 21:30:14 -04:00
mshivama
15aff89ba1
Support public apis for cooperative group types.
...
Change-Id: I01346afde485e82c34b7868b9241b34c542d0cf9
2020-08-25 10:50:38 +05:30
Todd tiantuo Li
99eb486937
SWDEV-240803 - add hipFuncSetSharedMemConfig
...
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Jatin Chaudhary
1340b3f07f
Adding Anyorder flag to HIP
...
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Todd tiantuo Li
9dfe15a843
SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
...
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
Jatin Chaudhary
1e886f432d
Cuda 11 changes: https://docs.nvidia.com/cuda/cuda-runtime-api/structcudaPointerAttributes.html#structcudaPointerAttributes
...
Change-Id: I8a5389b88df286043c365a734983a4c5de352102
2020-08-07 05:33:33 -04:00
kjayapra-amd
4d42a969fd
SWDEV-240800 - Adding initial support for hipDeviceGetP2PAttribute. Some attr support still pending.
...
Change-Id: I0611aed136270db497dfa374144f6f5e35352a8f
2020-07-22 14:54:34 -04:00
mshivama
27e237ef48
Add support for a few CUDA-compatible reduction operations like __syncthreads_count()
...
Change-Id: I0917820058f0edad5c0bdd42dea37f06353fc63c
2020-07-14 12:03:14 -04:00
Yaxun (Sam) Liu
13316e2919
Add pow(float/double/_Float16,int)
...
Change-Id: Ie65d15cd3df9853a3bbd613d8c7188ae39c327c7
2020-07-06 07:38:57 -04:00
Ronak Chauhan
affe9ab9b5
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
This is different from 961717879d .
We try to accommodate the case when a kernel template has multiple
type parameters.
Change-Id: I87577d402c92b0f3b51e298f8293f4065e1f6de8
2020-06-30 10:44:55 -04:00
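A minimal sketch of the macro technique described in this commit, written in plain C++ so it compiles without HIP. The names LAUNCH, LAUNCH_INTERNAL, ARGS and add are hypothetical stand-ins, not the actual HIP macros. It only illustrates how a variadic wrapper lets the preprocessor expand a macro argument before a fixed-arity inner macro sees it.

```cpp
// Hypothetical stand-in for the internal launcher: a fixed-arity macro.
// Calling LAUNCH_INTERNAL(add, ARGS) directly would fail, because the
// preprocessor counts only two arguments before ARGS is expanded.
#define LAUNCH_INTERNAL(fn, a, b) fn((a), (b))

// Variadic wrapper: __VA_ARGS__ is fully macro-expanded before the result
// is rescanned, so LAUNCH_INTERNAL sees three separate arguments.
#define LAUNCH(...) LAUNCH_INTERNAL(__VA_ARGS__)

// A user macro that expands to more than one argument.
#define ARGS 2, 3

int add(int x, int y) { return x + y; }

// Expands to add((2), (3)).
int launch_demo() { return LAUNCH(add, ARGS); }
```

The sketch assumes a standard-conforming preprocessor such as GCC or Clang.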
Daniil Fukalov
63e44d16a3
Add __attribute__((const)) to grid related functions declarations
...
This is a cherry-pick of Daniil Fukalov's PR https://github.com/ROCm-Developer-Tools/HIP/pull/2110
which has been committed to master branch.
Make declarations consistent with https://github.com/RadeonOpenCompute/ROCm-Device-Libs/blob/amd-stg-open/ockl/src/workitem.cl
Without the attribute these functions don't have the "readnone" LLVM IR attribute. Without it some optimizations fail, e.g. Loop Invariant Code Motion doesn't hoist these calls out of a loop.
Change-Id: Idb599570d142152cc4f6a3c8986384ad7f0c4729
2020-06-29 13:33:18 -04:00
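A minimal sketch of the attribute discussed in this commit, assuming a GCC- or Clang-compatible compiler. The function block_dim_x_sketch is a hypothetical stand-in for a grid query, not the actual HIP declaration. Marking a declaration __attribute__((const)) tells the compiler that the result depends only on the arguments, which permits (but does not force) hoisting the call out of the loop.

```cpp
// Hypothetical stand-in for a grid-related query. The const attribute
// promises that the function has no side effects and reads no memory.
__attribute__((const)) int block_dim_x_sketch();

int block_dim_x_sketch() { return 64; }

// The call below is loop-invariant. With the attribute, an optimizer is
// allowed to evaluate it once before the loop instead of on every iteration.
int sum_scaled(const int* data, int n) {
    int total = 0;
    for (int i = 0; i < n; ++i) {
        total += data[i] * block_dim_x_sketch();
    }
    return total;
}
```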
Ronak Nilesh Chauhan
b7101af203
Revert "Support passing macros to hipLaunchKernelGGL"
...
This reverts commit 961717879d .
Reason for revert: This patch breaks ROCPrim tests
Change-Id: Ib2235f719861c9f4317c33e86b6c1f8bc669cfd4
2020-06-24 04:28:46 -04:00
Michael LIAO
cea6b99a28
[hip] Disable assert workaround for HCC on HIP-Clang.
...
- HIP-Clang follows the standard assert definition by providing
`__assert_fail`. But, `assert` macro is added as an HCC-specific
workaround due to the missing implementation. Only enable that on the
HCC compilation to avoid unexpected behaviors on HIP-Clang
compilation.
Change-Id: I1c9a707baff9b85c30faef58c52ebfe07e3fc3fc
2020-06-22 10:33:56 -04:00
Ronak Chauhan
961717879d
Support passing macros to hipLaunchKernelGGL
...
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.
Change-Id: Id76e2bf91acd5d68f56a24fc39f219f2eeb06d33
2020-06-22 04:35:29 -04:00
Tao Sang
63051ca2e1
Support numa policy set by user
...
Add hipHostMallocNumaUser flag to hipHostMalloc() in order to support
numa policy set by user.
Change-Id: I6d70ed539a5f97f27187f2242b68849c0e27e4d6
2020-06-19 21:23:58 -04:00
Yaxun (Sam) Liu
b907505d55
Fix missing ldexp(float,int)
...
Change-Id: I2c1553407dfc26948d3ab7aa532eef42a0f6b204
2020-06-18 15:16:59 -04:00
Jason Tang
38cd2b96c7
Add asicRevision
...
Change-Id: I59f3ad20b9bdadf77bd1e0725f7a401d7ad423a3
2020-06-16 17:54:20 -04:00
German Andryeyev
f4211c3905
Initial support for HIP managed memory
...
- Call the new ROCclr interfaces for HMM
Change-Id: I2cd1bf438f712a9e9e328340e7d0c025257ca6c1
2020-06-15 18:10:41 -04:00
Rahul Garg
00301b1665
Add back __mbcnt_lo and __mbcnt_hi
...
Change-Id: Ic3facba2e2245461515799f6a17842da0f5d9933
2020-06-11 21:21:36 -04:00
Dittakavi Satyanvesh
6ed1868203
SWDEV-236670 Address Eigen unit test failure by adding __host__ attribute to half2 functions
...
Change-Id: Ifdc852c30a1b3704871e0ee58cb7a55d3d37fc6e
2020-06-10 03:01:42 -04:00
Yaxun (Sam) Liu
087c579625
Fix include path and wrapper header
...
Currently std::complex and some other std functions require users to
include hip_runtime.h before any other headers to work, which is not
reliable.
Changes are made in clang to fix this issue:
https://reviews.llvm.org/D81176
which requires hipcc and HIP headers to make corresponding changes.
This patch will make sure the clang change will not break
HIP/ROCclr during this transition.
After the transition is done, we can remove explicitly setting
include path for HIP-Clang and HIP header in hipcc and hip config
cmake files and rely on clang driver to set it automatically.
Change-Id: I5d226861c2560ffa6c5ab17343a43cc378048061
2020-06-09 17:37:20 -04:00
Jason Tang
1c0d737e1f
SWDEV-227909 - Add gcnArchName
...
Change-Id: Iea6d16b5d693dd0d900fa424d7a321c39315430e
2020-06-05 15:33:55 -04:00
Siu Chi Chan
784ca6f43c
add constexpr constructor for vector types
...
Change-Id: I45bb0537d6a24ee50b548c2fd8b4f20518764813
2020-06-04 01:57:03 -04:00
Evgeny
cad3f805c0
adding hipGetStreamDeviceId() profiling API
...
Change-Id: I5ccf88ddac123260d7c17defefcf20ff3b2504e2
2020-06-03 18:57:49 -04:00
Jatin
2d517fdcc6
Adding changes for hipExtLaunchKernel for rocCLR
...
Change-Id: Iba52bc3bde7c37f3fb375a55ba0947e87b3cdc9b
2020-06-02 14:16:41 -04:00
Evgeny
ef7ff69ff0
adding hipKernelNameRefByPtr function
...
Change-Id: Iefc18967b10394b85a207ffdb5bbfe5e3601474d
2020-05-28 10:59:48 -04:00
Michael LIAO
f6addba699
[hip] Those texture interfaces are C interfaces and should always be exposed.
...
Change-Id: Ie34f1420839b17486346149b1672e70ec0088b54
2020-05-27 15:03:59 -04:00
Sarbojit Sarkar
83b11f9a61
[doc]shfl*sync update
...
1. Updated the FAQ (hip_faq.md) to note that shfl*sync is not supported
2. Corrected some input parameter descriptions in hcc_details/hip_runtime_api.h
3. Redirected shfl*() to shfl_*_sync() for the nvcc path where CUDA > 9.0
Change-Id: I3d8184db5fcc622852c9bad96b706348e8dfc16c
2020-05-27 02:17:40 -04:00
Mahesha Shivamallappa
01dae52d64
Add support for cooperative group type - thread_block
...
Change-Id: If3770b6d6718a638b70f527ae2533d9ef3267ff4
2020-05-22 23:08:42 -04:00
Aryan Salmanpour
7dd5b19290
Add support for hipExtStreamCreateWithCUMask API
...
Change-Id: I369d0eaca493821c4badc6b18ac02daa2fddc95f
2020-05-22 11:34:06 -04:00
Evgeny
5abb8e1a68
API tracing instrumentation
...
Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70
2020-05-14 11:03:09 -05:00
Matt Arsenault
d2dd307c7d
Remove some asm declarations for intrinsics
...
This technique should never be used; intrinsics should only be accessed
through __builtins.
There's currently no builtin for groupstaticsize. I left ds_swizzle
since for some reason it switches to the builtin based on __HCC__ or
not.
Change-Id: If1e1394221dba83ea4add6db5e94d6b715552044
2020-05-11 15:20:58 -04:00
Michael LIAO
a2dbcc075c
[hip] Fix -Wduplicate-decl-specifier warning. NFC.
...
Change-Id: Iae48bbb7805c39f1005c920df8e76504426f2d3b
2020-05-11 10:12:33 -04:00
Sarbojit Sarkar
3612851809
Enabling hipGetDeviceFlags required in [SWDEV-229170]
...
Change-Id: I998d37e5847f9651345554bada86df6fce86d1eb
2020-05-08 01:37:23 -04:00
Payam
c5f76c3de3
name change vdi to rocclr
...
Change-Id: I06d198bbb4a499e153b290b73a92afed3553b252
2020-05-06 09:14:30 -04:00
Rahul Garg
60c34fbd4d
Make HIP C compliant
...
Change-Id: Ic2fa650675e68200c841ce3db622da836b169f33
2020-05-05 12:49:40 -04:00
Vlad Sytchenko
bfad8d2833
Fix even more typos from 5429b40afe
...
Change-Id: I4f44261547b321a214348943ff5117eb5bd55b06
2020-05-04 15:26:56 -04:00
Alex Xie
d890d77da4
SWDEV-221166 - Detect support for large bar access through HIP runtime API
...
Change-Id: Iaa9756c1b5e40c1ab5afb38e44a6699fa5f6c13f
2020-05-01 20:39:52 -04:00
Michael LIAO
64507de694
Fix more typos from 5429b40afe.
...
Change-Id: I75ed28a5862daffc0778910d7ba3b97f51a87949
2020-05-01 12:19:30 -04:00
root
2689246de6
Merge master into amd-master-next
...
Change-Id: I3fc1dc0c860d627053537581e75561e8a7efe327
2020-04-26 22:19:37 +00:00
Yaxun (Sam) Liu
808dae6813
Enable template max and min for HIP-Clang ( #2028 )
...
It was for HCC only. HIP-Clang also needs it for __fp16 since AMDMIGraphX uses it.
Change-Id: Id49322b7b89ef799accdf6b47627a6fce51d1ab5
2020-04-24 12:30:28 -07:00
Yaxun (Sam) Liu
4143d81618
Enable template max and min for HIP-Clang
...
This change is required by AMDMIGraphX.
It was for HCC only. HIP-Clang also needs it for __fp16 since AMDMIGraphX uses it.
Change-Id: Id49322b7b89ef799accdf6b47627a6fce51d1ab5
2020-04-24 09:51:17 -04:00
Vlad Sytchenko
8d6347c6b8
Make sure to zero out all the unset texture fields
...
These might contain garbage causing the runtime to incorrectly parse the state of the texture references.
Change-Id: I93c726fa30b580b3e14c50ac939f3c71b0d1c8d9
2020-04-23 16:38:52 -04:00
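A minimal sketch of the idea behind this commit, in plain C++. The struct TextureRefSketch and its field names are hypothetical and are not the HIP texture reference type. It shows one way to make sure that fields the caller never sets hold zero rather than leftover garbage.

```cpp
#include <cstring>

// Hypothetical texture descriptor; the field names are illustrative only.
struct TextureRefSketch {
    int normalized;
    int filterMode;
    int addressMode[3];
    float maxAnisotropy;
};

// Zero the whole descriptor first, then fill in only the field the caller
// provides. Every other field is then guaranteed to read as zero.
TextureRefSketch make_texture_ref(int filterMode) {
    TextureRefSketch ref;
    std::memset(&ref, 0, sizeof(ref));
    ref.filterMode = filterMode;
    return ref;
}
```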
Maneesh Gupta
a0b5dfd625
Merge in the rocclr based hip runtime ( #2032 )
...
* Merge master-next changes in master (include vdi development in master branch)
2020-04-23 09:12:06 -07:00
Michael LIAO
218044577e
[hip] Fix typos.
...
Change-Id: I9d85d0e70033d144dbd4d61cb434ffbe023af8c0
2020-04-22 16:44:54 -04:00
Michael LIAO
19f793f1cd
[hip] Generate assertion message in assertion.
...
Change-Id: Ie66f6563e8728fd0e21cf22dcc6619e4a0e5c28d
2020-04-21 16:44:40 -04:00
Michael LIAO
16d9fe5e37
[vdi] Refactor texture/surface reference support.
...
Change-Id: I8014d82aae7139ef5f95e4b50c4fc6da200dbc9d
2020-04-21 11:56:48 -04:00