Граф коммитов

1309 Коммитов

Автор SHA1 Сообщение Дата
Aryan Salmanpour d9a335bccf Add support for hipExtStreamGetCUMask API
Change-Id: I0fa67ad581dd75556f17c7410af2c1d5cb6ae99a
2020-11-20 13:41:38 -05:00
Vladislav Sytchenko fb04bdc4f2 Report maxTexture1DLinear property
Change-Id: If562bbee80d8b98712806cb8258b5a7482d33bf6
2020-11-20 10:18:20 -05:00
Julia Jiang 3ccce6a642 SWDEV-249486-re-organize doxygen HIP API
Change-Id: I3d107de526e428ae16c6dffd97d9388bf54a6358
2020-11-11 13:31:56 -05:00
Sourabh Betigeri 81ca27783b SWDEV-255694: Include attributes - hipFuncAttributePreferredSharedMemoryCarveout and hipFuncAttributeMaxDynamicSharedMemorySize
Change-Id: I3569859e6db0c59a10a76c8ea356d4bc0a0bb906
2020-11-02 19:39:39 -05:00
German Andryeyev 2bf7282bd5 Update documentation
Add managed memory section for the doxygen documentation.
No code changes.

Change-Id: I2e9db4ec13422159e265193916f0a3cdb721732d
2020-10-23 18:28:43 -04:00
Aaron En Ye Shi 2005c3437e Fix header for OpenMP AMDGCN support
Add ifndef for __OPENMP_AMDGCN__ target, which shares
these header files with HIP.

Change-Id: I720693cfbcfb7836cd2f361d48530fb55ee3557a
2020-10-20 14:04:48 -04:00
Aaron En Ye Shi 20999c719d Fix header invalid sqrt fwd decl and ifdef
The __ocml_sqrt_* functions only accept a single arg,
not two args. Also, change ifdef for if because the flag
__HCC_OR_HIP_CLANG__ is always defined.

Change-Id: Id9dc4eaf0f25df1df27a1bf643cb545ac23653f8
2020-10-19 18:01:14 +00:00
Julia Jiang 4e7f420722 SWDEV-249486-Update HIP doxygen
Change-Id: Id32772b8f4d495e0f1bd3cb5cd1ec211a06298c3
2020-10-14 13:59:58 -04:00
Sarbojit Sarkar 86b1600ca7 SWDEV-255975: fix for gcc + hip_fp16.h compilation error
Change-Id: Ia152bad4af71a5b53a16e2f9fd9aeeacc61b39ac
2020-10-14 06:57:25 -04:00
German Andryeyev d9f655f50e Report managed memory capabilities
Change-Id: I241a42e26ee49fb47dcb3ba87c4bd8fb691dffdf
2020-10-13 16:09:43 -04:00
Julia Jiang c2a3d19acb SWDEV-249487-remove two HCC related APIs
Change-Id: Icf90ece0808643e0c5f0cc00b6644d5171b553d8
2020-10-13 11:08:37 -04:00
agodavar e0be2ca8dd SWDEV-255853: CUDA device P2P attributes
Change-Id: I7deffd04101b565f68a72a179f9faeaa22cab97d
2020-10-13 05:32:29 -04:00
agodavar d96e810ccd SWDEV-254909:Support for missing texture/array cuda driver types
Change-Id: I9c769f6335ec3e7feacaa8929719c666bff91934
2020-10-13 05:30:31 -04:00
Sameer Sahasrabuddhe c36c0bad22 Do not inline the body of __assert_fail()
The device side assertion calls printf to write out a message. In the
device compiler, printf is expanded into a series of hostcalls that
transmit the printf payload to the host. This expansion increases the
length of the kernel, resulting in sub-optimal compilation. The
solution is to ensure that the assert() implementation is not inlined
into the kernel.

Change-Id: Ia3a075461a755cf007218f262b0863e1926c76aa
2020-10-10 01:03:04 -04:00
Yaxun (Sam) Liu dc20086cbb Disable wave64 support on GFX10 for HIP
Change-Id: Ie3d4fe91516167ea26ce704689b6b6f1951a7521
2020-10-09 17:02:56 -04:00
Yaxun (Sam) Liu c7ac5531d3 Fix warpSize for gfx10
Change-Id: I8029ebeb91db1efa8e166ad349aaae6364e8069d
2020-10-05 20:45:07 -04:00
Vladislav Sytchenko ad2d55c144 Revert "Revert "Merge branch 'amd-master-next' into amd-npi-next""
This reverts commit 374ead1d19.

Reason for revert: <INSERT REASONING HERE>

Change-Id: I92ceb171e31026ed1864704cef2fc1497b883ef9
2020-10-05 13:20:58 -04:00
Vladislav Sytchenko 374ead1d19 Revert "Merge branch 'amd-master-next' into amd-npi-next"
This reverts commit 73558e3363.

Reason for revert: <INSERT REASONING HERE>

Change-Id: I53322718dadde2c98f96140b8e260ec7ee9ef721
2020-10-05 13:02:39 -04:00
Aryan Salmanpour 3c72e7beea rename abort in nvcc path
[background] There is currently a compilation issue when both hip/hip_runtime.h and hip/hip_cooperative_groups.h included
in a file and hipcc used to compile it on NV platform. The issue is that an abort is defined in hip/nvcc_detail/hip_runtime.h
and it is also defined in the CUDA cooperative groups header (/cuda/include/cooperative_groups/details/helpers.h).
this is problematic and leads to a compilation issue in hipcc on NV platform.

Change-Id: I2ab6982ac4103822a1a4a0ced942cd604d6c19c1
2020-10-01 14:28:51 -04:00
mshivama b8640f1f0d Add missing definitions for class static members
The static members __HIP_Coordinates::x, ::y and ::z must be defined
outside the class. Otherwise, linker throws `undefined reference error`
when these definitions are needed in the HIP application.

Change-Id: Iabc09744b478c22e4b13cf9824877ec9cfdd4f7a
2020-09-30 03:38:24 -04:00
Sarbojit Sarkar bf20337fc1 SWDEV-253247: add ulong and ulonglong version of__shfl*
Change-Id: I40ab6cfa12175f334e8392b71f567054d8256e2a
2020-09-30 01:58:22 -04:00
mshivama 4b6d92798f Layout the cooperative group related headers to target Nvidia
Change-Id: I530ffc72d6fae1e273b5322d508629221e11ea6f
2020-09-29 21:32:24 -04:00
Sarbojit Sarkar 0be1f7843a SWDEV-252337: Fix for V100 compilation issue
Change-Id: Ib57ff3ac28f0a581f395f32181974e64eca6d858
2020-09-25 21:55:22 -04:00
agodavar 288fbe5af3 SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
Change-Id: Iec989787e546ff2eb68c4b146dc540655698b569
2020-09-24 12:18:06 -04:00
Aryan Salmanpour 5831687985 use only hip compiler to compile hip cooperative groups
SWDEV-252377

Change-Id: I6cf749dad2bf05c2e49fed495ab2833c574c2215
2020-09-22 14:34:05 -04:00
Aryan Salmanpour ddceeac8eb Fix a warning with cooperative groups header
SWDEV-252413

Change-Id: Ia098464b670f26416b79cc67805b8accb6bec903
2020-09-18 13:38:47 -04:00
Rahul Garg 188278b38a grid size optimization
Change-Id: I68b197548b961e1c0aa63119eecc8d9a25350cf2
2020-09-10 15:10:55 -04:00
agodavar 9e2fa6e215 SWDEV-249870 - Short-Term solution for Pre-Compiled Headers for Online Compilation
Change-Id: Ibcb365ce2ff27c4c2379609964078da42e1226b1
2020-09-09 13:44:13 -04:00
Siu Chi Chan bf0d8f37e4 fix no matching push_macro warmnings
Change-Id: I352fe1170cfc9e8de5307536a21c86f0b483c68e
2020-09-09 11:41:07 -04:00
Jatin Chaudhary fef8df6e2e Adding dim3 conversion operator to __hip_builtin_threadIdx_t
Change-Id: Ib7f9e45872ec6d48c0e5b7fc93ccecaafaeac754
2020-08-31 21:30:14 -04:00
mshivama 15aff89ba1 Support public apis for cooperative group types.
Change-Id: I01346afde485e82c34b7868b9241b34c542d0cf9
2020-08-25 10:50:38 +05:30
Todd tiantuo Li 99eb486937 SWDEV-240803 - add hipFuncSetSharedMemConfig
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Jatin Chaudhary 1340b3f07f Adding Anyorder flag to HIP
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Todd tiantuo Li 9dfe15a843 SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
Jatin Chaudhary 1e886f432d Cuda 11 changes: https://docs.nvidia.com/cuda/cuda-runtime-api/structcudaPointerAttributes.html\#structcudaPointerAttributes
Change-Id: I8a5389b88df286043c365a734983a4c5de352102
2020-08-07 05:33:33 -04:00
kjayapra-amd 4d42a969fd SWDEV-240800 - Adding initial support for hipDeviceGetP2PAttribute. Some attr support still pending.
Change-Id: I0611aed136270db497dfa374144f6f5e35352a8f
2020-07-22 14:54:34 -04:00
mshivama 27e237ef48 Add support for few CUDA compatible reduction operations like __syncthreads_count()
Change-Id: I0917820058f0edad5c0bdd42dea37f06353fc63c
2020-07-14 12:03:14 -04:00
Yaxun (Sam) Liu 13316e2919 Add pow(float/double/_Float16,int)
Change-Id: Ie65d15cd3df9853a3bbd613d8c7188ae39c327c7
2020-07-06 07:38:57 -04:00
Ronak Chauhan affe9ab9b5 Support passing macros to hipLaunchKernelGGL
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.

This is different from 961717879d.

We try to accomodate the case when a kernel template has multiple
type parameters.

Change-Id: I87577d402c92b0f3b51e298f8293f4065e1f6de8
2020-06-30 10:44:55 -04:00
Daniil Fukalov 63e44d16a3 Add __attribute__((const)) to grid related functions declarations
This is cherrypick of Daniil Fukalov's PR https://github.com/ROCm-Developer-Tools/HIP/pull/2110
which has been committed to master branch.

Make declarations consistent with https://github.com/RadeonOpenCompute/ROCm-Device-Libs/blob/amd-stg-open/ockl/src/workitem.cl
Without the attribute these functions don't have "readnone" LLVM IR attribute. Without it some optimizations fails, e.g. Loop Invariant Code Motion doesn't hoist these calls out of a loop.

Change-Id: Idb599570d142152cc4f6a3c8986384ad7f0c4729
2020-06-29 13:33:18 -04:00
Ronak Nilesh Chauhan b7101af203 Revert "Support passing macros to hipLaunchKernelGGL"
This reverts commit 961717879d.

Reason for revert: This patch breaks ROCPrim tests

Change-Id: Ib2235f719861c9f4317c33e86b6c1f8bc669cfd4
2020-06-24 04:28:46 -04:00
Michael LIAO cea6b99a28 [hip] Disable assert workaround for HCC on HIP-Clang.
- HIP-Clang follows the standard assert definition by providing
  `__assert_fail`.  But, `assert` macro is added as an HCC-specific
  workaround due to the missing implementation. Only enable that on the
  HCC compilation to avoid unexpected behaviors on HIP-Clang
  compilation.

Change-Id: I1c9a707baff9b85c30faef58c52ebfe07e3fc3fc
2020-06-22 10:33:56 -04:00
Ronak Chauhan 961717879d Support passing macros to hipLaunchKernelGGL
This makes hipLaunchKernelGGL take a variable argument list, that will be
expanded before being fed to hipLaunchKernelGGLInternal.

Change-Id: Id76e2bf91acd5d68f56a24fc39f219f2eeb06d33
2020-06-22 04:35:29 -04:00
Tao Sang 63051ca2e1 Support numa policy set by user
Add hipHostMallocNumaUser flag to hipHostMalloc() in order to support
numa policy set by user.

Change-Id: I6d70ed539a5f97f27187f2242b68849c0e27e4d6
2020-06-19 21:23:58 -04:00
Yaxun (Sam) Liu b907505d55 Fix missing ldexp(float,int)
Change-Id: I2c1553407dfc26948d3ab7aa532eef42a0f6b204
2020-06-18 15:16:59 -04:00
Jason Tang 38cd2b96c7 Add asicRevision
Change-Id: I59f3ad20b9bdadf77bd1e0725f7a401d7ad423a3
2020-06-16 17:54:20 -04:00
German Andryeyev f4211c3905 Initial support for HIP managed memory
- Call the new ROCclr interfaces for HMM

Change-Id: I2cd1bf438f712a9e9e328340e7d0c025257ca6c1
2020-06-15 18:10:41 -04:00
Rahul Garg 00301b1665 Addback __mbcnt_lo and __mbcnt_hi
Change-Id: Ic3facba2e2245461515799f6a17842da0f5d9933
2020-06-11 21:21:36 -04:00
Dittakavi Satyanvesh 6ed1868203 SWDEV-236670 Address Eigen unit test failure by adding __host__ attribute to half2 functions
Change-Id: Ifdc852c30a1b3704871e0ee58cb7a55d3d37fc6e
2020-06-10 03:01:42 -04:00
Yaxun (Sam) Liu 087c579625 Fix include path and wrapper header
Currently std::complex and some other std functions require uses to
include hip_runtime.h before any other headers to work, which is not
reliable.

changes are made in clang to fix this issue:
https://reviews.llvm.org/D81176

which requires hipcc and HIP headers to make corresponding changes.

This patch will make sure the clang change will not break
HIP/ROCclr during this transition.

After the transition is done, we can remove explicitly setting
include path for HIP-Clang and HIP header in hipcc and hip config
cmake files and rely on clang driver to set it automatically.

Change-Id: I5d226861c2560ffa6c5ab17343a43cc378048061
2020-06-09 17:37:20 -04:00