Make hipMemset asynchronous with respect to the host unless
destination pointer refers to pinned host memory.
Change-Id: I5ba230e0cc622aa076dc94692a4c43a57dd2df7b
[ROCm/hip commit: 08f827b28b]
Modified hipPointerGetAttributes to support hipHostRegister ptrs and mapped device ptrs from hipHostRegister
Change-Id: I2f4cf12e09578768308f644123d6d4bdccbd77a8
[ROCm/hip commit: a7b5347c90]
1.Fix unexpected repeated hip runtime lib building
2.Seperate test cmake from top cmake.
Change-Id: I1ae457effd639d6c25bca293c1724b026be21ecd
[ROCm/hip commit: d074c2491f]
The older compiler does not promote integral
arguments to double, instead it uses float.
This patch tries to support the old compiler
headers, and also the newer one.
Change-Id: I01e0b96a82ecddc6230cb5baf2573a318b25445b
[ROCm/hip commit: 09a1c22208]
Remove hip-hcc codes from hip code base
Simplify hip CMakeLists.txt to exclude hip-hcc
Simplify cmake cmd for hip-rocclr building
Some minor fixes
Change-Id: I1ae357ecfd638d6c25bca293c1724b026be21ecd
[ROCm/hip commit: 186f95ea30]
HIP supports compiling kernels from LLVM IR into executable.
The device LLVM IR needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README was added.
Change-Id: I8ebb6ae86b7ab4290f7cba2eea5584d73a7c453e
[ROCm/hip commit: 8a5b8a36f2]
HIP supports compiling kernels from assembly into exec.
The device assembly needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README is added.
Change-Id: I59d3a8b5363073810ffc3aa0d57f21b0df272369
[ROCm/hip commit: 33f0a41c7a]
The HIP-Clang math headers upstream have been updated
to perform type promotion on math function overloads.
If any argument has integral type, it will be cast to
double.
Change-Id: I38d9e6240c40c4092b946851469498b2ae6500ae
[ROCm/hip commit: 1f0037eefc]
SWDEV-261229: Change atomicAdd float implementation to use clang builtin
along with deprecating atomicAddNoRet
Change-Id: I5e98034fec54ee1a2b85d5f56273e34945d49ea3
[ROCm/hip commit: 019c556c7d]
Add a flag to record gpu TS in a marker, also disable marker flush
if there was already a marker submitted before.
Change-Id: I9cdf8e49137690c0050fe9370764dd059855a2f8
[ROCm/hip commit: 390d8c54cb]
- As `long`/`ulong` on Windows has different bit sizes from Linux
platforms, even compiled in 64-bit mode. Need to use portable data
type `int64_t`/`uint64_t` as possible or use `long long` ones to
match the prototype in LLVM IR.
Change-Id: I62fe16a2e53aa0f2f2b4069b8a1596b8db7050c9
[ROCm/hip commit: be786dff8e]
1. Negative Test Cases
2. API functionality tests for hopcount and linktype using rocm-smi library.
3. API functionality test for hopcount when devices are reversed.
4. Disabling test case 7.
5. Implemented External Review Comments.
6. Created a new header file hipP2pLinkTypeAndHopFunc.h.
7. Removed test case 0x7 from directed test.
8. Put testMaskedDevice() under __linux__, placed TODO in line 404 (comments) and replaced '&=' with '='.
SWDEV-238517 for enhancing hip unit tests
Change-Id: I1ae256ecfd637d6c25bca293c1724b026be20eca
[ROCm/hip commit: 0a05c79606]
- Test d2d transfer after doing copy from pinned host buffer
- Test d2d transfer after doing copy from unpinned host buffer
SWDEV-238517 for enhancing hip unit tests
Change-Id: Ib262185ce2692036e11c0f95ccb850eb3bce1c81
[ROCm/hip commit: 8a8af8e992]
1. Negative Test Cases
2. API functionality test using multiple streams.
3. API functionality validation using multiple GPUs.
4. Setting hipTexRefSetFlags() with HIP_TRSF_READ_AS_INTEGER flag instead of 0 as suggested in SWDEV-256096.
5. Enabling test cases 0x03 (short) and 0x04 (char).
6. Implemented external code review comments.
SWDEV-238517 for enhancing hip unit tests
Change-Id: If42796047ec1cf2e3695dc2b7f40a2d9dd50f5bd
[ROCm/hip commit: 9d92c90b55]
SWDEV-262064 - compilation fails when hipLaunchCooperativeKernel C++ routine is invoked
on nvcc path.
Change-Id: Icde35dd1a7008b406401bb3996c06d7acfb27d8d
[ROCm/hip commit: 44d6914ef3]