Make hipMemset asynchronous with respect to the host unless
destination pointer refers to pinned host memory.
Change-Id: I5ba230e0cc622aa076dc94692a4c43a57dd2df7b
[ROCm/clr commit: 9ba5902be4]
Modified hipPointerGetAttributes to support hipHostRegister ptrs and mapped device ptrs from hipHostRegister
Change-Id: I2f4cf12e09578768308f644123d6d4bdccbd77a8
[ROCm/clr commit: 40733e1fff]
1.Fix unexpected repeated hip runtime lib building
2.Seperate test cmake from top cmake.
Change-Id: I1ae457effd639d6c25bca293c1724b026be21ecd
[ROCm/clr commit: a7aa84f73a]
The older compiler does not promote integral
arguments to double, instead it uses float.
This patch tries to support the old compiler
headers, and also the newer one.
Change-Id: I01e0b96a82ecddc6230cb5baf2573a318b25445b
[ROCm/clr commit: d432615faa]
Remove hip-hcc codes from hip code base
Simplify hip CMakeLists.txt to exclude hip-hcc
Simplify cmake cmd for hip-rocclr building
Some minor fixes
Change-Id: I1ae357ecfd638d6c25bca293c1724b026be21ecd
[ROCm/clr commit: 1cba7ec965]
HIP supports compiling kernels from LLVM IR into executable.
The device LLVM IR needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README was added.
Change-Id: I8ebb6ae86b7ab4290f7cba2eea5584d73a7c453e
[ROCm/clr commit: f7b5fea916]
HIP supports compiling kernels from assembly into exec.
The device assembly needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README is added.
Change-Id: I59d3a8b5363073810ffc3aa0d57f21b0df272369
[ROCm/clr commit: b8b90f344c]
The HIP-Clang math headers upstream have been updated
to perform type promotion on math function overloads.
If any argument has integral type, it will be cast to
double.
Change-Id: I38d9e6240c40c4092b946851469498b2ae6500ae
[ROCm/clr commit: 9c041b1984]
SWDEV-261229: Change atomicAdd float implementation to use clang builtin
along with deprecating atomicAddNoRet
Change-Id: I5e98034fec54ee1a2b85d5f56273e34945d49ea3
[ROCm/clr commit: 542d3219dc]
Add a flag to record gpu TS in a marker, also disable marker flush
if there was already a marker submitted before.
Change-Id: I9cdf8e49137690c0050fe9370764dd059855a2f8
[ROCm/clr commit: db7ed1a78c]
- As `long`/`ulong` on Windows has different bit sizes from Linux
platforms, even compiled in 64-bit mode. Need to use portable data
type `int64_t`/`uint64_t` as possible or use `long long` ones to
match the prototype in LLVM IR.
Change-Id: I62fe16a2e53aa0f2f2b4069b8a1596b8db7050c9
[ROCm/clr commit: f3b63c004a]
1. Negative Test Cases
2. API functionality tests for hopcount and linktype using rocm-smi library.
3. API functionality test for hopcount when devices are reversed.
4. Disabling test case 7.
5. Implemented External Review Comments.
6. Created a new header file hipP2pLinkTypeAndHopFunc.h.
7. Removed test case 0x7 from directed test.
8. Put testMaskedDevice() under __linux__, placed TODO in line 404 (comments) and replaced '&=' with '='.
SWDEV-238517 for enhancing hip unit tests
Change-Id: I1ae256ecfd637d6c25bca293c1724b026be20eca
[ROCm/clr commit: 4b16d3896d]
- Test d2d transfer after doing copy from pinned host buffer
- Test d2d transfer after doing copy from unpinned host buffer
SWDEV-238517 for enhancing hip unit tests
Change-Id: Ib262185ce2692036e11c0f95ccb850eb3bce1c81
[ROCm/clr commit: 50524ae60c]
1. Negative Test Cases
2. API functionality test using multiple streams.
3. API functionality validation using multiple GPUs.
4. Setting hipTexRefSetFlags() with HIP_TRSF_READ_AS_INTEGER flag instead of 0 as suggested in SWDEV-256096.
5. Enabling test cases 0x03 (short) and 0x04 (char).
6. Implemented external code review comments.
SWDEV-238517 for enhancing hip unit tests
Change-Id: If42796047ec1cf2e3695dc2b7f40a2d9dd50f5bd
[ROCm/clr commit: 4f039d1bf5]
SWDEV-262064 - compilation fails when hipLaunchCooperativeKernel C++ routine is invoked
on nvcc path.
Change-Id: Icde35dd1a7008b406401bb3996c06d7acfb27d8d
[ROCm/clr commit: 80cb172fef]
Change-Id: I043caa89b2a43dace7f18c739f30d2a7bfa644ca
Fixes the issue when memcpy DtoH is performed on device 0 where the device buffer
is allocated on device1 and the destination pinned buffer is allocated from device 0.
[ROCm/clr commit: 60ec2a3e46]