1.Remove clock functions from some tests that don't need them.
2.In some memory pool tests and coherency tests, timer-based kernel
delay isn't reliable, use pinned host based notification instead.
3.Add CHECK_PCIE_ATOMICS_SUPPORT before some tests.
4.catch/unit/memory/hipMemoryAllocateCoherent.cc is removed
as it is useless and originally excluded in building.
5.Some tests can still pass even if clock rate =0, thus they
will be kept as is.
6.Some logic and format improvement in some tests.
Change-Id: I6b3c6bf54c61cffd45cd6f17c75998f751b75725
- For H2D transfer cases, we cannot enfore ErrorNotReady behavior if the
source is unpinned. In that case the driver can sync depending on if
it pins or stages the source buffer.
Change-Id: I436a4b667bff172cfbadc58631a46a215c40fcb1
- Since any-host-to-any-host as always synchronous, there's no need for
separate functions
- Removed 'return' statement as it skips the rest of the test.
Change-Id: If70e0b21ad8151311d5ac7c5de789f6265cf44b6
Fix memory leaks by adding missing destroy calls for
events, streams, and graphs at the end of tests.
Ensure that every test case executes destroy calls,
regardless of whether it passes or fails.
Change-Id: I814e35c528d90ed2abb34d77377f1a7fd3f1f11c
* _sync tests that use warp match functions (__match_any_sync and __match_all_sync) were disabled for nvidia because those functions are supported only on devices with compute capability 7.x or higher. In this solution, tests are always build (with specifying appropriate compile flag) and in in the runtime it is decided weather they should be skipped or not.
Change-Id: Ifa9e55f3b47aa3a00027cb986cd6ae46aed45ebd
Add kernel excution perf test per blockSize and block number.
Implement a solution to roughly evaluate gpu variable
frequency based on clock64() and wall_clock64().
Change-Id: Ic87761a862d4a894fdcaab3431d63fe2592bb682
- Fixed wrong assumptions in the test:
- Pinned Host memory to Pinned Host memory is async (before:sync)
- Pinned Host memory to Device is async (before:sync)
Change-Id: Ib826e177854cbcdad04181e245382cf0bec02c46
Modified
TestContext::finalizeResults()
Added
HIP_CHECK_IGNORED_RETURN()
HIP_CHECK_ERRORS()
to supported ignored error such as hipErrorNotSupported.
Enable all mipmap tests in catch2 memory folder for linux.
The issues in the following tickets should be fixed.
SWDEV-465162, SWDEV-465163, SWDEV-465165, SWDEV-465166,
SWDEV-465168, SWDEV-465171, SWDEV-465172
Change-Id: I1146d6184582a7111a5af25b52f9e12c6a160e49
The following builtins are tested:
- __all_sync, __any_sync, __ballot_sync and __activemask
- __match_any_sync and __match_all_sync
- __shfl_sync, __shfl_up_sync, __shfl_down_sync, and __shfl_xor_sync
The tests for shfl (all variants) were manually validated on a CUDA setup.
NOTE:
- Unit_hipVoteSync_All temporarily disabled on Windows (SWDEV-452308).
- All new tests temporarily disabled on CUDA (SWDEV-453145).
Change-Id: I84b205a88aa24219d199c760793e2f19f0cf8f13
This change adds a new cmake target to generate test plan documentation using
Doxygen. The target is called build_docs and it's not added to the default
build commands, hence it needs to be called separately.
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com>
Change-Id: I441524568e0513019ca21467950e48a551234802
Multiple subtests will fail if the default mem pool will keep information
of the previous subtests.
Change-Id: Ic9ab8ed93771e71735594a85235b5bb734d6680c