1. Simply enable tests on NV
Some need minor fixes
performance/compute/hipPerfDotProduct.cpp
performance/dispatch/hipPerfDispatchSpeed.cpp
performance/memory/hipPerfBufferCopyRectSpeed.cpp
performance/memory/hipPerfBufferCopySpeed.cpp
performance/memory/hipPerfDevMemReadSpeed.cpp
performance/memory/hipPerfDevMemWriteSpeed.cpp
performance/memory/hipPerfMemcpy.cpp
performance/memory/hipPerfMemset.cpp
performance/memory/hipPerfSharedMemReadSpeed.cpp
performance/stream/hipPerfDeviceConcurrency.cpp
performance/stream/hipPerfStreamCreateCopyDestroy.cpp
2. Enable and fix on NV
performance/compute/hipPerfMandelbrot.cpp
Root cause: coordIdx is random
Solution: Initialize coordIdx correctly
performance/memory/hipPerfMemFill.cpp
Root cause: HIP ext APIs are called.
Solution: Exclude the cases that involve HIP ext APIs.
performance/memory/hipPerfMemMallocCpyFree.cpp
Root cause: Test allocates more device memory than the GPU has.
Solution: Size device allocations according to GPU capacity.
tests/performance/memory/hipPerfSampleRate.cpp
Root cause: CUDA has no operator+= for float2 and float4.
Solution: Provide the operators.
performance/stream/hipPerfStreamConcurrency.cpp
Root cause: float4 format doesn't match CUDA.
Operators are missing in the CUDA lib.
Solution: Use (x, y, z, w) format.
Add the necessary float4 operators for CUDA.
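As a rough illustration of the operator fix described above, the following host-only sketch defines compound addition for float2 and float4 using the named members (x, y, z, w). The struct definitions are stand-ins so the sketch compiles without the CUDA or HIP headers; in the actual tests the types come from those headers, and the operators would presumably carry device qualifiers and sit behind a platform guard, neither of which is shown here.

```cpp
#include <cassert>

// Stand-ins for the CUDA vector types (assumption: real tests get these
// from the CUDA/HIP headers instead of defining them).
struct float2 { float x, y; };
struct float4 { float x, y, z, w; };

// Compound addition written against the named members.
inline float2& operator+=(float2& a, const float2& b) {
  a.x += b.x;
  a.y += b.y;
  return a;
}

inline float4& operator+=(float4& a, const float4& b) {
  a.x += b.x;
  a.y += b.y;
  a.z += b.z;
  a.w += b.w;
  return a;
}

// Binary addition built on top of operator+=.
inline float4 operator+(float4 a, const float4& b) {
  a += b;
  return a;
}
```

Accessing the components by name rather than by index keeps the code independent of how each platform lays out its vector types.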
Change-Id: I5add29ebabcfb21fb3ef89d09004c5d13423a291
[ROCm/hip commit: 9035ae3154]
* SWDEV-266829 - Enable more tests on AMD and NV devices
1. Enable tests on AMD and NV devices
tests/src/runtimeApi/event/hipEventMultiThreaded.cpp
Loop count and threads per core are reduced so that the test
finishes in a shorter time.
tests/src/runtimeApi/stream/hipStreamCreateWithPriority.cpp
Fix logic error on how to get priority_normal
2. Simply enable test on AMD device
tests/src/runtimeApi/memory/hipManagedKeyword.cpp
tests/src/runtimeApi/module/hipManagedKeyword.cpp
tests/src/runtimeApi/stream/hipStreamACb_MultiThread.cpp
tests/src/runtimeApi/memory/p2p_copy_coherency.cpp
3. Simply enable test on NV device
tests/src/runtimeApi/module/hipModuleLoadDataMultThreaded.cpp
4. Fix typo
tests/src/runtimeApi/stream/hipStreamAddCallbackCatch.cpp
5. Remove useless tests
tests/src/hipC.c
tests/src/hipHcc.cpp
Change-Id: Ia4406353e64d69bd34c58ebb56185701f7ce1caa
* Remove tests/src/runtimeApi/module/hipModuleLoadDataMultThreaded.cpp for cuda test
Co-authored-by: anusha GodavarthySurya <Anusha.GodavarthySurya@amd.com>
Co-authored-by: Jenkins <jenkins-compute@amd.com>
[ROCm/hip commit: 3fd16c0b5b]
Fix the following failing tests on NV:
hipCGMultiGridGroupType
hipCGMultiGridGroupTypeViaBaseType
hipCGMultiGridGroupTypeViaPublicApi
1. Fix wrong logic in kernel for both AMD and NV.
2. Remove unnecessary hipDeviceSynchronize().
3. In hipCGMultiGridGroupTypeViaBaseType.cpp, change
multi_grid_group to thread_group, which was originally intended.
4. Replace hipFree(syncResultD) with hipHostFree(syncResultD).
5. Optimize some host code.
Change-Id: I3fe6dac35a7b14bab12adf397b7885df83d28059
[ROCm/hip commit: c57e0f8fe5]
Enable cooperativeGrps/cooperative_streams on NV.
Add test cases for least, half, and full capacity.
Verify data according to the device type (AMD/NV).
Optimize code.
Change-Id: I3fe6dbc35b7b24abb11adf297b7885df83d28154
[ROCm/hip commit: 67b3681d26]
Migrated malloc-related files under the memory folder to the Catch2 framework
Change-Id: I5aa07fc8148bdf6bef135947091aaf1d3c54663b
[ROCm/hip commit: 05e230f5c1]
Add test cases for filter modes: hipFilterModePoint and hipFilterModeLinear
Change-Id: I3fe6dbc35a7b14aab12adf297b7885df83d28056
[ROCm/hip commit: 48d8040b06]
Migrated all hipMemcpy-related APIs to the Catch2 framework, optimizing
the code and moving the stress-related tests to the stress folder.
Change-Id: Id47669b49304c35d1a68fabdaaf3f6e3ab0428a5
[ROCm/hip commit: 346a77b4c0]
1. In kernel/hipDynamicShared
Fix shared memory size and type mismatch in host and kernel.
2. In kernel/hipDynamicShared2
CUDA kernels relying on shared memory allocations over 48 KB must
explicitly set the size using hipFuncSetAttribute().
Change-Id: I4248b6cebd3dc156f9d5d427e1897da22fb964ed
[ROCm/hip commit: 5b739b0373]
Make hipIpcOpenEventHandle behave the same as cudaIpcOpenEventHandle.
Add API usages.
Change-Id: I4248b2cebd3de156f9d5d427e1797da22fb964eb
[ROCm/hip commit: c053d60282]
1. Fix hipModuleNegative failure on all NV GPUs
a. Add a signal handler for signals sent by CUDA functions.
b. Make hipModuleGetGlobal match cuModuleGetGlobal behaviour.
That is, if one of the first two parameters is nullptr, ignore it.
2. Fix hipModuleLoadDataMultThreaded failure on NV RTX5000
Improve the lambda function.
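The "ignore a nullptr output parameter" behaviour in item 1.b can be sketched in plain C++ as follows. This is not the HIP implementation: `GlobalSymbol` and `query_global` are hypothetical names introduced only to show the pattern of writing each output only when the caller supplied a destination for it.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical record standing in for a global symbol found in a module.
struct GlobalSymbol {
  void* address;
  std::size_t bytes;
};

// Writes only the outputs the caller asked for. A null output pointer is
// skipped rather than treated as an error. Returns false only when the
// symbol itself is missing.
inline bool query_global(const GlobalSymbol* sym, void** dptr,
                         std::size_t* bytes) {
  if (sym == nullptr) {
    return false;
  }
  if (dptr != nullptr) {
    *dptr = sym->address;
  }
  if (bytes != nullptr) {
    *bytes = sym->bytes;
  }
  return true;
}
```

A caller that only needs the size can then pass nullptr for the address, and the reverse.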
Change-Id: I3fe6dbc35a7a14aa9119df197b7885df83d28047
[ROCm/hip commit: ae30c5cd6b]
Enable as many NV printf DTests as possible.
Fix the bugs caused by behaviour differences between
HIP-ROCclr and CUDA.
Add hipLimitPrintfFifoSize.
Change-Id: I3fe6dbc35a7a140a9919df197b7885df83d28049
[ROCm/hip commit: 586165ebc2]
Migrated functional and negative scenarios of
hipMemcpyAtoH and
hipMemcpyHtoA APIs to catch2 framework
Change-Id: I68bb37d99fc371b8803e64ebf1533c0870b14fab
[ROCm/hip commit: bdc90769fb]
Changes contain Device files under "hip/tests/src/runtimeApi/device" migrated to Catch2 and some additional device tests.
Change-Id: Ie88adc3c32c2079456b14e029cfc6c319b48d8f6
[ROCm/hip commit: 44e1ae449a]
passed() should not call exit(0), which prevents object
destructors from being called.
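A minimal sketch of why this matters: exit() terminates the process without unwinding automatic objects, so destructors of locals in the test body never run. The `Resource` type, the `run_test` function, and the message printed below are hypothetical; only the idea that passed() reports success and lets the caller return normally is taken from the change.

```cpp
#include <cassert>
#include <cstdio>

// Counts how many Resource destructors have run (illustrative only).
static int g_released = 0;

// Hypothetical RAII holder standing in for a test's buffers or streams.
struct Resource {
  ~Resource() { ++g_released; }
};

// Reports success without terminating the process.
int passed() {
  std::printf("PASSED!\n");
  return 0;
}

// Hypothetical test body: 'res' is destroyed when the function returns.
// Had passed() called exit(0), the destructor of 'res' would be skipped,
// because exit() does not destroy objects with automatic storage.
int run_test() {
  Resource res;
  return passed();
}
```

Returning the status up to main() and letting main() return it keeps cleanup in the hands of the destructors.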
Change-Id: Ibaee0d85cb616727dc5db557bb9422702670b688
[ROCm/hip commit: c74e2da333]
Added functional and negative scenarios for hipMemcpy2DFromArray and
hipMemcpy2DFromArrayAsync APIs
Change-Id: I269499b1d538affd505d9f529e7516cb3aa17863
[ROCm/hip commit: f6e9d2491b]
http://ontrack-internal.amd.com/browse/SWDEV-230423
1. Moving stress folder from hip/test/src to hip/test.
2. Adding Stream stress tests.
These stress tests create multiple streams and launch kernels on them
in multiple combinations for 100000 iterations. They test
the stability of streams created using hipStreamCreate, hipStreamCreateWithPriority
and hipStreamCreateWithFlags.
3. Adding printf stress tests using simple kernels.
4. Adding printf stress tests using a complex kernel.
Change-Id: Idcd26707fb9504ab8dbe6cebcbb32ade61bf1483
[ROCm/hip commit: 1b53e0c51c]
Migrated functional and negative scenarios of
hipMemcpy2D and
hipMemcpy2DAsync APIs to catch2 framework
Change-Id: I339f0914cc76fee194fff53dd9686b0c16d85af9
[ROCm/hip commit: b77fe6cc9f]
Added functional and negative scenarios for
hipMemcpy3D and
hipMemcpyParam2D APIs in catch2 framework
Change-Id: I3473eb952c23aba6bc500d78ca93d52a9f7a5d6f
[ROCm/hip commit: aa6b01f793]
Changes contain Catch2 device multiprocess tests which were covered under "hip/tests/src/runtimeApi/device" and some additional tests. These test files make use of the fork call or set the visible-devices environment variable, and validate various device aspects.
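The fork-plus-environment pattern mentioned above can be sketched as follows, assuming a POSIX system. The helper name `run_in_child_with_env` and the variable name used in the usage note are hypothetical; the commit does not say which variable the tests set. A real test would query the device count in the child, whereas this sketch only confirms that the child sees the variable and the parent's environment stays untouched.

```cpp
#include <cassert>
#include <cstdlib>
#include <cstring>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

// Runs a check in a child process with 'name' set to 'value'.
// Returns the child's exit code, or -1 if fork or wait failed.
int run_in_child_with_env(const char* name, const char* value) {
  pid_t pid = fork();
  if (pid < 0) {
    return -1;
  }
  if (pid == 0) {
    // Child: change the environment before any runtime would be initialized.
    if (setenv(name, value, 1) != 0) {
      _exit(2);
    }
    const char* seen = std::getenv(name);
    // A real test would validate device properties here.
    bool ok = (seen != nullptr) && (std::strcmp(seen, value) == 0);
    _exit(ok ? 0 : 1);
  }
  // Parent: wait for the child and translate its status.
  int status = 0;
  if (waitpid(pid, &status, 0) != pid) {
    return -1;
  }
  return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

For example, `run_in_child_with_env("SKETCH_VISIBLE_DEVICES", "0")` should return 0 when the child sees the value, and the parent process does not see the variable afterwards. Running each configuration in its own process lets a single test binary cover several visibility settings.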
Change-Id: Iaaca37598d386104da9e5f37f92ab176bc8a2845
[ROCm/hip commit: 6a8fd2b762]
Added functional and negative scenarios for
hipMemcpyPeer and
hipMemcpyPeerAsync APIs
Change-Id: Ibaee0d84cb616727dc5db057bb9422702670b686
[ROCm/hip commit: 48c2a0fb27]
Added functional and negative scenarios for hipMemcpy2DToArray and
hipMemcpy2DToArrayAsync APIs
Change-Id: I97331a587fb1d51583faeec998cdb4ac09bfa564
[ROCm/hip commit: edb1a96cdb]