Ben Sander
882dab4536
Refactor hipStreamWaitEvent
...
- Null streams use same flow as non-null.
- Add HIP_SYNC_STREAM_WAIT
- Resolve null stream.
2017-08-31 03:00:14 +00:00
Rahul Garg
4b98572f47
Enable hipStreamSync2 and record_event dtests
2017-08-24 15:51:58 +05:30
Rahul Garg
9441f3a50e
PCI Bus ID related dtests
2017-08-22 15:41:16 +05:30
Maneesh Gupta
4bcb3372b5
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan
9a04c5cdd4
add C++11 compilation flags and minor bug fixes
2017-08-10 17:58:36 +00:00
Maneesh Gupta
f14e31d9d7
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta
69e431f035
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Rahul Garg
2f617bac6d
Disable failing test p2p_copy_coherency
2017-08-02 14:47:13 +05:30
Siu Chi Chan
96e79f97e2
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Ben Sander
9cede50942
Merge pull request #110 from sunway513/fix_hipEnvVar_test
...
Fix hipEnvVar test
2017-07-27 13:03:43 -05:00
sunway513
ed0d6cf745
Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test
2017-07-27 17:41:49 +00:00
Ben Sander
4f93b99f23
Merge pull request #100 from weixingzhang/texture
...
HIP Texture Support
2017-07-26 19:28:15 -05:00
sunway513
81bc82d166
Fix indentation, binary path and add wait on popen calls to hipEnvVar test
2017-07-24 17:16:13 +00:00
Maneesh Gupta
7a446dcbc5
Re-enable frexp(f) device math function
...
Change-Id: I53c022b8ddf38cd17ddb42eba457b9020db66395
2017-07-20 14:41:30 +05:30
Maneesh Gupta
9d5d4f9abb
[dt] hip_texture.h -> hip_texture_types.h
...
Change-Id: If106efa0a2e52105d4025706fdbc291c27352a74
2017-07-19 12:46:18 +05:30
Rahul Garg
039f62213b
Enable memcpytosymbol family functions test
2017-07-18 12:37:37 +05:30
Maneesh Gupta
b3a9e202b1
[dt] temporarily disable broken tests
...
Change-Id: I3ce23e410dc36e06d460263d768412f8368a6699
2017-07-18 11:17:02 +05:30
Wen-Heng (Jack) Chung
30ee797dd4
Remove explicit address spaces declarations
2017-07-07 12:46:29 -05:00
Ben Sander
7912e61502
Clean up old USE_* and RELEASE.md notes.
2017-06-23 18:05:30 -05:00
Rahul Garg
a9449533a1
Update P2P test for using memcpy and kernel tests
...
Change-Id: Ib0f8fc9425e6e85fd11d7d02395c52bc713dcb37
2017-06-12 11:53:25 +05:30
Maneesh Gupta
1c93d8592e
hipMemcpy-size test: reduce max size to make it work correctly on nvcc path
...
Change-Id: I9ce9f5a9e141ffd8ddf961269010b33358e02771
2017-06-07 15:25:54 +05:30
Maneesh Gupta
e6cafbf342
hipDeviceMemcpy test: make it functional on nvcc path
...
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b
2017-06-07 15:24:44 +05:30
Maneesh Gupta
1efb6ce994
p2p_copy_coherency test: gracefully handle single gpu case
...
Change-Id: I216663f67ef58c673136332635dab8b57079b909
2017-06-07 15:23:37 +05:30
Ben Sander
39c18e5e5f
Fix HIP_SYNC_NULL_STREAM=0 mode.
...
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow
2017-06-05 08:50:22 -05:00
Ben Sander
8ce6d17983
Update tests.
...
Fix some NVCC issues.
Add hipStreamSync2, record_event tests.
2017-06-04 20:18:37 -05:00
Ben Sander
d5c1616324
Update tests, add p2p coherency test.
2017-06-03 17:11:34 -05:00
Maneesh Gupta
502a74fcd6
Fix hipMemoryAllocate test for single GPU
...
Change-Id: If121c18ab490ba125dc689ffc08a8839fd280c38
2017-05-31 10:16:57 +05:30
Maneesh Gupta
5cdd1b2bf5
Disable rcbrtf, scalblnf, scalbnf in single precision device test
...
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7
2017-05-31 10:16:19 +05:30
Maneesh Gupta
4919863d3e
Disable rcbrt, scalbln and scalbn double precision device test
...
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d
2017-05-31 10:15:41 +05:30
Sandeep Kumar
830f2b100d
Add inline asm hip directed tests for v_add and v_mac
...
Change-Id: Ie5ace2e42d5da89b16e040537df2bb13d3883c6d
2017-05-31 09:25:40 +05:30
Ben Sander
ec7102f461
Add event controls for release fences.
...
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
(remove hipEventDisableSystemRelease)
Update test for new functionality.
2017-05-27 16:02:34 -05:00
Ben Sander
620eb30691
Cleanup hipEvent. (Intermediate checkpoint)
...
Support hipEventDisableSystemRelease flag.
Update test.
Remove stray printf
2017-05-27 16:02:34 -05:00
Ben Sander
be8d0ba644
Updates so hip compiles on CUDA.
2017-05-27 15:55:07 -05:00
Ben Sander
75f691ec2f
Add hipHostMallocCoherent, hipHostMallocNonCoherent
...
Provide per-allocation control over coherent/non-coherent mem.
These override the default HIP_COHERENT_HOST_ALLOC setting.
2017-05-24 00:48:10 -05:00
Ben Sander
236ce70e94
Expand test to cover copy followed by event sync
2017-05-23 23:15:45 -05:00
Ben Sander
2e1fec47ab
Make hipMultiThreadStreams1 test a little harsher.
...
Fail faster if synchronization rules are violated.
Run vectorAddRevers to read last elements of array first - if the
vector add kernel starts before preceding copy finishes we
will read stale data and flag the error.
Increase default array sizes so synchronization errors are more easily
exposed.
2017-05-16 19:04:25 -05:00
Ben Sander
427f8472aa
Fix HIP_TRACE_API so kernel launches are only printed when requested.
2017-05-16 19:04:25 -05:00
Ben Sander
27877f8854
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
2017-05-16 19:04:25 -05:00
Ben Sander
db097ab392
split debugging into separate .md file
2017-05-16 19:04:25 -05:00
Ben Sander
c3ccaa01e5
hipHostMalloc allocations are mapped to all devices by default.
...
Support hipHostMallocPortable flag.
Default flags are hipHostMallocPortable | hipHostMallocMapped.
Also:
-refactor tests to move addCount and addCountReverse into HipTest
namespace.
-test multi-GPU host memory.
2017-05-10 17:34:36 -05:00
Ben Sander
e0c3ea15b2
Fix hipStreamWaitEvent for single GPU.
2017-05-10 13:23:49 -05:00
Ben Sander
6437f5d2b2
Refactor hipHostRegister test.
...
Run all tests in one command.
Run 128 offsets.
2017-05-05 17:29:04 -05:00
Ben Sander
2d0f509de5
Update streamEventTest.
...
- add checks for events across devices.
- refactor test to make sure it runs long enough to be sensitive to sync
techniques.
- add tests for DeviceSync, streamWaitEvent.
2017-05-05 17:29:04 -05:00
wsttiger
2a253680da
Improve hipStreamWaitEvent test.
...
- use addOne kernel, use local initializer rather than init_array.
- use addOneReverse test to add from back of array. Test alternate fwd and backward to stress dependency logic.
- check device-side dependencies.
2017-05-05 17:29:04 -05:00
Aditya Atluri
fc6248ce82
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
2017-04-26 18:56:57 -05:00
Ben Sander
cd93a2f229
Add host offset checking
2017-04-24 22:30:54 -05:00
Ben Sander
b7b17c3841
Refactor hipHostRegister test.
...
- Add more testing for offsets.
- Parse cmdline options and use --tests.
2017-04-24 22:30:54 -05:00
Ben Sander
8ba993e36c
Refactor hipHostRegister to cover misaligned cases.
2017-04-24 22:30:54 -05:00
Ben Sander
ab410add5c
Add test for non-page-aligned mem copies.
2017-04-24 22:30:54 -05:00
Ben Sander
c5d89d9e7f
Add corrected test for offsets
2017-04-24 22:30:54 -05:00