Commit Graph

414 Commits

Author SHA1 Message Date
Ben Sander 882dab4536 Refactor hipStreamWaitEvent
- Null streams use same flow as non-null.
- Add HIP_SYNC_STREAM_WAIT
- Resolve null stream.
2017-08-31 03:00:14 +00:00
Rahul Garg 4b98572f47 Enable hipStreamSync2 and record_event dtests 2017-08-24 15:51:58 +05:30
Rahul Garg 9441f3a50e PCI Bus ID related dtests 2017-08-22 15:41:16 +05:30
Maneesh Gupta 4bcb3372b5 [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan 9a04c5cdd4 add C++11 compilation flags and minor bug fixes 2017-08-10 17:58:36 +00:00
Maneesh Gupta f14e31d9d7 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta 69e431f035 [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Rahul Garg 2f617bac6d Disable failing test p2p_copy_coherency 2017-08-02 14:47:13 +05:30
Siu Chi Chan 96e79f97e2 implement __threadfence_system 2017-08-02 08:50:18 +00:00
Ben Sander 9cede50942 Merge pull request #110 from sunway513/fix_hipEnvVar_test
Fix hipEnvVar test
2017-07-27 13:03:43 -05:00
sunway513 ed0d6cf745 Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test 2017-07-27 17:41:49 +00:00
Ben Sander 4f93b99f23 Merge pull request #100 from weixingzhang/texture
HIP Texture Support
2017-07-26 19:28:15 -05:00
sunway513 81bc82d166 Fix indentation, binary path and add wait on popen calls to hipEnvVar test 2017-07-24 17:16:13 +00:00
Maneesh Gupta 7a446dcbc5 Renable frexp(f) device math function
Change-Id: I53c022b8ddf38cd17ddb42eba457b9020db66395
2017-07-20 14:41:30 +05:30
Maneesh Gupta 9d5d4f9abb [dt] hip_texture.h -> hip_texture_types.h
Change-Id: If106efa0a2e52105d4025706fdbc291c27352a74
2017-07-19 12:46:18 +05:30
Rahul Garg 039f62213b Enable memcpytosymbol family functions test 2017-07-18 12:37:37 +05:30
Maneesh Gupta b3a9e202b1 [dt] temporarily disable broken tests
Change-Id: I3ce23e410dc36e06d460263d768412f8368a6699
2017-07-18 11:17:02 +05:30
Wen-Heng (Jack) Chung 30ee797dd4 Remove explicit address spaces declarations 2017-07-07 12:46:29 -05:00
Ben Sander 7912e61502 Clean up old USE_* and RELEASE.md notes. 2017-06-23 18:05:30 -05:00
Rahul Garg a9449533a1 Update P2P test for using memcpy and kernel tests
Change-Id: Ib0f8fc9425e6e85fd11d7d02395c52bc713dcb37
2017-06-12 11:53:25 +05:30
Maneesh Gupta 1c93d8592e hipMemcpy-size test: reduce max size to make it work correctly on nvcc path
Change-Id: I9ce9f5a9e141ffd8ddf961269010b33358e02771
2017-06-07 15:25:54 +05:30
Maneesh Gupta e6cafbf342 hipDeviceMemcpy test: make it functional on nvcc path
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b
2017-06-07 15:24:44 +05:30
Maneesh Gupta 1efb6ce994 p2p_copy_coherency test: gracefully handle single gpu case
Change-Id: I216663f67ef58c673136332635dab8b57079b909
2017-06-07 15:23:37 +05:30
Ben Sander 39c18e5e5f Fix HIP_SYNC_NULL_STREAM=0 mode.
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow
2017-06-05 08:50:22 -05:00
Ben Sander 8ce6d17983 Update tests.
Fix some NVCC issues.
Add hipStreamSync2, record_event tests.
2017-06-04 20:18:37 -05:00
Ben Sander d5c1616324 Update tests, add p2p coherency test. 2017-06-03 17:11:34 -05:00
Maneesh Gupta 502a74fcd6 Fix hipMemoryAllocate test for single GPU
Change-Id: If121c18ab490ba125dc689ffc08a8839fd280c38
2017-05-31 10:16:57 +05:30
Maneesh Gupta 5cdd1b2bf5 Disable rcbrtf, scalblnf, scalbnf in single precision device test
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7
2017-05-31 10:16:19 +05:30
Maneesh Gupta 4919863d3e Disable rcbrt, scalbln and scalbn double precision device test
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d
2017-05-31 10:15:41 +05:30
Sandeep Kumar 830f2b100d Add inline asm hip directed tests for v_add and v_mac
Change-Id: Ie5ace2e42d5da89b16e040537df2bb13d3883c6d
2017-05-31 09:25:40 +05:30
Ben Sander ec7102f461 Add event controls for release fences.
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
   (remove hipEventDisableSystemRelease)

Update test for new functionality.
2017-05-27 16:02:34 -05:00
Ben Sander 620eb30691 Cleanup hipEvent. (Intermediate checkpoint)
Support hipEventDisableSystemRelease flag.
Update test.
Remove stray printf
2017-05-27 16:02:34 -05:00
Ben Sander be8d0ba644 Updates so hip compiles on CUDA. 2017-05-27 15:55:07 -05:00
Ben Sander 75f691ec2f Add hipHostMallocCoherent, hipHostMallocNonCoherent
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.
2017-05-24 00:48:10 -05:00
Ben Sander 236ce70e94 Expand test to cover copy followed by event sync 2017-05-23 23:15:45 -05:00
Ben Sander 2e1fec47ab Make hipMultiThreadStreams1 test a little harsher.
Fail faster if synchronization rules are violated.
Run vectorAddRevers to read last elements of array first - if the
vector add kernel starts before preceding copy finishes we
will read stale data and flag the error.

Increase default array sizes, so synchronization errors more easily
exposed.
2017-05-16 19:04:25 -05:00
Ben Sander 427f8472aa Fix HIP_TRACE_API so kernel launch only printed when requested. 2017-05-16 19:04:25 -05:00
Ben Sander 27877f8854 Add initial HIP_SYNC_NULL_STREAM=0 mode.
This eliminates host-synchronization for null stream.  Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.

Add enhanced null-stream test.

Also refine HIP_TRACE_API.
2017-05-16 19:04:25 -05:00
Ben Sander db097ab392 split debugging into separate .md file 2017-05-16 19:04:25 -05:00
Ben Sander c3ccaa01e5 hipHostMalloc allocation are mapped to all devices by default.
Support hipHostMallocPortable flag.
Default flags are hipHostMallocPortable | hipHostMallocMapped.

Also:
-refactor tests to move addCount and addCountReverse into HipTest
namespace.
-test multi-GPU host memory.
2017-05-10 17:34:36 -05:00
Ben Sander e0c3ea15b2 Fix hipStreamWaitEvent for single GPU. 2017-05-10 13:23:49 -05:00
Ben Sander 6437f5d2b2 Refactor hipHostRegister test.
Run all tests in one command.
Run 128 offsets.
2017-05-05 17:29:04 -05:00
Ben Sander 2d0f509de5 Update streamEventTEst.
- add checks for events across devices.
- refactor test to make sure it runs long enough to sensitive sync
  techniques.
- add tests for DeviceSync, streamWaitEvent.
2017-05-05 17:29:04 -05:00
wsttiger 2a253680da Improve hipStreamWaitEvent test.
- use addOne kernel, use local initializer rather than init_array.
- use addOneReverse test to add from back of array.  Test alternate fwd and backward to stress dependency logic.
- check device-side dependencies.
2017-05-05 17:29:04 -05:00
Aditya Atluri fc6248ce82 added hipFuncSetCacheConfig API for nvcc path
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
2017-04-26 18:56:57 -05:00
Ben Sander cd93a2f229 Add host offset checking 2017-04-24 22:30:54 -05:00
Ben Sander b7b17c3841 Refactor hipHostRegister test.
- Add more testing for offsets.
- Parse cmdline options and use --tests.
2017-04-24 22:30:54 -05:00
Ben Sander 8ba993e36c Refactor hipHostRegister to cover misaligned cases. 2017-04-24 22:30:54 -05:00
Ben Sander ab410add5c Add test for non-page-aligned mem copies. 2017-04-24 22:30:54 -05:00
Ben Sander c5d89d9e7f Add corrected test for offsets 2017-04-24 22:30:54 -05:00