Ben Sander
b520a34579
Refactor copy - place common code in resolveMemoryKind.
...
[ROCm/hip commit: 194d02ac5a ]
2016-03-19 22:56:10 -05:00
Ben Sander
4c77ecef9a
Deprecate hipMallocHost and hipFreeHost.
...
These will print compiler warnings if used, so we can weed them out
before removing.
Also add a default flags args for hipHostAlloc, in the C++ functioin
headers. So you can replace hipMallocHost(&ptr, size( with hipHostAlloc(&ptr, size)
[ROCm/hip commit: cea37c3e91 ]
2016-03-19 22:53:59 -05:00
Ben Sander
36d2a024c0
Refactor waitALlDevices and async mem copy.
...
- move waitAllStreams to device member function.
- create separate stream member function for copyAsync, like copySync.
hipMemcpyAsync now calls the copyAsync.
[ROCm/hip commit: 52cc2bb75a ]
2016-03-19 05:42:19 -05:00
Ben Sander
11603764be
Fix bug: test was allocating host mem instead of device mem.
...
Caused assertion when checking free + allocated should
not exceed total. Bug introduced in hipHostAlloc conversion.
[ROCm/hip commit: 95e1c8eb32 ]
2016-03-19 04:11:39 -05:00
Ben Sander
16df30e767
Swap in corrected hipHostAlloc (bad merge)
...
[ROCm/hip commit: d898997c65 ]
2016-03-19 04:11:08 -05:00
Ben Sander
5297376252
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: 1de63bfeea ]
2016-03-19 03:22:09 -05:00
Ben Sander
013f08418f
disable mt streams tests (for now)
...
[ROCm/hip commit: dfca4faf27 ]
2016-03-19 03:10:31 -05:00
Ben Sander
8204d20644
Describe HIP env vars
...
[ROCm/hip commit: 2dc7e9ccfa ]
2016-03-19 03:09:57 -05:00
Ben Sander
662cd0bb30
Fix copy and sync bugs. Remove extra sync in default stream.
...
- NULL stream was waiting for itself to be empty before each command.
- Force "blocking" streams to wait for NULL to empty. This was missing
before.
- async copy was disabling itself via trueAsync=false for common cases.
Refactor:
- rename _null_stream to _default_stream.
- move some null sync function to defaultSync, move to dev member func.
[ROCm/hip commit: 6472c8b915 ]
2016-03-19 02:44:26 -05:00
Ben Sander
b6de799f47
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
[ROCm/hip commit: 7ff5b16d2a ]
2016-03-19 02:43:04 -05:00
Ben Sander
79d0d442d3
Improve formatting - line up cols
...
[ROCm/hip commit: 85fce5f21e ]
2016-03-18 23:43:04 -05:00
Ben Sander
5996a5c0a1
Print Pinned or Unpinned in result summary
...
[ROCm/hip commit: c2102847a4 ]
2016-03-18 21:28:29 -05:00
Aditya Atluri
2dd079da40
Update CUDA_Runtime_API_functions_supported_by_HIP.md
...
[ROCm/hip commit: 27b6a620d9 ]
2016-03-18 11:28:06 -05:00
Aditya Atluri
cc3c700d00
Create CUDA_Runtime_API_functions_supported_by_HIP.md
...
[ROCm/hip commit: 456f8b8455 ]
2016-03-18 11:23:44 -05:00
Ben Sander
8a47efb22a
Supported --aliged mode. Add results check for H2D and D2H.
...
[ROCm/hip commit: 618556eaf9 ]
2016-03-18 03:09:52 -05:00
Ben Sander
0134651419
Refactor copy code.
...
-Move staging buffer locks inside the staging buffer code.
-Remove dedicated per-device completion_signal + per-device lock -
instead allocated signal from the per-stream pool. This elimintes
the lock and allows more concurrency.
-remove switch HIP_DISABLE_BIDIR_MEMCPY
[ROCm/hip commit: 0af4d3623f ]
2016-03-18 03:02:00 -05:00
Ben Sander
3320975a80
Refactor staging buffer and sync copies.
...
- refactor staging buffer to operate on hsa* data structures not
hc::accelerator.
- use hsa_memory_allocate to allocate staging buffers rather than
am_alloc.
- Refactor device reset with single member function. Don't reallocate
staging buffers on reset.
- Properly track dependencies based on command type. Add new deps for
H2D and D2D rather than overloading H2D.
[ROCm/hip commit: 7d500599fa ]
2016-03-17 20:09:10 -05:00
Ben Sander
fc27c61c58
Refactor to isolate staging buffer code.
...
[ROCm/hip commit: e7586adb33 ]
2016-03-17 00:20:56 -05:00
Ben Sander
0ae7bc7e14
Start separaration of staging_buffer.cpp code.
...
Still #include staging_buffer.cpp into hip_hcc.cpp.
Directed tests compile hip_hcc to static library and use the library.
[ROCm/hip commit: 28ee7aff71 ]
2016-03-16 22:26:49 -05:00
Ben Sander
910017a20f
Add aligned alloc
...
[ROCm/hip commit: c02bccc153 ]
2016-03-16 21:55:57 -05:00
Ben Sander
0b617788a8
Checkpoint code cleanup.
...
- Refactor ihipStream in prep for thread-safe implementation.
- Do some work on PinInPlace implementation.
[ROCm/hip commit: ec50cbb3aa ]
2016-03-16 21:16:29 -05:00
Aditya Atluri
cef9188d14
changed flag in hipHostRegister
...
[ROCm/hip commit: 1e588e56e0 ]
2016-03-16 08:01:53 -05:00
Aditya Atluri
9b39bdf3e0
src/ fixed hipHostAllocDefault flags
...
[ROCm/hip commit: b768568902 ]
2016-03-16 07:32:54 -05:00
Aditya Atluri
7dd54d20ca
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 57a85763bd ]
2016-03-16 07:17:22 -05:00
Aditya Atluri
e35d72336c
Added performance test for memcpy
...
[ROCm/hip commit: 67d385f407 ]
2016-03-16 07:16:51 -05:00
Aditya Atluri
60ce389329
tests/src [v4] Added feature for partial writes on CPU
...
[ROCm/hip commit: 451cd9f876 ]
2016-03-16 07:04:40 -05:00
Aditya Atluri
18651d1879
test/src [v3] clean up
...
[ROCm/hip commit: 62b8351f02 ]
2016-03-16 05:30:29 -05:00
Aditya Atluri
e3bbf4e8c9
increased iteration size
...
[ROCm/hip commit: 89eb2a7b52 ]
2016-03-16 05:24:08 -05:00
Aditya Atluri
ec921786f7
added cudaHostRegister test
...
[ROCm/hip commit: e67133ce21 ]
2016-03-16 05:15:03 -05:00
Aditya Atluri
637fca1cb6
corrected first and second kernel dispatch
...
[ROCm/hip commit: e23bd0a23e ]
2016-03-15 14:22:00 -05:00
Aditya Atluri
2223b41232
Added single kernel launch to sample
...
[ROCm/hip commit: 862817626b ]
2016-03-15 21:05:15 -05:00
Aditya Atluri
b202017891
added performance metrics for kernel dispatch
...
[ROCm/hip commit: 31d8f60e56 ]
2016-03-15 12:37:24 -05:00
Aditya Atluri
83cafe7d94
v2 deprecating hipMallocHost with hipHostAlloc
...
[ROCm/hip commit: 58fa0524b6 ]
2016-03-15 13:39:15 -05:00
Aditya Atluri
4c54a8d9d5
Deprecating hipMallocHost to hipHostAlloc
...
[ROCm/hip commit: cbdc8c277c ]
2016-03-15 06:30:16 -05:00
Ben Sander
bef8dc7bcc
print device config info
...
[ROCm/hip commit: 70c5f5e3f5 ]
2016-03-14 23:02:49 -05:00
Ben Sander
0abf5db89e
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
tests/src/CMakeLists.txt
[ROCm/hip commit: e1617b9604 ]
2016-03-14 15:01:26 -05:00
Ben Sander
5951d581d9
enable DB, comments
...
[ROCm/hip commit: 1a27e5134e ]
2016-03-14 14:40:41 -05:00
Ben Sander
10ca8ed91d
Add Bidir copy test and help.
...
[ROCm/hip commit: 5606bee076 ]
2016-03-14 14:39:23 -05:00
Ben Sander
3427873d7c
refactor, add support for speccing xfers in bytes
...
[ROCm/hip commit: ac6ed35ba0 ]
2016-03-13 09:41:06 -05:00
Ben Sander
e08baf41d7
Remove ROCR_V2, assume=1
...
[ROCm/hip commit: b0dca6bd0c ]
2016-03-12 04:08:13 -06:00
Ben Sander
30fbe45694
Remove USE_AM_TRACKER (assume 1)
...
[ROCm/hip commit: a5a2c7f33c ]
2016-03-12 04:04:57 -06:00
Ben Sander
4900ebb39f
Improve error reporting.
...
use throw with error class.
fix bug when memcpyDefault resolved to D2D copy.
[ROCm/hip commit: 250739666d ]
2016-03-12 04:02:04 -06:00
Aditya Atluri
db1ce3ba84
Added hipHostRegister for hip with tests and added copyright
...
[ROCm/hip commit: 102f173396 ]
2016-03-08 12:57:22 -06:00
Aditya Atluri
963c0f6076
Added a random test for memcpyAsync
...
[ROCm/hip commit: cbb42c6b6a ]
2016-03-08 03:40:56 -06:00
Aditya Atluri
ab68dc4240
v2 added support for unpinned async memcpy
...
[ROCm/hip commit: 18c0e69f51 ]
2016-03-08 03:19:50 -06:00
Aditya Atluri
bf3d3a9745
Revert "Fixed passing unpinned memory to async memcpy"
...
This reverts commit c272c53808 .
[ROCm/hip commit: 45965aaa2d ]
2016-03-08 03:17:27 -06:00
Aditya Atluri
7f8a1a3e17
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 81824926be ]
2016-03-08 03:15:33 -06:00
Aditya Atluri
c272c53808
Fixed passing unpinned memory to async memcpy
...
[ROCm/hip commit: 216af71480 ]
2016-03-08 03:14:49 -06:00
Ben Sander
6d77b45f88
Refactor memccpy tests.
...
- Add "hipMemcpy_simple" - some simple smoke tests.
- Run memcpy 3 times w/ different switches for kinds, sizes,
multi-thread.
[ROCm/hip commit: cc7517b57e ]
2016-03-07 17:15:48 -06:00
Aditya Atluri
c05f4abd71
Added hipHostRegister flags
...
[ROCm/hip commit: d9429dd4ec ]
2016-03-07 10:52:40 -06:00