Ben Sander
723327cd0f
Describe how to file an issue
2016-03-23 01:15:05 -05:00
Ben Sander
2d0fade1f7
hipHostRegister and hipHostMalloc refactor.
...
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- the hipHost* is used for all HIP APIs dealing with Host memory.
(including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
memory. Enumerations hipHostMalloc* also used as optional
flags parm to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander
bf83b949f6
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-22 01:17:17 -05:00
Ben Sander
ac1eca47f7
Remove kind (aka direction) for copy commands.
...
This is always auto-detected from the src/dest location.
2016-03-21 05:35:36 -05:00
Aditya Atluri
ea352aba6b
suppressed warning in hipFreeHost
2016-03-20 15:31:59 -05:00
Aditya Atluri
f6b38b18b6
Added feature for --default-streams not working tests and hipcc
2016-03-20 08:08:33 -05:00
Ben Sander
80d708846a
Implement hipHostFree on HCC path
2016-03-19 23:25:11 -05:00
Ben Sander
6984f24d3d
fix nvcc compiler
...
- MallocHost and FreeHost deprecation.
- Change tests to call new hipHost* equivs.
- Add missing StreamSynchronize.
2016-03-19 04:20:15 -05:00
Ben Sander
03731020f1
Refactor copy - place common code in resolveMemoryKind.
2016-03-19 22:56:10 -05:00
Ben Sander
57365eb7a3
Deprecate hipMallocHost and hipFreeHost.
...
These will print compiler warnings if used, so we can weed them out
before removing.
Also add a default flags args for hipHostAlloc, in the C++ functioin
headers. So you can replace hipMallocHost(&ptr, size( with hipHostAlloc(&ptr, size)
2016-03-19 22:53:59 -05:00
Ben Sander
a88c2b1ec9
Refactor waitALlDevices and async mem copy.
...
- move waitAllStreams to device member function.
- create separate stream member function for copyAsync, like copySync.
hipMemcpyAsync now calls the copyAsync.
2016-03-19 05:42:19 -05:00
Ben Sander
4c6fd4e7ec
Fix bug: test was allocating host mem instead of device mem.
...
Caused assertion when checking free + allocated should
not exceed total. Bug introduced in hipHostAlloc conversion.
2016-03-19 04:11:39 -05:00
Ben Sander
aaa2429feb
Swap in corrected hipHostAlloc (bad merge)
2016-03-19 04:11:08 -05:00
Ben Sander
90ad8ddc5d
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander
efc9df8805
disable mt streams tests (for now)
2016-03-19 03:10:31 -05:00
Ben Sander
c39b0f9660
Describe HIP env vars
2016-03-19 03:09:57 -05:00
Ben Sander
44522eb607
Fix copy and sync bugs. Remove extra sync in default stream.
...
- NULL stream was waiting for itself to be empty before each command.
- Force "blocking" streams to wait for NULL to empty. This was missing
before.
- async copy was disabling itself via trueAsync=false for common cases.
Refactor:
- rename _null_stream to _default_stream.
- move some null sync function to defaultSync, move to dev member func.
2016-03-19 02:44:26 -05:00
Ben Sander
5197cf250d
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander
c5d0813f03
Improve formatting - line up cols
2016-03-18 23:43:04 -05:00
Ben Sander
3dc6906855
Print Pinned or Unpinned in result summary
2016-03-18 21:28:29 -05:00
Aditya Atluri
a17e390283
Update CUDA_Runtime_API_functions_supported_by_HIP.md
2016-03-18 11:28:06 -05:00
Aditya Atluri
564d13a0b3
Create CUDA_Runtime_API_functions_supported_by_HIP.md
2016-03-18 11:23:44 -05:00
Ben Sander
690486b9eb
Supported --aliged mode. Add results check for H2D and D2H.
2016-03-18 03:09:52 -05:00
Ben Sander
e64174f47a
Refactor copy code.
...
-Move staging buffer locks inside the staging buffer code.
-Remove dedicated per-device completion_signal + per-device lock -
instead allocated signal from the per-stream pool. This elimintes
the lock and allows more concurrency.
-remove switch HIP_DISABLE_BIDIR_MEMCPY
2016-03-18 03:02:00 -05:00
Ben Sander
3b45e064f9
Refactor staging buffer and sync copies.
...
- refactor staging buffer to operate on hsa* data structures not
hc::accelerator.
- use hsa_memory_allocate to allocate staging buffers rather than
am_alloc.
- Refactor device reset with single member function. Don't reallocate
staging buffers on reset.
- Properly track dependencies based on command type. Add new deps for
H2D and D2D rather than overloading H2D.
2016-03-17 20:09:10 -05:00
Ben Sander
1b7cc7d921
Refactor to isolate staging buffer code.
2016-03-17 00:20:56 -05:00
Ben Sander
a1879ba59b
Start separaration of staging_buffer.cpp code.
...
Still #include staging_buffer.cpp into hip_hcc.cpp.
Directed tests compile hip_hcc to static library and use the library.
2016-03-16 22:26:49 -05:00
Ben Sander
c9d46bdcde
Add aligned alloc
2016-03-16 21:55:57 -05:00
Ben Sander
8acb53e160
Checkpoint code cleanup.
...
- Refactor ihipStream in prep for thread-safe implementation.
- Do some work on PinInPlace implementation.
2016-03-16 21:16:29 -05:00
Aditya Atluri
4b96e8c789
changed flag in hipHostRegister
2016-03-16 08:01:53 -05:00
Aditya Atluri
6beed19460
src/ fixed hipHostAllocDefault flags
2016-03-16 07:32:54 -05:00
Aditya Atluri
9b78f0a454
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-16 07:17:22 -05:00
Aditya Atluri
7567a6c0ac
Added performance test for memcpy
2016-03-16 07:16:51 -05:00
Aditya Atluri
6baeb08e1c
tests/src [v4] Added feature for partial writes on CPU
2016-03-16 07:04:40 -05:00
Aditya Atluri
9b772c8553
test/src [v3] clean up
2016-03-16 05:30:29 -05:00
Aditya Atluri
8f527d2288
increased iteration size
2016-03-16 05:24:08 -05:00
Aditya Atluri
a60649868d
added cudaHostRegister test
2016-03-16 05:15:03 -05:00
Aditya Atluri
369d01086f
corrected first and second kernel dispatch
2016-03-15 14:22:00 -05:00
Aditya Atluri
a6441c3334
Added single kernel launch to sample
2016-03-15 21:05:15 -05:00
Aditya Atluri
b7bf5830c1
added performance metrics for kernel dispatch
2016-03-15 12:37:24 -05:00
Aditya Atluri
2743a5082f
v2 deprecating hipMallocHost with hipHostAlloc
2016-03-15 13:39:15 -05:00
Aditya Atluri
4d4d592e0c
Deprecating hipMallocHost to hipHostAlloc
2016-03-15 06:30:16 -05:00
Ben Sander
bdefb70862
print device config info
2016-03-14 23:02:49 -05:00
Ben Sander
15a8e8f8a0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
tests/src/CMakeLists.txt
2016-03-14 15:01:26 -05:00
Ben Sander
0d05517d0a
enable DB, comments
2016-03-14 14:40:41 -05:00
Ben Sander
87c15a3629
Add Bidir copy test and help.
2016-03-14 14:39:23 -05:00
Ben Sander
0aab91123b
refactor, add support for speccing xfers in bytes
2016-03-13 09:41:06 -05:00
Ben Sander
59f0ef2801
Remove ROCR_V2, assume=1
2016-03-12 04:08:13 -06:00
Ben Sander
de4bb0ca0e
Remove USE_AM_TRACKER (assume 1)
2016-03-12 04:04:57 -06:00
Ben Sander
ac272932f6
Improve error reporting.
...
use throw with error class.
fix bug when memcpyDefault resolved to D2D copy.
2016-03-12 04:02:04 -06:00