Aditya Atluri
3f5eb20cf0
Revert "fix nvcc for hipHostMalloc* flags."
...
This reverts commit b6962826eb .
2016-03-21 10:36:14 -05:00
Aditya Atluri
287ba34aca
Revert "fixed memory free apis"
...
This reverts commit 96a1899df7 .
2016-03-21 10:36:11 -05:00
Aditya Atluri
caa80af31b
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-21 10:34:08 -05:00
Aditya Atluri
96a1899df7
fixed memory free apis
2016-03-21 10:32:30 -05:00
Aditya Atluri
6d1420ab68
Disabling default-stream per-thread tests
2016-03-21 14:42:23 -05:00
Ben Sander
ab910efb96
hipHostRegister and hipHostMalloc refactor.
...
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- the hipHost* is used for all HIP APIs dealing with Host memory.
(including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
memory. Enumerations hipHostMalloc* also used as optional
flags parm to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander
8087bc0401
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-22 01:17:17 -05:00
Ben Sander
b6962826eb
fix nvcc for hipHostMalloc* flags.
2016-03-21 09:33:46 -05:00
Ben Sander
04af145d6a
Remove kind (aka direction) for copy commands.
...
This is always auto-detected from the src/dest location.
2016-03-21 05:35:36 -05:00
Aditya Atluri
6b3821afc7
suppressed warning in hipFreeHost
2016-03-20 15:31:59 -05:00
Aditya Atluri
a232d65db1
Added feature for --default-streams not working tests and hipcc
2016-03-20 08:08:33 -05:00
Ben Sander
deb38625ca
Implement hipHostFree on HCC path
2016-03-19 23:25:11 -05:00
Ben Sander
9941ba0bc6
fix nvcc compiler
...
- MallocHost and FreeHost deprecation.
- Change tests to call new hipHost* equivs.
- Add missing StreamSynchronize.
2016-03-19 04:20:15 -05:00
Ben Sander
194d02ac5a
Refactor copy - place common code in resolveMemoryKind.
2016-03-19 22:56:10 -05:00
Ben Sander
cea37c3e91
Deprecate hipMallocHost and hipFreeHost.
...
These will print compiler warnings if used, so we can weed them out
before removing.
Also add a default flags args for hipHostAlloc, in the C++ functioin
headers. So you can replace hipMallocHost(&ptr, size( with hipHostAlloc(&ptr, size)
2016-03-19 22:53:59 -05:00
Ben Sander
52cc2bb75a
Refactor waitALlDevices and async mem copy.
...
- move waitAllStreams to device member function.
- create separate stream member function for copyAsync, like copySync.
hipMemcpyAsync now calls the copyAsync.
2016-03-19 05:42:19 -05:00
Ben Sander
95e1c8eb32
Fix bug: test was allocating host mem instead of device mem.
...
Caused assertion when checking free + allocated should
not exceed total. Bug introduced in hipHostAlloc conversion.
2016-03-19 04:11:39 -05:00
Ben Sander
d898997c65
Swap in corrected hipHostAlloc (bad merge)
2016-03-19 04:11:08 -05:00
Ben Sander
1de63bfeea
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander
dfca4faf27
disable mt streams tests (for now)
2016-03-19 03:10:31 -05:00
Ben Sander
2dc7e9ccfa
Describe HIP env vars
2016-03-19 03:09:57 -05:00
Ben Sander
6472c8b915
Fix copy and sync bugs. Remove extra sync in default stream.
...
- NULL stream was waiting for itself to be empty before each command.
- Force "blocking" streams to wait for NULL to empty. This was missing
before.
- async copy was disabling itself via trueAsync=false for common cases.
Refactor:
- rename _null_stream to _default_stream.
- move some null sync function to defaultSync, move to dev member func.
2016-03-19 02:44:26 -05:00
Ben Sander
7ff5b16d2a
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander
85fce5f21e
Improve formatting - line up cols
2016-03-18 23:43:04 -05:00
Ben Sander
c2102847a4
Print Pinned or Unpinned in result summary
2016-03-18 21:28:29 -05:00
Aditya Atluri
27b6a620d9
Update CUDA_Runtime_API_functions_supported_by_HIP.md
2016-03-18 11:28:06 -05:00
Aditya Atluri
456f8b8455
Create CUDA_Runtime_API_functions_supported_by_HIP.md
2016-03-18 11:23:44 -05:00
Ben Sander
618556eaf9
Supported --aliged mode. Add results check for H2D and D2H.
2016-03-18 03:09:52 -05:00
Ben Sander
0af4d3623f
Refactor copy code.
...
-Move staging buffer locks inside the staging buffer code.
-Remove dedicated per-device completion_signal + per-device lock -
instead allocated signal from the per-stream pool. This elimintes
the lock and allows more concurrency.
-remove switch HIP_DISABLE_BIDIR_MEMCPY
2016-03-18 03:02:00 -05:00
Ben Sander
7d500599fa
Refactor staging buffer and sync copies.
...
- refactor staging buffer to operate on hsa* data structures not
hc::accelerator.
- use hsa_memory_allocate to allocate staging buffers rather than
am_alloc.
- Refactor device reset with single member function. Don't reallocate
staging buffers on reset.
- Properly track dependencies based on command type. Add new deps for
H2D and D2D rather than overloading H2D.
2016-03-17 20:09:10 -05:00
Ben Sander
e7586adb33
Refactor to isolate staging buffer code.
2016-03-17 00:20:56 -05:00
Ben Sander
28ee7aff71
Start separaration of staging_buffer.cpp code.
...
Still #include staging_buffer.cpp into hip_hcc.cpp.
Directed tests compile hip_hcc to static library and use the library.
2016-03-16 22:26:49 -05:00
Ben Sander
c02bccc153
Add aligned alloc
2016-03-16 21:55:57 -05:00
Ben Sander
ec50cbb3aa
Checkpoint code cleanup.
...
- Refactor ihipStream in prep for thread-safe implementation.
- Do some work on PinInPlace implementation.
2016-03-16 21:16:29 -05:00
Aditya Atluri
1e588e56e0
changed flag in hipHostRegister
2016-03-16 08:01:53 -05:00
Aditya Atluri
b768568902
src/ fixed hipHostAllocDefault flags
2016-03-16 07:32:54 -05:00
Aditya Atluri
57a85763bd
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-16 07:17:22 -05:00
Aditya Atluri
67d385f407
Added performance test for memcpy
2016-03-16 07:16:51 -05:00
Aditya Atluri
451cd9f876
tests/src [v4] Added feature for partial writes on CPU
2016-03-16 07:04:40 -05:00
Aditya Atluri
62b8351f02
test/src [v3] clean up
2016-03-16 05:30:29 -05:00
Aditya Atluri
89eb2a7b52
increased iteration size
2016-03-16 05:24:08 -05:00
Aditya Atluri
e67133ce21
added cudaHostRegister test
2016-03-16 05:15:03 -05:00
Aditya Atluri
e23bd0a23e
corrected first and second kernel dispatch
2016-03-15 14:22:00 -05:00
Aditya Atluri
862817626b
Added single kernel launch to sample
2016-03-15 21:05:15 -05:00
Aditya Atluri
31d8f60e56
added performance metrics for kernel dispatch
2016-03-15 12:37:24 -05:00
Aditya Atluri
58fa0524b6
v2 deprecating hipMallocHost with hipHostAlloc
2016-03-15 13:39:15 -05:00
Aditya Atluri
cbdc8c277c
Deprecating hipMallocHost to hipHostAlloc
2016-03-15 06:30:16 -05:00
Ben Sander
70c5f5e3f5
print device config info
2016-03-14 23:02:49 -05:00
Ben Sander
e1617b9604
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
tests/src/CMakeLists.txt
2016-03-14 15:01:26 -05:00
Ben Sander
1a27e5134e
enable DB, comments
2016-03-14 14:40:41 -05:00