Graf commitů

248 Commity

Autor SHA1 Zpráva Datum
Ben Sander b520a34579 Refactor copy - place common code in resolveMemoryKind.
[ROCm/hip commit: 194d02ac5a]
2016-03-19 22:56:10 -05:00
Ben Sander 4c77ecef9a Deprecate hipMallocHost and hipFreeHost.
These will print compiler warnings if used, so we can weed them out
before removing.

Also add a default flags args for hipHostAlloc, in the C++ functioin
headers.  So you can replace hipMallocHost(&ptr, size( with hipHostAlloc(&ptr, size)


[ROCm/hip commit: cea37c3e91]
2016-03-19 22:53:59 -05:00
Ben Sander 36d2a024c0 Refactor waitALlDevices and async mem copy.
- move waitAllStreams to device member function.
- create separate stream member function for copyAsync, like copySync.
  hipMemcpyAsync now calls the copyAsync.


[ROCm/hip commit: 52cc2bb75a]
2016-03-19 05:42:19 -05:00
Ben Sander 11603764be Fix bug: test was allocating host mem instead of device mem.
Caused assertion when checking free + allocated should
not exceed total.  Bug introduced in hipHostAlloc conversion.


[ROCm/hip commit: 95e1c8eb32]
2016-03-19 04:11:39 -05:00
Ben Sander 16df30e767 Swap in corrected hipHostAlloc (bad merge)
[ROCm/hip commit: d898997c65]
2016-03-19 04:11:08 -05:00
Ben Sander 5297376252 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp


[ROCm/hip commit: 1de63bfeea]
2016-03-19 03:22:09 -05:00
Ben Sander 013f08418f disable mt streams tests (for now)
[ROCm/hip commit: dfca4faf27]
2016-03-19 03:10:31 -05:00
Ben Sander 8204d20644 Describe HIP env vars
[ROCm/hip commit: 2dc7e9ccfa]
2016-03-19 03:09:57 -05:00
Ben Sander 662cd0bb30 Fix copy and sync bugs. Remove extra sync in default stream.
- NULL stream was waiting for itself to be empty before each command.
- Force "blocking" streams to wait for NULL to empty.  This was missing
  before.
- async copy was disabling itself via trueAsync=false for common cases.

Refactor:
- rename _null_stream to _default_stream.
- move some null sync function to defaultSync, move to dev member func.


[ROCm/hip commit: 6472c8b915]
2016-03-19 02:44:26 -05:00
Ben Sander b6de799f47 Add beastperiteration and onesize for testing.
onesize allows running tests at one specific size.


[ROCm/hip commit: 7ff5b16d2a]
2016-03-19 02:43:04 -05:00
Ben Sander 79d0d442d3 Improve formatting - line up cols
[ROCm/hip commit: 85fce5f21e]
2016-03-18 23:43:04 -05:00
Ben Sander 5996a5c0a1 Print Pinned or Unpinned in result summary
[ROCm/hip commit: c2102847a4]
2016-03-18 21:28:29 -05:00
Aditya Atluri 2dd079da40 Update CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/hip commit: 27b6a620d9]
2016-03-18 11:28:06 -05:00
Aditya Atluri cc3c700d00 Create CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/hip commit: 456f8b8455]
2016-03-18 11:23:44 -05:00
Ben Sander 8a47efb22a Supported --aliged mode. Add results check for H2D and D2H.
[ROCm/hip commit: 618556eaf9]
2016-03-18 03:09:52 -05:00
Ben Sander 0134651419 Refactor copy code.
-Move staging buffer locks inside the staging buffer code.
-Remove dedicated per-device completion_signal + per-device lock -
instead allocated signal from the per-stream pool.   This elimintes
the lock and allows more concurrency.
-remove switch HIP_DISABLE_BIDIR_MEMCPY


[ROCm/hip commit: 0af4d3623f]
2016-03-18 03:02:00 -05:00
Ben Sander 3320975a80 Refactor staging buffer and sync copies.
- refactor staging buffer to operate on hsa* data structures not
  hc::accelerator.
- use hsa_memory_allocate to allocate staging buffers rather than
  am_alloc.
- Refactor device reset with single member function.  Don't reallocate
  staging buffers on reset.
- Properly track dependencies based on command type.  Add new deps for
  H2D and D2D rather than overloading H2D.


[ROCm/hip commit: 7d500599fa]
2016-03-17 20:09:10 -05:00
Ben Sander fc27c61c58 Refactor to isolate staging buffer code.
[ROCm/hip commit: e7586adb33]
2016-03-17 00:20:56 -05:00
Ben Sander 0ae7bc7e14 Start separaration of staging_buffer.cpp code.
Still #include staging_buffer.cpp into hip_hcc.cpp.
Directed tests compile hip_hcc to static library and use the library.


[ROCm/hip commit: 28ee7aff71]
2016-03-16 22:26:49 -05:00
Ben Sander 910017a20f Add aligned alloc
[ROCm/hip commit: c02bccc153]
2016-03-16 21:55:57 -05:00
Ben Sander 0b617788a8 Checkpoint code cleanup.
- Refactor ihipStream in prep for thread-safe implementation.
- Do some work on PinInPlace implementation.


[ROCm/hip commit: ec50cbb3aa]
2016-03-16 21:16:29 -05:00
Aditya Atluri cef9188d14 changed flag in hipHostRegister
[ROCm/hip commit: 1e588e56e0]
2016-03-16 08:01:53 -05:00
Aditya Atluri 9b39bdf3e0 src/ fixed hipHostAllocDefault flags
[ROCm/hip commit: b768568902]
2016-03-16 07:32:54 -05:00
Aditya Atluri 7dd54d20ca Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 57a85763bd]
2016-03-16 07:17:22 -05:00
Aditya Atluri e35d72336c Added performance test for memcpy
[ROCm/hip commit: 67d385f407]
2016-03-16 07:16:51 -05:00
Aditya Atluri 60ce389329 tests/src [v4] Added feature for partial writes on CPU
[ROCm/hip commit: 451cd9f876]
2016-03-16 07:04:40 -05:00
Aditya Atluri 18651d1879 test/src [v3] clean up
[ROCm/hip commit: 62b8351f02]
2016-03-16 05:30:29 -05:00
Aditya Atluri e3bbf4e8c9 increased iteration size
[ROCm/hip commit: 89eb2a7b52]
2016-03-16 05:24:08 -05:00
Aditya Atluri ec921786f7 added cudaHostRegister test
[ROCm/hip commit: e67133ce21]
2016-03-16 05:15:03 -05:00
Aditya Atluri 637fca1cb6 corrected first and second kernel dispatch
[ROCm/hip commit: e23bd0a23e]
2016-03-15 14:22:00 -05:00
Aditya Atluri 2223b41232 Added single kernel launch to sample
[ROCm/hip commit: 862817626b]
2016-03-15 21:05:15 -05:00
Aditya Atluri b202017891 added performance metrics for kernel dispatch
[ROCm/hip commit: 31d8f60e56]
2016-03-15 12:37:24 -05:00
Aditya Atluri 83cafe7d94 v2 deprecating hipMallocHost with hipHostAlloc
[ROCm/hip commit: 58fa0524b6]
2016-03-15 13:39:15 -05:00
Aditya Atluri 4c54a8d9d5 Deprecating hipMallocHost to hipHostAlloc
[ROCm/hip commit: cbdc8c277c]
2016-03-15 06:30:16 -05:00
Ben Sander bef8dc7bcc print device config info
[ROCm/hip commit: 70c5f5e3f5]
2016-03-14 23:02:49 -05:00
Ben Sander 0abf5db89e Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp
	tests/src/CMakeLists.txt


[ROCm/hip commit: e1617b9604]
2016-03-14 15:01:26 -05:00
Ben Sander 5951d581d9 enable DB, comments
[ROCm/hip commit: 1a27e5134e]
2016-03-14 14:40:41 -05:00
Ben Sander 10ca8ed91d Add Bidir copy test and help.
[ROCm/hip commit: 5606bee076]
2016-03-14 14:39:23 -05:00
Ben Sander 3427873d7c refactor, add support for speccing xfers in bytes
[ROCm/hip commit: ac6ed35ba0]
2016-03-13 09:41:06 -05:00
Ben Sander e08baf41d7 Remove ROCR_V2, assume=1
[ROCm/hip commit: b0dca6bd0c]
2016-03-12 04:08:13 -06:00
Ben Sander 30fbe45694 Remove USE_AM_TRACKER (assume 1)
[ROCm/hip commit: a5a2c7f33c]
2016-03-12 04:04:57 -06:00
Ben Sander 4900ebb39f Improve error reporting.
use throw with error class.
fix bug when memcpyDefault resolved to D2D copy.


[ROCm/hip commit: 250739666d]
2016-03-12 04:02:04 -06:00
Aditya Atluri db1ce3ba84 Added hipHostRegister for hip with tests and added copyright
[ROCm/hip commit: 102f173396]
2016-03-08 12:57:22 -06:00
Aditya Atluri 963c0f6076 Added a random test for memcpyAsync
[ROCm/hip commit: cbb42c6b6a]
2016-03-08 03:40:56 -06:00
Aditya Atluri ab68dc4240 v2 added support for unpinned async memcpy
[ROCm/hip commit: 18c0e69f51]
2016-03-08 03:19:50 -06:00
Aditya Atluri bf3d3a9745 Revert "Fixed passing unpinned memory to async memcpy"
This reverts commit c272c53808.


[ROCm/hip commit: 45965aaa2d]
2016-03-08 03:17:27 -06:00
Aditya Atluri 7f8a1a3e17 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 81824926be]
2016-03-08 03:15:33 -06:00
Aditya Atluri c272c53808 Fixed passing unpinned memory to async memcpy
[ROCm/hip commit: 216af71480]
2016-03-08 03:14:49 -06:00
Ben Sander 6d77b45f88 Refactor memccpy tests.
- Add "hipMemcpy_simple" - some simple smoke tests.
- Run memcpy 3 times w/ different switches for kinds, sizes,
  multi-thread.


[ROCm/hip commit: cc7517b57e]
2016-03-07 17:15:48 -06:00
Aditya Atluri c05f4abd71 Added hipHostRegister flags
[ROCm/hip commit: d9429dd4ec]
2016-03-07 10:52:40 -06:00