Wykres commitów

261 Commity

Autor SHA1 Wiadomość Data
Aditya Atluri 3f5eb20cf0 Revert "fix nvcc for hipHostMalloc* flags."
This reverts commit b6962826eb.
2016-03-21 10:36:14 -05:00
Aditya Atluri 287ba34aca Revert "fixed memory free apis"
This reverts commit 96a1899df7.
2016-03-21 10:36:11 -05:00
Aditya Atluri caa80af31b Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-03-21 10:34:08 -05:00
Aditya Atluri 96a1899df7 fixed memory free apis 2016-03-21 10:32:30 -05:00
Aditya Atluri 6d1420ab68 Disabling default-stream per-thread tests 2016-03-21 14:42:23 -05:00
Ben Sander ab910efb96 hipHostRegister and hipHostMalloc refactor.
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
 -  the hipHost* is used for all HIP APIs dealing with Host memory.
    (including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
  - hipMallocHost is consistent with "hipMalloc" for allocating device
    memory.  Enumerations hipHostMalloc* also used as optional
    flags parm to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander 8087bc0401 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-03-22 01:17:17 -05:00
Ben Sander b6962826eb fix nvcc for hipHostMalloc* flags. 2016-03-21 09:33:46 -05:00
Ben Sander 04af145d6a Remove kind (aka direction) for copy commands.
This is always auto-detected from the src/dest location.
2016-03-21 05:35:36 -05:00
Aditya Atluri 6b3821afc7 suppressed warning in hipFreeHost 2016-03-20 15:31:59 -05:00
Aditya Atluri a232d65db1 Added feature for --default-streams not working tests and hipcc 2016-03-20 08:08:33 -05:00
Ben Sander deb38625ca Implement hipHostFree on HCC path 2016-03-19 23:25:11 -05:00
Ben Sander 9941ba0bc6 fix nvcc compiler
- MallocHost and FreeHost deprecation.
- Change tests to call new hipHost* equivs.
- Add missing StreamSynchronize.
2016-03-19 04:20:15 -05:00
Ben Sander 194d02ac5a Refactor copy - place common code in resolveMemoryKind. 2016-03-19 22:56:10 -05:00
Ben Sander cea37c3e91 Deprecate hipMallocHost and hipFreeHost.
These will print compiler warnings if used, so we can weed them out
before removing.

Also add a default flags args for hipHostAlloc, in the C++ functioin
headers.  So you can replace hipMallocHost(&ptr, size( with hipHostAlloc(&ptr, size)
2016-03-19 22:53:59 -05:00
Ben Sander 52cc2bb75a Refactor waitALlDevices and async mem copy.
- move waitAllStreams to device member function.
- create separate stream member function for copyAsync, like copySync.
  hipMemcpyAsync now calls the copyAsync.
2016-03-19 05:42:19 -05:00
Ben Sander 95e1c8eb32 Fix bug: test was allocating host mem instead of device mem.
Caused assertion when checking free + allocated should
not exceed total.  Bug introduced in hipHostAlloc conversion.
2016-03-19 04:11:39 -05:00
Ben Sander d898997c65 Swap in corrected hipHostAlloc (bad merge) 2016-03-19 04:11:08 -05:00
Ben Sander 1de63bfeea Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander dfca4faf27 disable mt streams tests (for now) 2016-03-19 03:10:31 -05:00
Ben Sander 2dc7e9ccfa Describe HIP env vars 2016-03-19 03:09:57 -05:00
Ben Sander 6472c8b915 Fix copy and sync bugs. Remove extra sync in default stream.
- NULL stream was waiting for itself to be empty before each command.
- Force "blocking" streams to wait for NULL to empty.  This was missing
  before.
- async copy was disabling itself via trueAsync=false for common cases.

Refactor:
- rename _null_stream to _default_stream.
- move some null sync function to defaultSync, move to dev member func.
2016-03-19 02:44:26 -05:00
Ben Sander 7ff5b16d2a Add beastperiteration and onesize for testing.
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander 85fce5f21e Improve formatting - line up cols 2016-03-18 23:43:04 -05:00
Ben Sander c2102847a4 Print Pinned or Unpinned in result summary 2016-03-18 21:28:29 -05:00
Aditya Atluri 27b6a620d9 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2016-03-18 11:28:06 -05:00
Aditya Atluri 456f8b8455 Create CUDA_Runtime_API_functions_supported_by_HIP.md 2016-03-18 11:23:44 -05:00
Ben Sander 618556eaf9 Supported --aliged mode. Add results check for H2D and D2H. 2016-03-18 03:09:52 -05:00
Ben Sander 0af4d3623f Refactor copy code.
-Move staging buffer locks inside the staging buffer code.
-Remove dedicated per-device completion_signal + per-device lock -
instead allocated signal from the per-stream pool.   This elimintes
the lock and allows more concurrency.
-remove switch HIP_DISABLE_BIDIR_MEMCPY
2016-03-18 03:02:00 -05:00
Ben Sander 7d500599fa Refactor staging buffer and sync copies.
- refactor staging buffer to operate on hsa* data structures not
  hc::accelerator.
- use hsa_memory_allocate to allocate staging buffers rather than
  am_alloc.
- Refactor device reset with single member function.  Don't reallocate
  staging buffers on reset.
- Properly track dependencies based on command type.  Add new deps for
  H2D and D2D rather than overloading H2D.
2016-03-17 20:09:10 -05:00
Ben Sander e7586adb33 Refactor to isolate staging buffer code. 2016-03-17 00:20:56 -05:00
Ben Sander 28ee7aff71 Start separaration of staging_buffer.cpp code.
Still #include staging_buffer.cpp into hip_hcc.cpp.
Directed tests compile hip_hcc to static library and use the library.
2016-03-16 22:26:49 -05:00
Ben Sander c02bccc153 Add aligned alloc 2016-03-16 21:55:57 -05:00
Ben Sander ec50cbb3aa Checkpoint code cleanup.
- Refactor ihipStream in prep for thread-safe implementation.
- Do some work on PinInPlace implementation.
2016-03-16 21:16:29 -05:00
Aditya Atluri 1e588e56e0 changed flag in hipHostRegister 2016-03-16 08:01:53 -05:00
Aditya Atluri b768568902 src/ fixed hipHostAllocDefault flags 2016-03-16 07:32:54 -05:00
Aditya Atluri 57a85763bd Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-03-16 07:17:22 -05:00
Aditya Atluri 67d385f407 Added performance test for memcpy 2016-03-16 07:16:51 -05:00
Aditya Atluri 451cd9f876 tests/src [v4] Added feature for partial writes on CPU 2016-03-16 07:04:40 -05:00
Aditya Atluri 62b8351f02 test/src [v3] clean up 2016-03-16 05:30:29 -05:00
Aditya Atluri 89eb2a7b52 increased iteration size 2016-03-16 05:24:08 -05:00
Aditya Atluri e67133ce21 added cudaHostRegister test 2016-03-16 05:15:03 -05:00
Aditya Atluri e23bd0a23e corrected first and second kernel dispatch 2016-03-15 14:22:00 -05:00
Aditya Atluri 862817626b Added single kernel launch to sample 2016-03-15 21:05:15 -05:00
Aditya Atluri 31d8f60e56 added performance metrics for kernel dispatch 2016-03-15 12:37:24 -05:00
Aditya Atluri 58fa0524b6 v2 deprecating hipMallocHost with hipHostAlloc 2016-03-15 13:39:15 -05:00
Aditya Atluri cbdc8c277c Deprecating hipMallocHost to hipHostAlloc 2016-03-15 06:30:16 -05:00
Ben Sander 70c5f5e3f5 print device config info 2016-03-14 23:02:49 -05:00
Ben Sander e1617b9604 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp
	tests/src/CMakeLists.txt
2016-03-14 15:01:26 -05:00
Ben Sander 1a27e5134e enable DB, comments 2016-03-14 14:40:41 -05:00