Ben Sander
30233cd2e0
Add experimental notes around p2p
2016-04-18 23:52:42 -05:00
Ben Sander
3bcefdaa2d
Add USE_HCC_LOCK_API.
...
Default off, if set will compile assuming HCC lock APIs exist.
2016-04-18 23:50:52 -05:00
Ben Sander
e020d68309
Fixes for P2P and hipDeviceReset
...
- devicereset would lose track of default stream and thus subsequent
synchronization calls might not actually sychronize.
- Also deviceReset now correctly frees streams.
- fix waits in P2P staging copy - first phase (Device0-to-Staging) must
wait for second phase (Staging to Device1) to finish draining the
buffer.
2016-04-18 20:49:33 -05:00
Ben Sander
65abde6626
Move HIP_HCC define to CMake
2016-04-17 07:40:04 -05:00
Ben Sander
49cc5aec91
Merge branch 'privatestaging' into p2p
...
Conflicts:
include/hcc_detail/hip_hcc.h
src/hip_hcc.cpp
2016-04-17 06:46:52 -05:00
Aditya Atluri
846432a1c1
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-16 17:10:32 -05:00
Aditya Atluri
8dc1bdcbe6
Corrected Memcpydefault
2016-04-16 17:10:13 -05:00
Ben Sander
dcabc9dbf7
P2P Update.
...
- add P2P staging buffer copy.
- If copy device does not have sufficient access permissions, fall back
to staging buffer.
- improve docs for which copy device is used.
2016-04-16 10:18:56 -05:00
streamhsa
76f2670cef
Add __ldg
2016-04-16 21:12:09 +08:00
Maneesh Gupta
dc4c174a54
Bump min required hcc to workweek 16155
2016-04-16 14:49:10 +05:30
Aditya Atluri
a0aef11da7
changed to guard from hc.hpp
2016-04-15 11:21:45 -05:00
Aditya Atluri
2e80de6dcb
improved error returns for hipHostRegister/UnRegister functions
2016-04-15 10:42:31 -05:00
pensun
bdd26bd1f1
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:02:16 -05:00
Aditya Atluri
facc7a39be
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:17:30 -05:00
Aditya Atluri
573224e55f
Enabling make_* device code
2016-04-14 09:17:13 -05:00
pensun
234727c801
Add missing cudaMemsetAsync transformation
2016-04-14 09:02:02 -05:00
Maneesh Gupta
011a0335d4
Add hip soft-link to include for migrating to new include format
2016-04-14 16:37:27 +05:30
Jack Chung
210ba67b84
Add missing USE_PEER_TO_PEER macro definition
2016-04-13 13:56:26 +08:00
Aditya Atluri
4dfe27e8b5
Removed pragma once
2016-04-12 09:22:03 -05:00
Aditya Atluri
d5feabfa1c
added C guard to hip_runtime.h
2016-04-11 13:01:02 -05:00
Ben Sander
31dc13d2ec
Merge branch 'p2p' of https://github.com/AMDComputeLibraries/HIP-privatestaging into p2p
...
Conflicts:
RELEASE.md
include/hcc_detail/hip_hcc.h
samples/1_Utils/hipInfo/hipInfo.cpp
src/hip_hcc.cpp
src/hip_peer.cpp
2016-04-11 09:17:27 -05:00
Ben Sander
1f53c55d3e
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 12:52:18 -05:00
Ben Sander
b0529e04f1
Clean up disable.
...
Add USE_HCC_LOCK (disabled)
Disable USE_PEER_TO_PEER.
2016-04-11 09:09:36 -05:00
Ben Sander
83f0de7806
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander
7a11a2178d
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-11 07:58:58 -05:00
Ben Sander
813b063888
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-11 07:58:58 -05:00
Ben Sander
f2aa470f7f
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-11 07:58:58 -05:00
Ben Sander
69f2469cbb
Checkpoint initial peer2peer implementation.
2016-04-11 07:58:58 -05:00
Ben Sander
15b4b1f81f
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-07 14:28:42 -05:00
Aditya Atluri
41860cb90d
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-07 04:57:54 -05:00
Aditya Atluri
91d1974204
C proofing hip
2016-04-07 04:57:31 -05:00
Ben Sander
41f7317fb5
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-06 16:44:31 -05:00
Ben Sander
36926e6233
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-06 15:50:47 -05:00
Ben Sander
b02e9163ab
Checkpoint initial peer2peer implementation.
2016-04-06 15:50:47 -05:00
Aditya Atluri
1dcdfc93f1
Added feature for cross-compilation and seperated hcc api from hip api
2016-04-06 14:18:51 -05:00
Aditya Atluri
26985b746b
added device math functions
2016-03-31 12:29:19 -05:00
Aditya Atluri
72c72d8748
moved cpp specific device code from headers to source
2016-03-31 04:45:00 -05:00
Aditya Atluri
496da0aabd
C guarding the apis
2016-03-30 06:08:50 -05:00
Aditya Atluri
da1d9caf6b
Moved flag to hip_runtime.h
2016-03-30 03:32:39 -05:00
Aditya Atluri
39af5d9caf
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 11:18:09 -05:00
Ben Sander
f8a4483a55
Inline some new string functions.
2016-03-29 17:28:27 -05:00
Ben Sander
740b730cac
Add runtime switch to control HIP_ATP_MARKER
...
Only generate the function strings if requested at
compile-time && runtime.
2016-03-29 17:27:30 -05:00
Ben Sander
fb7862dba3
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 14:43:35 -05:00
Aditya Atluri
61cd2bb399
Calling intrinsics from global kernel
2016-03-29 11:17:55 -05:00
Ben Sander
756c0edf9b
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
include/hcc_detail/trace_helper.h
2016-03-29 05:56:03 -05:00
Aditya Atluri
4584fb4453
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 02:28:20 -05:00
Aditya Atluri
f01810183b
Not using classes inside headers
2016-03-29 02:27:51 -05:00
Ben Sander
8635863724
Tweak thread-safe implementation.
...
introduce LockedAccessor option so destructor does not unlock.
Allows locks to exist across function boundaries, required
for hipLaunchKernel macro which has several unusual requirements.
(including C comppatibility, must use variadic macro, more).
2016-03-28 21:41:47 -05:00
scchan
00ae9531a7
turn off ONE_OBJECT_FILE since we are building a static library
2016-03-28 12:56:15 -05:00
Ben Sander
3f18bab2c7
Stream thread-safe checkpoint.
...
Moving data structures to critical / protected section.
2016-03-28 09:46:40 -05:00