Ben Sander
21d733b2e7
Add experimental notes around p2p
2016-04-18 23:52:42 -05:00
Ben Sander
9329fb8c92
Add USE_HCC_LOCK_API.
...
Default off, if set will compile assuming HCC lock APIs exist.
2016-04-18 23:50:52 -05:00
Ben Sander
6abfa13c34
Fixes for P2P and hipDeviceReset
...
- devicereset would lose track of default stream and thus subsequent
synchronization calls might not actually sychronize.
- Also deviceReset now correctly frees streams.
- fix waits in P2P staging copy - first phase (Device0-to-Staging) must
wait for second phase (Staging to Device1) to finish draining the
buffer.
2016-04-18 20:49:33 -05:00
Ben Sander
f9a31e28ad
Move HIP_HCC define to CMake
2016-04-17 07:40:04 -05:00
Ben Sander
8d26dfcde3
Merge branch 'privatestaging' into p2p
...
Conflicts:
include/hcc_detail/hip_hcc.h
src/hip_hcc.cpp
2016-04-17 06:46:52 -05:00
Aditya Atluri
95e9de0946
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-16 17:10:32 -05:00
Aditya Atluri
dc61929a3d
Corrected Memcpydefault
2016-04-16 17:10:13 -05:00
Ben Sander
c3bd85595d
P2P Update.
...
- add P2P staging buffer copy.
- If copy device does not have sufficient access permissions, fall back
to staging buffer.
- improve docs for which copy device is used.
2016-04-16 10:18:56 -05:00
streamhsa
0426564a7b
Add __ldg
2016-04-16 21:12:09 +08:00
Aditya Atluri
e8902a6e20
improved error returns for hipHostRegister/UnRegister functions
2016-04-15 10:42:31 -05:00
pensun
1ab9b0ac71
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:02:16 -05:00
Aditya Atluri
c3b854fba4
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:17:30 -05:00
Aditya Atluri
12e24f4816
Enabling make_* device code
2016-04-14 09:17:13 -05:00
pensun
596e8e4e4e
Add missing cudaMemsetAsync transformation
2016-04-14 09:02:02 -05:00
Maneesh Gupta
0dfb9cddad
Add hip soft-link to include for migrating to new include format
2016-04-14 16:37:27 +05:30
Jack Chung
fe0e5f3541
Add missing USE_PEER_TO_PEER macro definition
2016-04-13 13:56:26 +08:00
Aditya Atluri
3b79b86a96
Removed pragma once
2016-04-12 09:22:03 -05:00
Aditya Atluri
42df534354
added C guard to hip_runtime.h
2016-04-11 13:01:02 -05:00
Ben Sander
1cc0ea86a1
Merge branch 'p2p' of https://github.com/AMDComputeLibraries/HIP-privatestaging into p2p
...
Conflicts:
RELEASE.md
include/hcc_detail/hip_hcc.h
samples/1_Utils/hipInfo/hipInfo.cpp
src/hip_hcc.cpp
src/hip_peer.cpp
2016-04-11 09:17:27 -05:00
Ben Sander
5af4c901c6
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 12:52:18 -05:00
Ben Sander
efffb0ed86
Clean up disable.
...
Add USE_HCC_LOCK (disabled)
Disable USE_PEER_TO_PEER.
2016-04-11 09:09:36 -05:00
Ben Sander
9e7efd7c65
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander
52ac414b40
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-11 07:58:58 -05:00
Ben Sander
97772d6363
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-11 07:58:58 -05:00
Ben Sander
e2d19d7f7a
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-11 07:58:58 -05:00
Ben Sander
4400875dda
Checkpoint initial peer2peer implementation.
2016-04-11 07:58:58 -05:00
Ben Sander
bae62e02dc
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-07 14:28:42 -05:00
Aditya Atluri
b8fed74f62
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-07 04:57:54 -05:00
Aditya Atluri
c006359510
C proofing hip
2016-04-07 04:57:31 -05:00
Ben Sander
288682ccb3
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-06 16:44:31 -05:00
Ben Sander
6a182ce788
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-06 15:50:47 -05:00
Ben Sander
db91890f53
Checkpoint initial peer2peer implementation.
2016-04-06 15:50:47 -05:00
Aditya Atluri
39f27984a8
Added feature for cross-compilation and seperated hcc api from hip api
2016-04-06 14:18:51 -05:00
Aditya Atluri
c2d04ba59c
added device math functions
2016-03-31 12:29:19 -05:00
Aditya Atluri
48b8b426d5
moved cpp specific device code from headers to source
2016-03-31 04:45:00 -05:00
Aditya Atluri
c4d93cf3db
C guarding the apis
2016-03-30 06:08:50 -05:00
Aditya Atluri
e476d4e8f0
Moved flag to hip_runtime.h
2016-03-30 03:32:39 -05:00
Aditya Atluri
00bf37d28d
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 11:18:09 -05:00
Ben Sander
e7b15a53da
Inline some new string functions.
2016-03-29 17:28:27 -05:00
Ben Sander
e22925be22
Add runtime switch to control HIP_ATP_MARKER
...
Only generate the function strings if requested at
compile-time && runtime.
2016-03-29 17:27:30 -05:00
Ben Sander
03c0fcbfec
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 14:43:35 -05:00
Aditya Atluri
3b3db60c25
Calling intrinsics from global kernel
2016-03-29 11:17:55 -05:00
Ben Sander
ba114a238f
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
include/hcc_detail/trace_helper.h
2016-03-29 05:56:03 -05:00
Aditya Atluri
73ecbf5c95
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-03-29 02:28:20 -05:00
Aditya Atluri
8e437d0565
Not using classes inside headers
2016-03-29 02:27:51 -05:00
Ben Sander
1b2ab173c1
Tweak thread-safe implementation.
...
introduce LockedAccessor option so destructor does not unlock.
Allows locks to exist across function boundaries, required
for hipLaunchKernel macro which has several unusual requirements.
(including C comppatibility, must use variadic macro, more).
2016-03-28 21:41:47 -05:00
scchan
3aa542e21b
turn off ONE_OBJECT_FILE since we are building a static library
2016-03-28 12:56:15 -05:00
Ben Sander
6cab7862ae
Stream thread-safe checkpoint.
...
Moving data structures to critical / protected section.
2016-03-28 09:46:40 -05:00
scchan
5afa5d23fc
add inline keyword to all the template functions
2016-03-28 09:18:38 -05:00
Ben Sander
4f8786c66a
fix ihipLogStatus so status arg only evaluated once
2016-03-28 05:23:53 -05:00