bwicakso
5552301de2
Merge remote-tracking branch 'refs/remotes/origin/privatestaging' into kernel_synchronization
2016-04-25 14:43:28 -05:00
bwicakso
a849321bd9
Update with original
2016-04-25 14:42:35 -05:00
bwicakso
4aca1babe8
Merge remote-tracking branch 'refs/remotes/origin/privatestaging' into kernel_synchronization
2016-04-25 13:57:28 -05:00
bwicakso
7175ca8824
Merge from original
2016-04-25 13:55:44 -05:00
Aditya Atluri
c404f60621
Fixed git spills
2016-04-22 09:19:05 -05:00
Aditya Atluri
a2bc6ebc9f
added workweek to hip_ldg
2016-04-22 09:14:26 -05:00
Maneesh Gupta
02e6fc27f4
Merge branch 'release_0.84.00' into privatestaging
...
Conflicts:
include/hcc_detail/hip_runtime.h
src/hip_hcc.cpp
2016-04-22 10:55:58 +05:30
Aditya Atluri
48f3762160
reorg make_datatype feature
2016-04-21 14:51:19 -05:00
Aditya Atluri
3aac6d45ad
added full data type support for __ldg
2016-04-21 11:17:26 -05:00
bwicakso
6773a64b22
Fix for kernel synchronization
...
The completion future of a particular kernel is lost if there are
multiple kernels in the stream. This can cause a racing condition where
the signal associated with the unreferenced completion_future might get
released by hcc runtime.
2016-04-20 15:51:39 -05:00
Aditya Atluri
f46e1bcf0e
added copyright for ldg
2016-04-20 12:28:02 -05:00
Aditya Atluri
b493eac7e0
added support for __ldg
2016-04-20 12:25:40 -05:00
pensun
517008f98b
update API trace information for hipLaunchKernel
2016-04-20 09:57:55 -05:00
pensun
f0b0bc672a
Merge branch 'release_0.84.00' of https://github.com/AMDComputeLibraries/HIP-privatestaging into release_0.84.00
2016-04-20 09:48:52 -05:00
pensun
f8ad557864
update hipLaunchKernel API trace information
2016-04-20 09:48:35 -05:00
Ben Sander
ccf2c1c323
Fix hipDeviceReset synchronization
2016-04-19 11:56:12 -05:00
Maneesh Gupta
c4cc76398c
Prefix HIP includes with hip/
2016-04-19 15:02:12 +05:30
Ben Sander
21d733b2e7
Add experimental notes around p2p
2016-04-18 23:52:42 -05:00
Ben Sander
9329fb8c92
Add USE_HCC_LOCK_API.
...
Default off, if set will compile assuming HCC lock APIs exist.
2016-04-18 23:50:52 -05:00
Ben Sander
6abfa13c34
Fixes for P2P and hipDeviceReset
...
- devicereset would lose track of default stream and thus subsequent
synchronization calls might not actually sychronize.
- Also deviceReset now correctly frees streams.
- fix waits in P2P staging copy - first phase (Device0-to-Staging) must
wait for second phase (Staging to Device1) to finish draining the
buffer.
2016-04-18 20:49:33 -05:00
Ben Sander
f9a31e28ad
Move HIP_HCC define to CMake
2016-04-17 07:40:04 -05:00
Ben Sander
8d26dfcde3
Merge branch 'privatestaging' into p2p
...
Conflicts:
include/hcc_detail/hip_hcc.h
src/hip_hcc.cpp
2016-04-17 06:46:52 -05:00
Aditya Atluri
95e9de0946
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-16 17:10:32 -05:00
Aditya Atluri
dc61929a3d
Corrected Memcpydefault
2016-04-16 17:10:13 -05:00
Ben Sander
c3bd85595d
P2P Update.
...
- add P2P staging buffer copy.
- If copy device does not have sufficient access permissions, fall back
to staging buffer.
- improve docs for which copy device is used.
2016-04-16 10:18:56 -05:00
streamhsa
0426564a7b
Add __ldg
2016-04-16 21:12:09 +08:00
Maneesh Gupta
1cd52f5192
Bump min required hcc to workweek 16155
2016-04-16 14:49:10 +05:30
Aditya Atluri
e8902a6e20
improved error returns for hipHostRegister/UnRegister functions
2016-04-15 10:42:31 -05:00
pensun
1ab9b0ac71
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:02:16 -05:00
Aditya Atluri
c3b854fba4
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-14 09:17:30 -05:00
Aditya Atluri
12e24f4816
Enabling make_* device code
2016-04-14 09:17:13 -05:00
pensun
596e8e4e4e
Add missing cudaMemsetAsync transformation
2016-04-14 09:02:02 -05:00
Maneesh Gupta
0dfb9cddad
Add hip soft-link to include for migrating to new include format
2016-04-14 16:37:27 +05:30
Jack Chung
fe0e5f3541
Add missing USE_PEER_TO_PEER macro definition
2016-04-13 13:56:26 +08:00
Aditya Atluri
3b79b86a96
Removed pragma once
2016-04-12 09:22:03 -05:00
Aditya Atluri
42df534354
added C guard to hip_runtime.h
2016-04-11 13:01:02 -05:00
Ben Sander
1cc0ea86a1
Merge branch 'p2p' of https://github.com/AMDComputeLibraries/HIP-privatestaging into p2p
...
Conflicts:
RELEASE.md
include/hcc_detail/hip_hcc.h
samples/1_Utils/hipInfo/hipInfo.cpp
src/hip_hcc.cpp
src/hip_peer.cpp
2016-04-11 09:17:27 -05:00
Ben Sander
5af4c901c6
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 12:52:18 -05:00
Ben Sander
efffb0ed86
Clean up disable.
...
Add USE_HCC_LOCK (disabled)
Disable USE_PEER_TO_PEER.
2016-04-11 09:09:36 -05:00
Ben Sander
9e7efd7c65
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander
52ac414b40
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-11 07:58:58 -05:00
Ben Sander
97772d6363
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-11 07:58:58 -05:00
Ben Sander
e2d19d7f7a
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-11 07:58:58 -05:00
Ben Sander
4400875dda
Checkpoint initial peer2peer implementation.
2016-04-11 07:58:58 -05:00
Ben Sander
bae62e02dc
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
2016-04-07 14:28:42 -05:00
Aditya Atluri
b8fed74f62
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-04-07 04:57:54 -05:00
Aditya Atluri
c006359510
C proofing hip
2016-04-07 04:57:31 -05:00
Ben Sander
288682ccb3
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
2016-04-06 16:44:31 -05:00
Ben Sander
6a182ce788
P2P checkpoint.
...
Maintain enabled peer tables for each device.
2016-04-06 15:50:47 -05:00
Ben Sander
db91890f53
Checkpoint initial peer2peer implementation.
2016-04-06 15:50:47 -05:00