pensun
b197cb30cb
Merge branch 'release_0.84.00' of https://github.com/AMDComputeLibraries/HIP-privatestaging into release_0.84.00
...
[ROCm/hip commit: 2ddade96d7 ]
2016-04-20 09:48:52 -05:00
pensun
e2327b1b18
update hipLaunchKernel API trace information
...
[ROCm/hip commit: 7f37fc4ec7 ]
2016-04-20 09:48:35 -05:00
Ben Sander
5858b85e8d
Fix hipDeviceReset synchronization
...
[ROCm/hip commit: 453615ed57 ]
2016-04-19 11:56:12 -05:00
Maneesh Gupta
b29742d6f8
Prefix HIP includes with hip/
...
[ROCm/hip commit: bc4f388b69 ]
2016-04-19 15:02:12 +05:30
Maneesh Gupta
44e5430da9
Bump min required hcc to workweek 16155
...
[ROCm/hip commit: dc4c174a54 ]
2016-04-16 14:49:10 +05:30
pensun
42df341237
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: bdd26bd1f1 ]
2016-04-14 09:02:16 -05:00
Aditya Atluri
8c347cd30a
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: facc7a39be ]
2016-04-14 09:17:30 -05:00
Aditya Atluri
3b82076adf
Enabling make_* device code
...
[ROCm/hip commit: 573224e55f ]
2016-04-14 09:17:13 -05:00
pensun
a708c400d3
Add missing cudaMemsetAsync transformation
...
[ROCm/hip commit: 234727c801 ]
2016-04-14 09:02:02 -05:00
Maneesh Gupta
3f0256a3b1
Add hip soft-link to include for migrating to new include format
...
[ROCm/hip commit: 011a0335d4 ]
2016-04-14 16:37:27 +05:30
Jack Chung
181b44df3f
Add missing USE_PEER_TO_PEER macro definition
...
[ROCm/hip commit: 210ba67b84 ]
2016-04-13 13:56:26 +08:00
Aditya Atluri
10bc7e683f
Removed pragma once
...
[ROCm/hip commit: 4dfe27e8b5 ]
2016-04-12 09:22:03 -05:00
Aditya Atluri
817cda91ff
added C guard to hip_runtime.h
...
[ROCm/hip commit: d5feabfa1c ]
2016-04-11 13:01:02 -05:00
Ben Sander
5060403b2a
Merge branch 'p2p' of https://github.com/AMDComputeLibraries/HIP-privatestaging into p2p
...
Conflicts:
RELEASE.md
include/hcc_detail/hip_hcc.h
samples/1_Utils/hipInfo/hipInfo.cpp
src/hip_hcc.cpp
src/hip_peer.cpp
[ROCm/hip commit: 31dc13d2ec ]
2016-04-11 09:17:27 -05:00
Ben Sander
7c92c9cc22
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-aware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
[ROCm/hip commit: 1f53c55d3e ]
2016-04-11 12:52:18 -05:00
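The peer-list rules listed in the P2P checkpoint entry above can be illustrated with a small host-side model. This is only a sketch under stated assumptions: `PeerTable` and all of its member names are hypothetical, it does not call any HIP or HCC API, and it assumes every other device is physically reachable.

```cpp
#include <cassert>
#include <set>

// Hypothetical model of per-device peer bookkeeping; not the HIP implementation.
class PeerTable {
public:
    // The access list always starts with the device itself.
    explicit PeerTable(int selfId) : selfId_(selfId) { peers_.insert(selfId); }

    // Self does not qualify as a peer. Assumption: all other devices are reachable.
    bool canAccessPeer(int other) const { return other != selfId_; }

    // Returns true if the peer was newly enabled.
    bool enablePeer(int other) {
        if (other == selfId_) return false;
        return peers_.insert(other).second;
    }

    // Self access is never removed from the list.
    bool disablePeer(int other) {
        if (other == selfId_) return false;
        return peers_.erase(other) > 0;
    }

    // A device reset drops old peer mappings but keeps self.
    void reset() {
        peers_.clear();
        peers_.insert(selfId_);
    }

    // The set that would be handed to an allow-access call.
    const std::set<int>& allowList() const { return peers_; }

private:
    int selfId_;
    std::set<int> peers_;
};
```

Keeping self in the set means the list passed on enable or disable can be used as-is, without a special case to re-add the owning device.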
Ben Sander
b174e8bba7
Clean up disable.
...
Add USE_HCC_LOCK (disabled)
Disable USE_PEER_TO_PEER.
[ROCm/hip commit: b0529e04f1 ]
2016-04-11 09:09:36 -05:00
Ben Sander
c964e3c75a
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-aware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
[ROCm/hip commit: 83f0de7806 ]
2016-04-11 07:58:59 -05:00
Ben Sander
216c9a0c16
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
...
[ROCm/hip commit: 7a11a2178d ]
2016-04-11 07:58:58 -05:00
Ben Sander
06e2dbd128
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
[ROCm/hip commit: 813b063888 ]
2016-04-11 07:58:58 -05:00
Ben Sander
ac11c83d76
P2P checkpoint.
...
Maintain enabled peer tables for each device.
[ROCm/hip commit: f2aa470f7f ]
2016-04-11 07:58:58 -05:00
Ben Sander
56d684e2fe
Checkpoint initial peer2peer implementation.
...
[ROCm/hip commit: 69f2469cbb ]
2016-04-11 07:58:58 -05:00
Ben Sander
c0d88c2f6d
Use HCC get_is_pool, add USE_PEER_TO_PEER level 2, 3.
...
[ROCm/hip commit: 15b4b1f81f ]
2016-04-07 14:28:42 -05:00
Aditya Atluri
7e9aa103d6
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 41860cb90d ]
2016-04-07 04:57:54 -05:00
Aditya Atluri
fc28918100
C proofing hip
...
[ROCm/hip commit: 91d1974204 ]
2016-04-07 04:57:31 -05:00
Ben Sander
84b5016a79
For P2P, use the peer list when allocating Device memory or pinned host.
...
Each new allocation is automatically mapped into the address space of
all enabled peers.
[ROCm/hip commit: 41f7317fb5 ]
2016-04-06 16:44:31 -05:00
Ben Sander
c5240bd079
P2P checkpoint.
...
Maintain enabled peer tables for each device.
[ROCm/hip commit: 36926e6233 ]
2016-04-06 15:50:47 -05:00
Ben Sander
8db4a6d46e
Checkpoint initial peer2peer implementation.
...
[ROCm/hip commit: b02e9163ab ]
2016-04-06 15:50:47 -05:00
Aditya Atluri
cf9cefcb6a
Added feature for cross-compilation and separated hcc api from hip api
...
[ROCm/hip commit: 1dcdfc93f1 ]
2016-04-06 14:18:51 -05:00
Aditya Atluri
70c1ac9285
added device math functions
...
[ROCm/hip commit: 26985b746b ]
2016-03-31 12:29:19 -05:00
Aditya Atluri
fe0f4e8bc2
moved cpp specific device code from headers to source
...
[ROCm/hip commit: 72c72d8748 ]
2016-03-31 04:45:00 -05:00
Aditya Atluri
afd8f2bcfc
C guarding the apis
...
[ROCm/hip commit: 496da0aabd ]
2016-03-30 06:08:50 -05:00
Aditya Atluri
3683e2a3bd
Moved flag to hip_runtime.h
...
[ROCm/hip commit: da1d9caf6b ]
2016-03-30 03:32:39 -05:00
Aditya Atluri
f052a2752a
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 39af5d9caf ]
2016-03-29 11:18:09 -05:00
Ben Sander
2543332f03
Inline some new string functions.
...
[ROCm/hip commit: f8a4483a55 ]
2016-03-29 17:28:27 -05:00
Ben Sander
24f82925bb
Add runtime switch to control HIP_ATP_MARKER
...
Only generate the function strings if requested at
compile-time && runtime.
[ROCm/hip commit: 740b730cac ]
2016-03-29 17:27:30 -05:00
Ben Sander
1f8e0e2e72
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: fb7862dba3 ]
2016-03-29 14:43:35 -05:00
Aditya Atluri
ffe185d731
Calling intrinsics from global kernel
...
[ROCm/hip commit: 61cd2bb399 ]
2016-03-29 11:17:55 -05:00
Ben Sander
9e50af9124
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
include/hcc_detail/trace_helper.h
[ROCm/hip commit: 756c0edf9b ]
2016-03-29 05:56:03 -05:00
Aditya Atluri
79fea63bb7
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 4584fb4453 ]
2016-03-29 02:28:20 -05:00
Aditya Atluri
811d6d52ea
Not using classes inside headers
...
[ROCm/hip commit: f01810183b ]
2016-03-29 02:27:51 -05:00
Ben Sander
630ef59d7b
Tweak thread-safe implementation.
...
Introduce LockedAccessor option so destructor does not unlock.
Allows locks to exist across function boundaries, required
for hipLaunchKernel macro which has several unusual requirements
(including C compatibility, must use variadic macro, more).
[ROCm/hip commit: 8635863724 ]
2016-03-28 21:41:47 -05:00
scchan
24f214e92a
turn off ONE_OBJECT_FILE since we are building a static library
...
[ROCm/hip commit: 00ae9531a7 ]
2016-03-28 12:56:15 -05:00
Ben Sander
9e3ac64c54
Stream thread-safe checkpoint.
...
Moving data structures to critical / protected section.
[ROCm/hip commit: 3f18bab2c7 ]
2016-03-28 09:46:40 -05:00
scchan
8afe22a3e9
add inline keyword to all the template functions
...
[ROCm/hip commit: 0087ee405e ]
2016-03-28 09:18:38 -05:00
Ben Sander
0f11beb328
fix ihipLogStatus so status arg only evaluated once
...
[ROCm/hip commit: 3aed9d3b19 ]
2016-03-28 05:23:53 -05:00
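The single-evaluation fix named in the entry above addresses a common macro pitfall: if a logging macro mentions its argument more than once, an argument with side effects runs more than once. A minimal sketch of the usual remedy follows. `LOG_STATUS`, `Status`, and `g_lastStatus` are hypothetical names chosen for illustration and are not taken from the HIP source.

```cpp
#include <cassert>
#include <cstdio>

// Hypothetical status type and last-error slot, for illustration only.
enum Status { kSuccess = 0, kErrorUnknown = 1 };

static int g_lastStatus = kSuccess;

// The argument is copied into a local exactly once; every later use reads
// the local, so side effects in the argument expression happen only once.
#define LOG_STATUS(expr)                                              \
    ([&]() -> Status {                                                \
        const Status localStatus = (expr);                            \
        g_lastStatus = localStatus;                                   \
        if (localStatus != kSuccess)                                  \
            std::fprintf(stderr, "status=%d\n", (int)localStatus);    \
        return localStatus;                                           \
    }())
```

The macro must be used inside a function body because it expands to a capturing lambda.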
Ben Sander
c9b5b99078
Stream thread-safe checkpoint.
...
[ROCm/hip commit: ec049e7f0b ]
2016-03-28 04:22:20 -05:00
Ben Sander
48773c6d4f
Improve debug print messages.
...
- Remove "call-to-call" for hipStreamCreate and hipEventCreate.
These now call internal functions rather than calling through
hipStreamCreateWithFlags and hipEventCreateWithFlags.
- Add HIP_INIT_API for more functions so they trace correctly.
- Use stream#DEVICE.STREAMID in debug messages via new specialization in
trace_helper.
[ROCm/hip commit: 7934cf620d ]
2016-03-26 12:35:04 -05:00
Ben Sander
6f88d31a1f
Protect _stream_id as well.
...
- move lockedaccessor
- clean up device class.
- add simple ihipDevice constructor.
[ROCm/hip commit: 395e7eab96 ]
2016-03-26 11:45:25 -05:00
Ben Sander
141ccad414
Make ihipDevice_t thread-safe.
...
Move critical data into a separate class and protect it with the LockAccessor
wrapper class.
For the device, the streams list is the critical data since it is modified when
streams are created or destroyed. The streams list is accessed in
several places, including when synchronizing across all streams on the
device (i.e. from the default stream).
Other device data is set once by the device constructor and is not critical.
All functions which acquire the LockAccessor are now named with a "locked_" prefix.
[ROCm/hip commit: 530ab9434a ]
2016-03-26 10:46:20 -05:00
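The locking scheme described in the "Make ihipDevice_t thread-safe" entry above can be sketched in plain C++. This is an illustration under stated assumptions, not the HIP code: `DeviceCriticalData`, `Device`, the member names, and the `autoUnlock` flag are hypothetical. The flag mirrors the option mentioned in the "Tweak thread-safe implementation" entry, where the destructor does not unlock.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical container for the data that needs protection (the streams list).
struct DeviceCriticalData {
    std::mutex mutex;
    std::vector<int> streams;  // stream ids, illustrative only
};

// Accessor that locks on construction and, by default, unlocks on destruction.
// With autoUnlock == false the lock outlives the accessor and the caller must
// release it later on the same thread.
template <typename T>
class LockedAccessor {
public:
    explicit LockedAccessor(T& data, bool autoUnlock = true)
        : data_(&data), autoUnlock_(autoUnlock) {
        data_->mutex.lock();
    }
    ~LockedAccessor() {
        if (autoUnlock_) data_->mutex.unlock();
    }
    LockedAccessor(const LockedAccessor&) = delete;
    LockedAccessor& operator=(const LockedAccessor&) = delete;

    // The protected data is reachable only through the accessor.
    T* operator->() { return data_; }

private:
    T* data_;
    bool autoUnlock_;
};

class Device {
public:
    // Functions that take the lock carry a "locked_" prefix.
    void locked_addStream(int id) {
        LockedAccessor<DeviceCriticalData> crit(criticalData_);
        crit->streams.push_back(id);
    }
    std::size_t locked_streamCount() {
        LockedAccessor<DeviceCriticalData> crit(criticalData_);
        return crit->streams.size();
    }

private:
    DeviceCriticalData criticalData_;
};
```

Because `std::mutex` is not recursive, a `locked_` function in this sketch must not call another `locked_` function on the same device, which is one reason a naming prefix is useful.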
pensun
fa114274ac
fix query of memoryClockRate and memoryBusWidth for both NV and HCC path
...
[ROCm/hip commit: 535de2ecc1 ]
2016-03-25 09:24:08 -05:00