Ben Sander
8a2bcf2da3
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: c300ffe458 ]
2016-02-26 06:15:09 -06:00
Ben Sander
2a650be661
Add PATH and LD_LIBRARY_FLAGS
...
[ROCm/hip commit: a30018e166 ]
2016-02-25 23:07:18 -06:00
Ben Sander
5ca4914e0e
Merge branch 'memtracker' into privatestaging
...
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: 4adab7b7ef ]
2016-02-25 19:38:46 -06:00
Ben Sander
22a3806c0c
Tweak version numbers
...
[ROCm/hip commit: b1da7e4a70 ]
2016-02-25 19:30:42 -06:00
Ben Sander
ae365b4ae2
Improve memory copy and commands switching
...
- Add chicken bits to use host-side dependency management.
- Add optional PinInPlace path for unpinned copies
- Synchronize before pinned memcpy path.
- Add mutex to protect two threads launching to same stream.
[ROCm/hip commit: 8b64c0dc62 ]
2016-02-25 19:19:49 -06:00
Aditya Avinash Atluri
6bb5485beb
Update hip.vim
...
Added d2d and h2h highlights
[ROCm/hip commit: 29c385ed72 ]
2016-02-25 14:49:58 -06:00
Evgeny Mankov
82900a1888
Attribute hipDeviceAttributeIsMultiGpuBoard for obtaining Device property isMultiGpuBoard is added.
...
On HIP path property obtaining done through hsa_iterate_agents and counting the devices of HSA_DEVICE_TYPE_GPU type.
P.S.
On multi-boards systems it might be problems with detection what board a GPU plugged into (not tested).
[ROCm/hip commit: 57e212606d ]
2016-02-25 23:44:39 +03:00
Aditya Avinash Atluri
55e5190b57
Update hip.vim
...
[ROCm/hip commit: 4851cedf6c ]
2016-02-23 13:40:20 -06:00
Ben Sander
1888acb5f3
Sync review.
...
- add calls to ihipInit missing from some routines.
- sync before draining a stream.
[ROCm/hip commit: 3886d494f4 ]
2016-02-23 04:07:11 -06:00
Ben Sander
a40adb296f
add hipLaunchParm
...
[ROCm/hip commit: 9c259bb86c ]
2016-02-23 04:05:41 -06:00
Ben Sander
f0c734e256
Improve async copy implementation.
...
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
- HIP_DISABLE_ENQ_BARRIER
- HIP_DISABLE_BIDIR_MEMCPY
- HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.
[ROCm/hip commit: 549b18ce77 ]
2016-02-22 23:15:24 -06:00
gargrahul
ccd1ed0a97
Update for shared atomics support
...
[ROCm/hip commit: a2fbf06129 ]
2016-02-22 16:21:52 +05:30
Maneesh Gupta
d40b8d8fb0
Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
...
[ROCm/hip commit: c5c11d3707 ]
2016-02-22 15:13:23 +05:30
Ben Sander
ebf2700936
Track last command to a stream.
...
Passing simple tests.
[ROCm/hip commit: d33d806a5b ]
2016-02-20 11:02:07 -06:00
streamhsa
0389b292a6
Resolve issues for hip_popc and hip_ballot on nvcc
...
[ROCm/hip commit: a7232c7f9e ]
2016-02-19 20:18:03 +08:00
Evgeny Mankov
d8df47383d
A support of the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth.
...
[ROCm/hip commit: b23f9776ca ]
2016-02-19 13:36:37 +03:00
Evgeny Mankov
c3a600c63b
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
[ROCm/hip commit: 833c9e52ad ]
2016-02-19 13:27:03 +03:00
Ben Sander
7f6001cb75
Describe how to update HTML docs
...
[ROCm/hip commit: d8cf65edac ]
2016-02-19 01:56:17 -06:00
Evgeny Mankov
68621c6702
Formatting, no functional changes.
...
[ROCm/hip commit: fbdeee39cd ]
2016-02-18 18:54:19 +03:00
Evgeny Mankov
4fcd9f2542
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
[ROCm/hip commit: 1c19dbb807 ]
2016-02-18 18:15:01 +03:00
Ben Sander
bd83583d40
Update release notes
...
[ROCm/hip commit: 617e7d8a7d ]
2016-02-18 21:07:14 -06:00
Ben Sander
f54ada2a14
Search multiple dirs.
...
[ROCm/hip commit: 311a728d25 ]
2016-02-18 21:07:14 -06:00
Ben Sander
6b6d71743a
Update doxygen HTML
...
[ROCm/hip commit: ac5957320e ]
2016-02-18 21:02:39 -06:00
Ben Sander
5cb2d5fc9a
Update doxygen HTML
...
[ROCm/hip commit: 129088992b ]
2016-02-18 20:43:03 -06:00
Evgeny Mankov
a0cc7134e3
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
[ROCm/hip commit: 5ea8543d2e ]
2016-02-18 17:25:28 +03:00
Evgeny Mankov
801ae992d6
hipInfo sample update with new Device Properties.
...
[ROCm/hip commit: 5b05a9fef1 ]
2016-02-18 15:08:55 +03:00
Evgeny Mankov
8c1a0d1924
Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
...
[ROCm/hip commit: 2b6fda77ca ]
2016-02-18 14:34:18 +03:00
Ben Sander
c989994613
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
[ROCm/hip commit: ba224011b0 ]
2016-02-17 23:06:51 -06:00
Ben Sander
4e0ba06044
Enable Tracker and ROCR by default, verify with HCC
...
[ROCm/hip commit: c6f8883b0d ]
2016-02-17 23:03:37 -06:00
Ben Sander
a2d8f9d98e
Remove HIP-local AM tracker (now in HCC)
...
[ROCm/hip commit: d653782d9d ]
2016-02-17 21:33:32 -06:00
Ben Sander
5ab45e9be7
USE_AM_TRACKER=0 works
...
[ROCm/hip commit: 44f40e171a ]
2016-02-17 21:23:36 -06:00
Ben Sander
16c1d446cf
Tweak full formatting
...
[ROCm/hip commit: dffe573d49 ]
2016-02-17 21:22:31 -06:00
Ben Sander
21b30996cf
Support HSA_PATH env, async path tweak
...
[ROCm/hip commit: f466b291d8 ]
2016-02-17 21:22:07 -06:00
Ben Sander
bb8a1fe72d
more work on async copies
...
[ROCm/hip commit: 59379ffb44 ]
2016-02-17 00:59:12 -06:00
Ben Sander
512163b889
Add per-stream pool for hsa_signals.
...
[ROCm/hip commit: caef9b5ced ]
2016-02-16 01:59:13 -06:00
Ben Sander
787078f0b3
Add comments to tests
...
[ROCm/hip commit: fb883d9d03 ]
2016-02-16 01:58:24 -06:00
Ben Sander
9ab5b92173
Update before checkin to HCC.
...
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.
[ROCm/hip commit: 38c735fd1d ]
2016-02-15 21:16:00 -06:00
Ben Sander
d58eab1706
Move warpSize to header, have shuffles use default warpsize.
...
[ROCm/hip commit: db3a63360b ]
2016-02-15 05:41:09 -06:00
Ben Sander
79a99f48f9
Remove old include path.
...
[ROCm/hip commit: 57274850f9 ]
2016-02-15 05:40:37 -06:00
Ben Sander
b97e430921
Update docs, cleanup
...
[ROCm/hip commit: 4637e19da4 ]
2016-02-15 05:40:12 -06:00
Ben Sander
9ccdbdca71
Fix tests to account for multi-gpu
...
[ROCm/hip commit: 3ecd1b1539 ]
2016-02-15 05:19:52 -06:00
Evgeny Mankov
189e652a5e
Formatting, no functional changes.
...
[ROCm/hip commit: 072d649d8d ]
2016-02-15 13:16:05 +03:00
Ben Sander
9e60e519d4
Add multi-threading synchonization on staging buffers and signals.
...
Also pre-allocate a couple signals for copies.
[ROCm/hip commit: 6420655dc8 ]
2016-02-13 03:18:01 -06:00
Ben Sander
565d0fb968
Enable -O3, style points on array size
...
[ROCm/hip commit: 928996fec7 ]
2016-02-13 03:17:42 -06:00
Ben Sander
2d468c6ce2
D2H multi-buffer
...
[ROCm/hip commit: b314777bc1 ]
2016-02-13 01:15:23 -06:00
Ben Sander
7a633dc4b8
Result formatting
...
[ROCm/hip commit: c3720c19a8 ]
2016-02-13 01:14:01 -06:00
Ben Sander
2498ca10d1
Add D2H test
...
[ROCm/hip commit: bcb5953d6e ]
2016-02-12 22:47:26 -06:00
Ben Sander
05e9d398f4
Add D2H test
...
[ROCm/hip commit: 559db057d5 ]
2016-02-12 22:46:34 -06:00
Ben Sander
b21c2ecfa4
Add Bus Bandwidth test, leveraged from SHOC.
...
[ROCm/hip commit: f3fd6476eb ]
2016-02-12 21:30:43 -06:00
Ben Sander
a835134f8e
Improve copy testing
...
[ROCm/hip commit: 1bfd3cdbd0 ]
2016-02-12 18:24:08 -06:00