Aditya Avinash Atluri
4851cedf6c
Update hip.vim
2016-02-23 13:40:20 -06:00
Ben Sander
7090f5c3f9
Add tests for multi-threaded streams
2016-02-23 12:08:22 -06:00
Ben Sander
3886d494f4
Sync review.
...
- add calls to ihipInit missing from some routines.
- sync before draining a stream.
2016-02-23 04:07:11 -06:00
Ben Sander
9c259bb86c
add hipLaunchParm
2016-02-23 04:05:41 -06:00
Ben Sander
549b18ce77
Improve async copy implementation.
...
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
- HIP_DISABLE_ENQ_BARRIER
- HIP_DISABLE_BIDIR_MEMCPY
- HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.
2016-02-22 23:15:24 -06:00
Ben Sander
c2d66a48a7
Fix memcpy for Titan. Add <threads> to common includes
2016-02-22 15:09:23 -06:00
Ben Sander
0a98db4b5f
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
2016-02-22 08:33:47 -06:00
gargrahul
a2fbf06129
Update for shared atomics support
2016-02-22 16:21:52 +05:30
Maneesh Gupta
c5c11d3707
Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
2016-02-22 15:13:23 +05:30
Ben Sander
d33d806a5b
Track last command to a stream.
...
Passing simple tests.
2016-02-20 11:02:07 -06:00
atimofee
4b13cfc460
Added CUDA names replacement in string literals (i.e. error messages)
2016-02-19 21:58:33 +03:00
streamhsa
a7232c7f9e
Resolve issues for hip_popc and hip_ballot on nvcc
2016-02-19 20:18:03 +08:00
Evgeny Mankov
b23f9776ca
A support of the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth.
2016-02-19 13:36:37 +03:00
Evgeny Mankov
833c9e52ad
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
Ben Sander
d8cf65edac
Describe how to update HTML docs
2016-02-19 01:56:17 -06:00
dfukalov
f3dfd07fa5
fix build bug with current clang/llvm
2016-02-18 23:16:52 +03:00
Evgeny Mankov
fbdeee39cd
Formatting, no functional changes.
2016-02-18 18:54:19 +03:00
Evgeny Mankov
1c19dbb807
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Ben Sander
617e7d8a7d
Update release notes
2016-02-18 21:07:14 -06:00
Ben Sander
311a728d25
Search multiple dirs.
2016-02-18 21:07:14 -06:00
Ben Sander
ac5957320e
Update doxygen HTML
2016-02-18 21:02:39 -06:00
Ben Sander
129088992b
Update doxygen HTML
2016-02-18 20:43:03 -06:00
Evgeny Mankov
5ea8543d2e
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov
5b05a9fef1
hipInfo sample update with new Device Properties.
2016-02-18 15:08:55 +03:00
Evgeny Mankov
2b6fda77ca
Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
2016-02-18 14:34:18 +03:00
Ben Sander
b63470f4cc
remove extra :
2016-02-18 03:05:53 -06:00
Ben Sander
b12ec21806
Tweak version numbers
2016-02-18 21:29:51 -06:00
Ben Sander
ba224011b0
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
2016-02-17 23:06:51 -06:00
Ben Sander
c6f8883b0d
Enable Tracker and ROCR by default, verify with HCC
2016-02-17 23:03:37 -06:00
Ben Sander
d653782d9d
Remove HIP-local AM tracker (now in HCC)
2016-02-17 21:33:32 -06:00
Ben Sander
44f40e171a
USE_AM_TRACKER=0 works
2016-02-17 21:23:36 -06:00
Ben Sander
dffe573d49
Tweak full formatting
2016-02-17 21:22:31 -06:00
Ben Sander
f466b291d8
Support HSA_PATH env, async path tweak
2016-02-17 21:22:07 -06:00
dfukalov
a0720fe79a
Adding lit tests
2016-02-17 19:05:18 +03:00
atimofee
434c070ea7
Hipify tool in it current state
2016-02-17 18:34:58 +03:00
pensun
8aa4bfce57
1. Bug fix
...
2. passed initial tests on different sets of
HIP_VISIBLE_DEVICES: (0),(1),(0,1),(1,2),(2,3),(1,2,3),(2,3,4),(1,5,2,3)
and achieved expected choice of GPU devices at the runtime.
3. Passed HIP test suite.
2016-02-17 09:32:50 -06:00
pensun
c1e120fb1b
Implementation of HIP_VISIBLE_DEVICES in runtime
2016-02-17 06:59:18 -06:00
Ben Sander
59379ffb44
more work on async copies
2016-02-17 00:59:12 -06:00
pensun
060439b6ab
modify to add remove invalid devices numbers
2016-02-16 10:00:05 -06:00
pensun
d40cbef2af
Implement to read HIP_VISIBLE_DEVICES to internal global variable
2016-02-16 07:39:04 -06:00
Ben Sander
caef9b5ced
Add per-stream pool for hsa_signals.
2016-02-16 01:59:13 -06:00
Ben Sander
fb883d9d03
Add comments to tests
2016-02-16 01:58:24 -06:00
Ben Sander
38c735fd1d
Update before checkin to HCC.
...
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.
2016-02-15 21:16:00 -06:00
Ben Sander
db3a63360b
Move warpSize to header, have shuffles use default warpsize.
2016-02-15 05:41:09 -06:00
Ben Sander
57274850f9
Remove old include path.
2016-02-15 05:40:37 -06:00
Ben Sander
4637e19da4
Update docs, cleanup
2016-02-15 05:40:12 -06:00
Ben Sander
3ecd1b1539
Fix tests to account for multi-gpu
2016-02-15 05:19:52 -06:00
Evgeny Mankov
072d649d8d
Formatting, no functional changes.
2016-02-15 13:16:05 +03:00
Ben Sander
6420655dc8
Add multi-threading synchonization on staging buffers and signals.
...
Also pre-allocate a couple signals for copies.
2016-02-13 03:18:01 -06:00
Ben Sander
928996fec7
Enable -O3, style points on array size
2016-02-13 03:17:42 -06:00