Revīziju grafs

6428 Revīzijas

Autors SHA1 Ziņojums Datums
Aditya Avinash Atluri 4851cedf6c Update hip.vim 2016-02-23 13:40:20 -06:00
Ben Sander 7090f5c3f9 Add tests for multi-threaded streams 2016-02-23 12:08:22 -06:00
Ben Sander 3886d494f4 Sync review.
- add calls to ihipInit missing from some routines.
- sync before draining a stream.
2016-02-23 04:07:11 -06:00
Ben Sander 9c259bb86c add hipLaunchParm 2016-02-23 04:05:41 -06:00
Ben Sander 549b18ce77 Improve async copy implementation.
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
    - HIP_DISABLE_ENQ_BARRIER
    - HIP_DISABLE_BIDIR_MEMCPY
    - HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.
2016-02-22 23:15:24 -06:00
Ben Sander c2d66a48a7 Fix memcpy for Titan. Add <threads> to common includes 2016-02-22 15:09:23 -06:00
Ben Sander 0a98db4b5f Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker 2016-02-22 08:33:47 -06:00
gargrahul a2fbf06129 Update for shared atomics support 2016-02-22 16:21:52 +05:30
Maneesh Gupta c5c11d3707 Enable cospi,rsqrt,sinpi tests for HCC newer than 16073 2016-02-22 15:13:23 +05:30
Ben Sander d33d806a5b Track last command to a stream.
Passing simple tests.
2016-02-20 11:02:07 -06:00
atimofee 4b13cfc460 Added CUDA names replacement in string literals (i.e. error messages) 2016-02-19 21:58:33 +03:00
streamhsa a7232c7f9e Resolve issues for hip_popc and hip_ballot on nvcc 2016-02-19 20:18:03 +08:00
Evgeny Mankov b23f9776ca A support of the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth. 2016-02-19 13:36:37 +03:00
Evgeny Mankov 833c9e52ad Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
Ben Sander d8cf65edac Describe how to update HTML docs 2016-02-19 01:56:17 -06:00
dfukalov f3dfd07fa5 fix build bug with current clang/llvm 2016-02-18 23:16:52 +03:00
Evgeny Mankov fbdeee39cd Formatting, no functional changes. 2016-02-18 18:54:19 +03:00
Evgeny Mankov 1c19dbb807 Device property memoryBusWidth implementation.
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Ben Sander 617e7d8a7d Update release notes 2016-02-18 21:07:14 -06:00
Ben Sander 311a728d25 Search multiple dirs. 2016-02-18 21:07:14 -06:00
Ben Sander ac5957320e Update doxygen HTML 2016-02-18 21:02:39 -06:00
Ben Sander 129088992b Update doxygen HTML 2016-02-18 20:43:03 -06:00
Evgeny Mankov 5ea8543d2e Device property memoryClockRate implementation.
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov 5b05a9fef1 hipInfo sample update with new Device Properties. 2016-02-18 15:08:55 +03:00
Evgeny Mankov 2b6fda77ca Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added. 2016-02-18 14:34:18 +03:00
Ben Sander b63470f4cc remove extra : 2016-02-18 03:05:53 -06:00
Ben Sander b12ec21806 Tweak version numbers 2016-02-18 21:29:51 -06:00
Ben Sander ba224011b0 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker 2016-02-17 23:06:51 -06:00
Ben Sander c6f8883b0d Enable Tracker and ROCR by default, verify with HCC 2016-02-17 23:03:37 -06:00
Ben Sander d653782d9d Remove HIP-local AM tracker (now in HCC) 2016-02-17 21:33:32 -06:00
Ben Sander 44f40e171a USE_AM_TRACKER=0 works 2016-02-17 21:23:36 -06:00
Ben Sander dffe573d49 Tweak full formatting 2016-02-17 21:22:31 -06:00
Ben Sander f466b291d8 Support HSA_PATH env, async path tweak 2016-02-17 21:22:07 -06:00
dfukalov a0720fe79a Adding lit tests 2016-02-17 19:05:18 +03:00
atimofee 434c070ea7 Hipify tool in it current state 2016-02-17 18:34:58 +03:00
pensun 8aa4bfce57 1. Bug fix
2. passed initial tests on different sets of
HIP_VISIBLE_DEVICES: (0),(1),(0,1),(1,2),(2,3),(1,2,3),(2,3,4),(1,5,2,3)
and achieved expected choice of GPU devices at the runtime.
3. Passed HIP test suite.
2016-02-17 09:32:50 -06:00
pensun c1e120fb1b Implementation of HIP_VISIBLE_DEVICES in runtime 2016-02-17 06:59:18 -06:00
Ben Sander 59379ffb44 more work on async copies 2016-02-17 00:59:12 -06:00
pensun 060439b6ab modify to add remove invalid devices numbers 2016-02-16 10:00:05 -06:00
pensun d40cbef2af Implement to read HIP_VISIBLE_DEVICES to internal global variable 2016-02-16 07:39:04 -06:00
Ben Sander caef9b5ced Add per-stream pool for hsa_signals. 2016-02-16 01:59:13 -06:00
Ben Sander fb883d9d03 Add comments to tests 2016-02-16 01:58:24 -06:00
Ben Sander 38c735fd1d Update before checkin to HCC.
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.
2016-02-15 21:16:00 -06:00
Ben Sander db3a63360b Move warpSize to header, have shuffles use default warpsize. 2016-02-15 05:41:09 -06:00
Ben Sander 57274850f9 Remove old include path. 2016-02-15 05:40:37 -06:00
Ben Sander 4637e19da4 Update docs, cleanup 2016-02-15 05:40:12 -06:00
Ben Sander 3ecd1b1539 Fix tests to account for multi-gpu 2016-02-15 05:19:52 -06:00
Evgeny Mankov 072d649d8d Formatting, no functional changes. 2016-02-15 13:16:05 +03:00
Ben Sander 6420655dc8 Add multi-threading synchonization on staging buffers and signals.
Also pre-allocate a couple signals for copies.
2016-02-13 03:18:01 -06:00
Ben Sander 928996fec7 Enable -O3, style points on array size 2016-02-13 03:17:42 -06:00