Commit graph

133 commits

Author SHA1 Message Date
pensun f9748334cc resolve conflicts
[ROCm/clr commit: e21841c152]
2016-02-26 09:57:40 -06:00
pensun 35cbad1c77 fix compiling error
[ROCm/clr commit: 980ec93f46]
2016-02-26 09:50:00 -06:00
Aditya Avinash Atluri 1e9551ce7b Null argument handling HIPCC
This change makes hipcc handle being invoked with no arguments. It resolves https://github.com/AMDComputeLibraries/HIP-privatestaging/issues/1

[ROCm/clr commit: aeb86755c3]
2016-02-26 13:47:58 -06:00
Ben Sander b97c2c02b1 fixes for titan platform
[ROCm/clr commit: 8105bd636f]
2016-02-26 05:25:30 -06:00
Ben Sander d5ae77ecc0 Disable ROCR_V2
[ROCm/clr commit: 822c7292c9]
2016-02-26 23:34:45 -06:00
Ben Sander e345f23846 Merge branch 'memtracker' into privatestaging
Conflicts:
	include/nvcc_detail/hip_runtime_api.h


[ROCm/clr commit: 7a1b4c3878]
2016-02-26 06:17:05 -06:00
Ben Sander ee153bb572 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/clr commit: 4a6173fe58]
2016-02-26 06:15:09 -06:00
Ben Sander b642f094b3 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
Conflicts:
	tests/src/hipMemcpy.cpp


[ROCm/clr commit: 69bb511b10]
2016-02-25 23:24:22 -06:00
Ben Sander d1e06b19fc Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
Conflicts:
	tests/src/hipMemcpy.cpp


[ROCm/clr commit: 8d985188dd]
2016-02-25 23:22:51 -06:00
Ben Sander 7d1955ed1d Add PATH and LD_LIBRARY_FLAGS
[ROCm/clr commit: 67e96e983f]
2016-02-25 23:07:18 -06:00
Ben Sander b8b7596d4d Merge branch 'memtracker' into privatestaging
Conflicts:
	src/hip_hcc.cpp


[ROCm/clr commit: af97f5e317]
2016-02-25 19:38:46 -06:00
Ben Sander 078d5ff21e Tweak version numbers
[ROCm/clr commit: d3e1773704]
2016-02-25 19:30:42 -06:00
Ben Sander d2df45e132 Improve memory copy and commands switching
- Add chicken bits to use host-side dependency management.
- Add optional PinInPlace path for unpinned copies
- Synchronize before pinned memcpy path.
- Add mutex to protect two threads launching to same stream.


[ROCm/clr commit: 91ed5c7d78]
2016-02-25 19:19:49 -06:00
Aditya Avinash Atluri a89fb7c11e Update hip.vim
Added d2d and h2h highlights

[ROCm/clr commit: 6ec52348ea]
2016-02-25 14:49:58 -06:00
Evgeny Mankov 8e6e28df60 Add attribute hipDeviceAttributeIsMultiGpuBoard for obtaining the device property isMultiGpuBoard.
On the HIP path, the property is obtained by calling hsa_iterate_agents and counting the devices of type HSA_DEVICE_TYPE_GPU.

P.S.
On multi-board systems there may be problems detecting which board a GPU is plugged into (not tested).


[ROCm/clr commit: 7bb0f17656]
2016-02-25 23:44:39 +03:00
Ben Sander 37b05790b1 Add test for thread-safety on streams
[ROCm/clr commit: 2228bbd119]
2016-02-25 04:22:34 -06:00
Ben Sander e8021479b3 Add abstraction for pinned/unpinned, and sync/async mem copies selection in tests
[ROCm/clr commit: 382638df1e]
2016-02-25 04:21:24 -06:00
Aditya Avinash Atluri 65dd5b8170 Update hip.vim
[ROCm/clr commit: b86a905958]
2016-02-23 13:40:20 -06:00
Ben Sander 08f8387268 Add tests for multi-threaded streams
[ROCm/clr commit: 836c485d0b]
2016-02-23 12:08:22 -06:00
Ben Sander bb27a1d7e8 Sync review.
- add calls to ihipInit missing from some routines.
- sync before draining a stream.


[ROCm/clr commit: 8f98aca124]
2016-02-23 04:07:11 -06:00
Ben Sander 34aa8f1304 add hipLaunchParm
[ROCm/clr commit: 7d553a1113]
2016-02-23 04:05:41 -06:00
Ben Sander da37035a9c Improve async copy implementation.
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
    - HIP_DISABLE_ENQ_BARRIER
    - HIP_DISABLE_BIDIR_MEMCPY
    - HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.


[ROCm/clr commit: 28990567fb]
2016-02-22 23:15:24 -06:00
Ben Sander 9e9b4fb547 Fix memcpy for Titan. Add <threads> to common includes
[ROCm/clr commit: 784ebcbc86]
2016-02-22 15:09:23 -06:00
Ben Sander 3b8a545ba7 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
[ROCm/clr commit: 16b04fc0d3]
2016-02-22 08:33:47 -06:00
gargrahul 9bb7be6891 Update for shared atomics support
[ROCm/clr commit: 14508fd0d6]
2016-02-22 16:21:52 +05:30
Maneesh Gupta 777bb30c51 Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
[ROCm/clr commit: e2d97e19bc]
2016-02-22 15:13:23 +05:30
Ben Sander 4752321fb1 Track last command to a stream.
Passing simple tests.


[ROCm/clr commit: d5c777268a]
2016-02-20 11:02:07 -06:00
streamhsa 6debabd11a Resolve issues for hip_popc and hip_ballot on nvcc
[ROCm/clr commit: 005155b7b2]
2016-02-19 20:18:03 +08:00
Evgeny Mankov 9874e598f8 Add support for the following device properties to the legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate, and hipDeviceAttributeMemoryBusWidth.
[ROCm/clr commit: 376fb0d8ad]
2016-02-19 13:36:37 +03:00
Evgeny Mankov c76791140d Add an #ifdef USE_ROCR_20 guard for the ROCR_20 device properties (memoryClockRate, memoryBusWidth).
USE_ROCR_20 is not defined by default.
To add ROCR_20 support, HIP has to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20


[ROCm/clr commit: d4b15399f5]
2016-02-19 13:27:03 +03:00
Ben Sander beae41517d Describe how to update HTML docs
[ROCm/clr commit: 16ff0757a6]
2016-02-19 01:56:17 -06:00
Evgeny Mankov 420eea7379 Formatting, no functional changes.
[ROCm/clr commit: 14ec340746]
2016-02-18 18:54:19 +03:00
Evgeny Mankov a17733dd80 Device property memoryBusWidth implementation.
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.


[ROCm/clr commit: da8169dd89]
2016-02-18 18:15:01 +03:00
Ben Sander 6507476d72 Update release notes
[ROCm/clr commit: 2447067f27]
2016-02-18 21:07:14 -06:00
Ben Sander 86958d1d1f Search multiple dirs.
[ROCm/clr commit: ad20273a1d]
2016-02-18 21:07:14 -06:00
Ben Sander 3ed12119e7 Update doxygen HTML
[ROCm/clr commit: 8c3436e927]
2016-02-18 21:02:39 -06:00
Ben Sander c8675123d0 Update doxygen HTML
[ROCm/clr commit: 3496398651]
2016-02-18 20:43:03 -06:00
Evgeny Mankov a47073f25d Device property memoryClockRate implementation.
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.


[ROCm/clr commit: 8aace64dce]
2016-02-18 17:25:28 +03:00
Evgeny Mankov 6812fa603f hipInfo sample update with new Device Properties.
[ROCm/clr commit: 859208d6f0]
2016-02-18 15:08:55 +03:00
Evgeny Mankov 3faa6fd86c Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
[ROCm/clr commit: d4bd94e9a0]
2016-02-18 14:34:18 +03:00
Ben Sander a3ec0ae280 remove extra :
[ROCm/clr commit: 866e64f6e2]
2016-02-18 03:05:53 -06:00
Ben Sander 3888f4e212 Tweak version numbers
[ROCm/clr commit: 53f37712bd]
2016-02-18 21:29:51 -06:00
Ben Sander 7760c82196 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
[ROCm/clr commit: d0f9881d60]
2016-02-17 23:06:51 -06:00
Ben Sander f19c6d8342 Enable Tracker and ROCR by default, verify with HCC
[ROCm/clr commit: 400dcb8bcb]
2016-02-17 23:03:37 -06:00
Ben Sander 8ed32daefa Remove HIP-local AM tracker (now in HCC)
[ROCm/clr commit: b08e468c06]
2016-02-17 21:33:32 -06:00
Ben Sander 98e608a5ce USE_AM_TRACKER=0 works
[ROCm/clr commit: 354c9f945a]
2016-02-17 21:23:36 -06:00
Ben Sander f8d9017103 Tweak full formatting
[ROCm/clr commit: d75279c8eb]
2016-02-17 21:22:31 -06:00
Ben Sander 527d64a2da Support HSA_PATH env, async path tweak
[ROCm/clr commit: 9a82d316c3]
2016-02-17 21:22:07 -06:00
pensun 60f00e4e8a 1. Bug fix.
2. Passed initial tests on different sets of
HIP_VISIBLE_DEVICES: (0), (1), (0,1), (1,2), (2,3), (1,2,3), (2,3,4), (1,5,2,3),
and got the expected choice of GPU devices at runtime.
3. Passed the HIP test suite.


[ROCm/clr commit: c1da0f1e12]
2016-02-17 09:32:50 -06:00
pensun afc377b2bc Implementation of HIP_VISIBLE_DEVICES in runtime
[ROCm/clr commit: 43785243a5]
2016-02-17 06:59:18 -06:00
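
Note on the HIP_VISIBLE_DEVICES commits above: the log does not show how the runtime parses the variable, so the following is only a minimal sketch of the general idea, under stated assumptions. The function name parseVisibleDevices and the rule of silently skipping invalid entries are hypothetical and are not taken from the HIP sources; the real runtime may treat malformed or out-of-range entries differently.

```cpp
#include <cstdlib>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper (not part of HIP): turn a HIP_VISIBLE_DEVICES-style
// string such as "1,5,2,3" into the list of physical device indices it
// selects. The position in the returned vector is the index the application
// would see; the stored value is the physical device index.
// Assumption: entries that are not plain integers, or that fall outside
// [0, physicalCount), are skipped.
std::vector<int> parseVisibleDevices(const std::string& value, int physicalCount) {
    std::vector<int> visible;
    std::stringstream ss(value);
    std::string token;
    while (std::getline(ss, token, ',')) {
        if (token.empty()) continue;
        char* end = nullptr;
        long idx = std::strtol(token.c_str(), &end, 10);
        if (end == token.c_str() || *end != '\0') continue;  // not a plain integer
        if (idx < 0 || idx >= physicalCount) continue;        // no such device
        visible.push_back(static_cast<int>(idx));
    }
    return visible;
}
```

With the value "1,5,2,3" and six physical devices, this sketch keeps the listed order, so application-visible device 0 would correspond to physical device 1 and application-visible device 1 to physical device 5.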