Commit Graph

115 Commits

Author SHA1 Message Date
Ben Sander 8a2bcf2da3 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: c300ffe458]
2016-02-26 06:15:09 -06:00
Ben Sander 2a650be661 Add PATH and LD_LIBRARY_FLAGS
[ROCm/hip commit: a30018e166]
2016-02-25 23:07:18 -06:00
Ben Sander 5ca4914e0e Merge branch 'memtracker' into privatestaging
Conflicts:
	src/hip_hcc.cpp


[ROCm/hip commit: 4adab7b7ef]
2016-02-25 19:38:46 -06:00
Ben Sander 22a3806c0c Tweak version numbers
[ROCm/hip commit: b1da7e4a70]
2016-02-25 19:30:42 -06:00
Ben Sander ae365b4ae2 Improve memory copy and command switching
- Add chicken bits to use host-side dependency management.
- Add optional PinInPlace path for unpinned copies
- Synchronize before pinned memcpy path.
- Add mutex to protect two threads launching to same stream.


[ROCm/hip commit: 8b64c0dc62]
2016-02-25 19:19:49 -06:00
Aditya Avinash Atluri 6bb5485beb Update hip.vim
Added d2d and h2h highlights

[ROCm/hip commit: 29c385ed72]
2016-02-25 14:49:58 -06:00
Evgeny Mankov 82900a1888 Add attribute hipDeviceAttributeIsMultiGpuBoard for obtaining the device property isMultiGpuBoard.
On the HIP path, the property is obtained through hsa_iterate_agents by counting the devices of type HSA_DEVICE_TYPE_GPU.

P.S.
On multi-board systems there might be problems detecting which board a GPU is plugged into (not tested).


[ROCm/hip commit: 57e212606d]
2016-02-25 23:44:39 +03:00
Aditya Avinash Atluri 55e5190b57 Update hip.vim
[ROCm/hip commit: 4851cedf6c]
2016-02-23 13:40:20 -06:00
Ben Sander 1888acb5f3 Sync review.
- add calls to ihipInit missing from some routines.
- sync before draining a stream.


[ROCm/hip commit: 3886d494f4]
2016-02-23 04:07:11 -06:00
Ben Sander a40adb296f add hipLaunchParm
[ROCm/hip commit: 9c259bb86c]
2016-02-23 04:05:41 -06:00
Ben Sander f0c734e256 Improve async copy implementation.
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
    - HIP_DISABLE_ENQ_BARRIER
    - HIP_DISABLE_BIDIR_MEMCPY
    - HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.


[ROCm/hip commit: 549b18ce77]
2016-02-22 23:15:24 -06:00
gargrahul ccd1ed0a97 Update for shared atomics support
[ROCm/hip commit: a2fbf06129]
2016-02-22 16:21:52 +05:30
Maneesh Gupta d40b8d8fb0 Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
[ROCm/hip commit: c5c11d3707]
2016-02-22 15:13:23 +05:30
Ben Sander ebf2700936 Track last command to a stream.
Passing simple tests.


[ROCm/hip commit: d33d806a5b]
2016-02-20 11:02:07 -06:00
streamhsa 0389b292a6 Resolve issues for hip_popc and hip_ballot on nvcc
[ROCm/hip commit: a7232c7f9e]
2016-02-19 20:18:03 +08:00
Evgeny Mankov d8df47383d Add support for the following device properties to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth.
[ROCm/hip commit: b23f9776ca]
2016-02-19 13:36:37 +03:00
Evgeny Mankov c3a600c63b Add #ifdef USE_ROCR_20 guard for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
USE_ROCR_20 is not defined by default.
To add ROCR_20 support, HIP has to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20


[ROCm/hip commit: 833c9e52ad]
2016-02-19 13:27:03 +03:00
Ben Sander 7f6001cb75 Describe how to update HTML docs
[ROCm/hip commit: d8cf65edac]
2016-02-19 01:56:17 -06:00
Evgeny Mankov 68621c6702 Formatting, no functional changes.
[ROCm/hip commit: fbdeee39cd]
2016-02-18 18:54:19 +03:00
Evgeny Mankov 4fcd9f2542 Device property memoryBusWidth implementation.
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.


[ROCm/hip commit: 1c19dbb807]
2016-02-18 18:15:01 +03:00
Ben Sander bd83583d40 Update release notes
[ROCm/hip commit: 617e7d8a7d]
2016-02-18 21:07:14 -06:00
Ben Sander f54ada2a14 Search multiple dirs.
[ROCm/hip commit: 311a728d25]
2016-02-18 21:07:14 -06:00
Ben Sander 6b6d71743a Update doxygen HTML
[ROCm/hip commit: ac5957320e]
2016-02-18 21:02:39 -06:00
Ben Sander 5cb2d5fc9a Update doxygen HTML
[ROCm/hip commit: 129088992b]
2016-02-18 20:43:03 -06:00
Evgeny Mankov a0cc7134e3 Device property memoryClockRate implementation.
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.


[ROCm/hip commit: 5ea8543d2e]
2016-02-18 17:25:28 +03:00
Evgeny Mankov 801ae992d6 hipInfo sample update with new Device Properties.
[ROCm/hip commit: 5b05a9fef1]
2016-02-18 15:08:55 +03:00
Evgeny Mankov 8c1a0d1924 Add attribute hipDevAttrConcurrentKernels for obtaining the device property concurrentKernels.
[ROCm/hip commit: 2b6fda77ca]
2016-02-18 14:34:18 +03:00
Ben Sander c989994613 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
[ROCm/hip commit: ba224011b0]
2016-02-17 23:06:51 -06:00
Ben Sander 4e0ba06044 Enable Tracker and ROCR by default, verify with HCC
[ROCm/hip commit: c6f8883b0d]
2016-02-17 23:03:37 -06:00
Ben Sander a2d8f9d98e Remove HIP-local AM tracker (now in HCC)
[ROCm/hip commit: d653782d9d]
2016-02-17 21:33:32 -06:00
Ben Sander 5ab45e9be7 USE_AM_TRACKER=0 works
[ROCm/hip commit: 44f40e171a]
2016-02-17 21:23:36 -06:00
Ben Sander 16c1d446cf Tweak full formatting
[ROCm/hip commit: dffe573d49]
2016-02-17 21:22:31 -06:00
Ben Sander 21b30996cf Support HSA_PATH env, async path tweak
[ROCm/hip commit: f466b291d8]
2016-02-17 21:22:07 -06:00
Ben Sander bb8a1fe72d more work on async copies
[ROCm/hip commit: 59379ffb44]
2016-02-17 00:59:12 -06:00
Ben Sander 512163b889 Add per-stream pool for hsa_signals.
[ROCm/hip commit: caef9b5ced]
2016-02-16 01:59:13 -06:00
Ben Sander 787078f0b3 Add comments to tests
[ROCm/hip commit: fb883d9d03]
2016-02-16 01:58:24 -06:00
Ben Sander 9ab5b92173 Update before checkin to HCC.
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.


[ROCm/hip commit: 38c735fd1d]
2016-02-15 21:16:00 -06:00
Ben Sander d58eab1706 Move warpSize to header, have shuffles use default warpsize.
[ROCm/hip commit: db3a63360b]
2016-02-15 05:41:09 -06:00
Ben Sander 79a99f48f9 Remove old include path.
[ROCm/hip commit: 57274850f9]
2016-02-15 05:40:37 -06:00
Ben Sander b97e430921 Update docs, cleanup
[ROCm/hip commit: 4637e19da4]
2016-02-15 05:40:12 -06:00
Ben Sander 9ccdbdca71 Fix tests to account for multi-gpu
[ROCm/hip commit: 3ecd1b1539]
2016-02-15 05:19:52 -06:00
Evgeny Mankov 189e652a5e Formatting, no functional changes.
[ROCm/hip commit: 072d649d8d]
2016-02-15 13:16:05 +03:00
Ben Sander 9e60e519d4 Add multi-threading synchronization on staging buffers and signals.
Also pre-allocate a couple signals for copies.


[ROCm/hip commit: 6420655dc8]
2016-02-13 03:18:01 -06:00
Ben Sander 565d0fb968 Enable -O3, style points on array size
[ROCm/hip commit: 928996fec7]
2016-02-13 03:17:42 -06:00
Ben Sander 2d468c6ce2 D2H multi-buffer
[ROCm/hip commit: b314777bc1]
2016-02-13 01:15:23 -06:00
Ben Sander 7a633dc4b8 Result formatting
[ROCm/hip commit: c3720c19a8]
2016-02-13 01:14:01 -06:00
Ben Sander 2498ca10d1 Add D2H test
[ROCm/hip commit: bcb5953d6e]
2016-02-12 22:47:26 -06:00
Ben Sander 05e9d398f4 Add D2H test
[ROCm/hip commit: 559db057d5]
2016-02-12 22:46:34 -06:00
Ben Sander b21c2ecfa4 Add Bus Bandwidth test, leveraged from SHOC.
[ROCm/hip commit: f3fd6476eb]
2016-02-12 21:30:43 -06:00
Ben Sander a835134f8e Improve copy testing
[ROCm/hip commit: 1bfd3cdbd0]
2016-02-12 18:24:08 -06:00