atimofee
1d9d223e31
Fixed tool crash on kernels with empty parameter list
...
[ROCm/clr commit: 63df6f915f ]
2016-02-25 22:54:58 +03:00
dfukalov
79fdab1486
fixed lit script discovery in standalone build case
...
[ROCm/clr commit: 3d5de5305d ]
2016-02-25 15:18:14 +03:00
Ben Sander
37b05790b1
Add test for thread-safety on streams
...
[ROCm/clr commit: 2228bbd119 ]
2016-02-25 04:22:34 -06:00
Ben Sander
e8021479b3
Add abstraction for pinned/unpinned, and sync/async mem copies selection in tests
...
[ROCm/clr commit: 382638df1e ]
2016-02-25 04:21:24 -06:00
atimofee
1ec9553c6b
String literal bug fixed + string literal processing refactoring
...
[ROCm/clr commit: 36833c78f3 ]
2016-02-24 21:22:32 +03:00
atimofee
e0eb715a95
Merge branch 'master' of https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP-hipify
...
[ROCm/clr commit: 2056dadaa1 ]
2016-02-24 19:27:52 +03:00
atimofee
b2755257b1
TAB deleted
...
[ROCm/clr commit: a5695b1d52 ]
2016-02-24 19:27:02 +03:00
atimofee
7ba59141b3
CUDA names in string literals replacment added
...
[ROCm/clr commit: 665d3dc443 ]
2016-02-24 19:23:23 +03:00
dfukalov
d0da3dbb31
-o=<output_filename> option added
...
[ROCm/clr commit: ac30cbfe65 ]
2016-02-24 19:10:00 +03:00
atimofee
108a06eea8
CUDA names in string literals replacment added
...
[ROCm/clr commit: 8bf31a49c8 ]
2016-02-24 17:42:02 +03:00
Aditya Avinash Atluri
65dd5b8170
Update hip.vim
...
[ROCm/clr commit: b86a905958 ]
2016-02-23 13:40:20 -06:00
Ben Sander
08f8387268
Add tests for multi-threaded streams
...
[ROCm/clr commit: 836c485d0b ]
2016-02-23 12:08:22 -06:00
Ben Sander
bb27a1d7e8
Sync review.
...
- add calls to ihipInit missing from some routines.
- sync before draining a stream.
[ROCm/clr commit: 8f98aca124 ]
2016-02-23 04:07:11 -06:00
Ben Sander
34aa8f1304
add hipLaunchParm
...
[ROCm/clr commit: 7d553a1113 ]
2016-02-23 04:05:41 -06:00
Ben Sander
da37035a9c
Improve async copy implementation.
...
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
- HIP_DISABLE_ENQ_BARRIER
- HIP_DISABLE_BIDIR_MEMCPY
- HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.
[ROCm/clr commit: 28990567fb ]
2016-02-22 23:15:24 -06:00
Ben Sander
9e9b4fb547
Fix memcpy for Titan. Add <threads> to common includes
...
[ROCm/clr commit: 784ebcbc86 ]
2016-02-22 15:09:23 -06:00
Ben Sander
3b8a545ba7
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
[ROCm/clr commit: 16b04fc0d3 ]
2016-02-22 08:33:47 -06:00
gargrahul
9bb7be6891
Update for shared atomics support
...
[ROCm/clr commit: 14508fd0d6 ]
2016-02-22 16:21:52 +05:30
Maneesh Gupta
777bb30c51
Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
...
[ROCm/clr commit: e2d97e19bc ]
2016-02-22 15:13:23 +05:30
Ben Sander
4752321fb1
Track last command to a stream.
...
Passing simple tests.
[ROCm/clr commit: d5c777268a ]
2016-02-20 11:02:07 -06:00
atimofee
3603f3b1b1
Added CUDA names replacement in string literals (i.e. error messages)
...
[ROCm/clr commit: 88b4f2859f ]
2016-02-19 21:58:33 +03:00
streamhsa
6debabd11a
Resolve issues for hip_popc and hip_ballot on nvcc
...
[ROCm/clr commit: 005155b7b2 ]
2016-02-19 20:18:03 +08:00
Evgeny Mankov
9874e598f8
A support of the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth.
...
[ROCm/clr commit: 376fb0d8ad ]
2016-02-19 13:36:37 +03:00
Evgeny Mankov
c76791140d
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
[ROCm/clr commit: d4b15399f5 ]
2016-02-19 13:27:03 +03:00
Ben Sander
beae41517d
Describe how to update HTML docs
...
[ROCm/clr commit: 16ff0757a6 ]
2016-02-19 01:56:17 -06:00
dfukalov
1b66d9ac5c
fix build bug with current clang/llvm
...
[ROCm/clr commit: 1f77fd5dcc ]
2016-02-18 23:16:52 +03:00
Evgeny Mankov
420eea7379
Formatting, no functional changes.
...
[ROCm/clr commit: 14ec340746 ]
2016-02-18 18:54:19 +03:00
Evgeny Mankov
a17733dd80
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
[ROCm/clr commit: da8169dd89 ]
2016-02-18 18:15:01 +03:00
Ben Sander
6507476d72
Update release notes
...
[ROCm/clr commit: 2447067f27 ]
2016-02-18 21:07:14 -06:00
Ben Sander
86958d1d1f
Search multiple dirs.
...
[ROCm/clr commit: ad20273a1d ]
2016-02-18 21:07:14 -06:00
Ben Sander
3ed12119e7
Update doxygen HTML
...
[ROCm/clr commit: 8c3436e927 ]
2016-02-18 21:02:39 -06:00
Ben Sander
c8675123d0
Update doxygen HTML
...
[ROCm/clr commit: 3496398651 ]
2016-02-18 20:43:03 -06:00
Evgeny Mankov
a47073f25d
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
[ROCm/clr commit: 8aace64dce ]
2016-02-18 17:25:28 +03:00
Evgeny Mankov
6812fa603f
hipInfo sample update with new Device Properties.
...
[ROCm/clr commit: 859208d6f0 ]
2016-02-18 15:08:55 +03:00
Evgeny Mankov
3faa6fd86c
Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
...
[ROCm/clr commit: d4bd94e9a0 ]
2016-02-18 14:34:18 +03:00
Ben Sander
a3ec0ae280
remove extra :
...
[ROCm/clr commit: 866e64f6e2 ]
2016-02-18 03:05:53 -06:00
Ben Sander
3888f4e212
Tweak version numbers
...
[ROCm/clr commit: 53f37712bd ]
2016-02-18 21:29:51 -06:00
Ben Sander
7760c82196
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
[ROCm/clr commit: d0f9881d60 ]
2016-02-17 23:06:51 -06:00
Ben Sander
f19c6d8342
Enable Tracker and ROCR by default, verify with HCC
...
[ROCm/clr commit: 400dcb8bcb ]
2016-02-17 23:03:37 -06:00
Ben Sander
8ed32daefa
Remove HIP-local AM tracker (now in HCC)
...
[ROCm/clr commit: b08e468c06 ]
2016-02-17 21:33:32 -06:00
Ben Sander
98e608a5ce
USE_AM_TRACKER=0 works
...
[ROCm/clr commit: 354c9f945a ]
2016-02-17 21:23:36 -06:00
Ben Sander
f8d9017103
Tweak full formatting
...
[ROCm/clr commit: d75279c8eb ]
2016-02-17 21:22:31 -06:00
Ben Sander
527d64a2da
Support HSA_PATH env, async path tweak
...
[ROCm/clr commit: 9a82d316c3 ]
2016-02-17 21:22:07 -06:00
dfukalov
222206d3d8
Adding lit tests
...
[ROCm/clr commit: cdcb45d9f2 ]
2016-02-17 19:05:18 +03:00
atimofee
50f5ba8276
Hipify tool in it current state
...
[ROCm/clr commit: 5cd248c55f ]
2016-02-17 18:34:58 +03:00
pensun
60f00e4e8a
1. Bug fix
...
2. passed initial tests on different sets of
HIP_VISIBLE_DEVICES: (0),(1),(0,1),(1,2),(2,3),(1,2,3),(2,3,4),(1,5,2,3)
and achieved expected choice of GPU devices at the runtime.
3. Passed HIP test suite.
[ROCm/clr commit: c1da0f1e12 ]
2016-02-17 09:32:50 -06:00
pensun
afc377b2bc
Implementation of HIP_VISIBLE_DEVICES in runtime
...
[ROCm/clr commit: 43785243a5 ]
2016-02-17 06:59:18 -06:00
Ben Sander
6856d28ca3
more work on async copies
...
[ROCm/clr commit: 0cdbe1ff05 ]
2016-02-17 00:59:12 -06:00
pensun
31bba31b90
modify to add remove invalid devices numbers
...
[ROCm/clr commit: 7309e9ea6a ]
2016-02-16 10:00:05 -06:00
pensun
07e4d8261c
Implement to read HIP_VISIBLE_DEVICES to internal global variable
...
[ROCm/clr commit: 45d863851d ]
2016-02-16 07:39:04 -06:00