Ben Sander
b314777bc1
D2H multi-buffer
2016-02-13 01:15:23 -06:00
Ben Sander
c3720c19a8
Result formatting
2016-02-13 01:14:01 -06:00
Ben Sander
bcb5953d6e
Add D2H test
2016-02-12 22:47:26 -06:00
Ben Sander
559db057d5
Add D2H test
2016-02-12 22:46:34 -06:00
Ben Sander
f3fd6476eb
Add Bus Bandwidth test, leveraged from SHOC.
2016-02-12 21:30:43 -06:00
Ben Sander
1bfd3cdbd0
Improve copy testing
2016-02-12 18:24:08 -06:00
Ben Sander
134d7975ce
Improve copy testing implementation.
...
- add tests for (unpinned/pinned) x H2H x D2D.
- Free memory at end of test.
2016-02-12 18:24:08 -06:00
Ben Sander
24c1fdb864
Step1 in staging buffer copy.
...
- use StagingBuffer class for copies.
- refactor g_device to use array rather than vector.
(keeps pointers from moving).
2016-02-12 18:24:08 -06:00
Ben Sander
d7396b5af3
Query tracked memory sizes.
...
Support more accurate hipMemGetInfo. Add test to hipPointerAttrib.
2016-02-12 18:24:08 -06:00
Ben Sander
0370cd1cfc
Remove ! USE_PINNED_HOST support
2016-02-12 18:24:08 -06:00
Ben Sander
00fd172c64
Use memtracker 'appID' to store deviceID associated with ptr
2016-02-12 18:24:08 -06:00
Ben Sander
de45e2291e
Tracker improvements
...
- add API to add / remove user-pointers from the tracker.
- test for thread-safety with MultiThreadtest_2 - rapid
insertions/removal.
- add mutex to provide thread-safety.
- rename tracker interface to "memtracker_..." for consistency.
- add am_memtracker_reset, connect to hipDeviceReset.
-
2016-02-12 18:24:08 -06:00
Ben Sander
4ee2a5229b
Create address tracker for am_alloc.
...
Tracks device where memory is allocated, pinned-host or device, and
more.
Uses memory-range-based lookups - so pointers that exist anywhere in
the range of hostPtr + size will find the associated AmPointerInfo.
The insertions and lookups use a self-balancing binary tree and
should support O(logN) lookup speed.
2016-02-12 18:24:08 -06:00
Ben Sander
e483eea85b
Fix bug in device bounds comparison.
...
Shows up in multi-GPU.
2016-02-12 18:24:08 -06:00
Maneesh Gupta
447201a6a0
Documented supported fastmath functions
2016-02-12 14:21:58 +05:30
Maneesh Gupta
2c01480131
Updated integer intrinsics documentation
2016-02-12 13:58:35 +05:30
Evgeny Mankov
ea8f99702d
Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
...
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.
hipify is updated as well.
2016-02-12 01:29:20 +03:00
Evgeny Mankov
9f05a52c74
Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
...
Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size.
NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
2016-02-12 00:04:14 +03:00
Evgeny Mankov
33f60c300d
BDFID (BusID/DeviceID/FunctionID) support.
...
Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.
2016-02-11 22:26:01 +03:00
sunway513
1a8bd21eb6
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-11 22:22:47 +05:30
sunway513
d15f22113f
Add reminder to keep ROCR runtime on the system library path
2016-02-11 22:22:00 +05:30
dfukalov
dca4a70bb7
adding ability to build in llvm source tree, updated README
2016-02-11 15:27:00 +03:00
Maneesh Gupta
a87c7988e8
Updated readme for test
2016-02-11 13:06:58 +05:30
dfukalov
5f224280ce
Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion
2016-02-10 20:20:11 +03:00
Ben Sander
09fe3cc7d4
Initial commit
2016-02-10 09:29:29 -06:00
Evgeny Mankov
12d9d9be82
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-10 17:21:53 +03:00
Evgeny Mankov
254da4ec53
Formatting, no functional changes
2016-02-10 17:21:18 +03:00
streamhsa
51bff8757c
Remove test for atomicInc and atomicDec
2016-02-10 21:02:52 +08:00
streamhsa
6f2a94fb93
Updated readme for test
2016-02-10 20:05:59 +08:00
streamhsa
4035b71df4
Resolved test issues
2016-02-10 20:01:16 +08:00
gargrahul
8c40a4ace4
Removed atomicInc and atomicDec support from HIP
2016-02-10 04:29:55 +05:30
Peng Sun
1fb48b0714
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into doc_update
2016-02-09 15:08:39 -06:00
Peng Sun
28025f6a74
Fix TODO-Doc in hip_texture.h
2016-02-09 10:58:23 -06:00
Evgeny Mankov
950c3baacd
Device property concurrentKernels is added to hipDeviceProp_t struct.
...
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
2016-02-09 17:10:35 +03:00
Maneesh Gupta
7cc7f88717
which_hip -> hipconfig
2016-02-09 11:51:26 +05:30
Maneesh Gupta
08036ccb9b
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-09 10:57:46 +05:30
Maneesh Gupta
3291e0ec96
Move HIP_DEVICE_COMPILE defines to hip_common.h
2016-02-09 10:57:20 +05:30
streamhsa
5aa254f32f
Rename test hipInfo as hipGetDeviceAttribute
2016-02-09 13:19:32 +08:00
Ben Sander
0c74d3fc44
Test fixes:
...
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.
2016-02-08 22:55:23 -06:00
Ben Sander
9e2c3c8df3
minor doc touchup
2016-02-08 22:11:11 -06:00
Ben Sander
b8f7126662
in HIPCHECK, only run command once even if error occurs
2016-02-08 21:45:49 -06:00
Ben Sander
fbc27c222b
Doc update
2016-02-08 21:44:55 -06:00
Ben Sander
2a33197a06
Add hcc-config info to --full
2016-02-08 21:44:55 -06:00
Ben Sander
7cb5412ff4
iScript cleanup, add --full
2016-02-08 21:44:55 -06:00
Peng Sun
fb3b11774b
fix merging conflicts
2016-02-08 15:35:49 -06:00
Ben Sander
272ae52506
Fix HIP_PLATFORM detection
2016-02-05 07:15:46 -06:00
Ben Sander
76ebe6dcfd
Fix getdeviceattr compilation for NVCC
2016-02-04 16:26:33 -06:00
Sam Kolton
0a27507208
Implementation of hipDeviceGetAttribute()
2016-02-04 17:39:27 +03:00
Peng Sun
3d5608ea84
Additional typo and extra space fix
2016-02-03 09:42:16 -06:00
Ben Sander
1327d3e03d
Merge branch 'master' into privatestaging
2016-02-03 09:39:19 -06:00