Commit graph

6428 commits

Author SHA1 Message Date
Ben Sander b314777bc1 D2H multi-buffer 2016-02-13 01:15:23 -06:00
Ben Sander c3720c19a8 Result formatting 2016-02-13 01:14:01 -06:00
Ben Sander bcb5953d6e Add D2H test 2016-02-12 22:47:26 -06:00
Ben Sander 559db057d5 Add D2H test 2016-02-12 22:46:34 -06:00
Ben Sander f3fd6476eb Add Bus Bandwidth test, leveraged from SHOC. 2016-02-12 21:30:43 -06:00
Ben Sander 1bfd3cdbd0 Improve copy testing 2016-02-12 18:24:08 -06:00
Ben Sander 134d7975ce Improve copy testing implementation.
- add tests for (unpinned/pinned) x H2H x D2D.
- Free memory at end of test.
2016-02-12 18:24:08 -06:00
Ben Sander 24c1fdb864 Step1 in staging buffer copy.
- use StagingBuffer class for copies.
- refactor g_device to use array rather than vector.
   (keeps pointers from moving).
2016-02-12 18:24:08 -06:00
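The note "keeps pointers from moving" in the commit above refers to a general property of growable containers: a `std::vector` may reallocate its storage when it grows, which invalidates previously taken element addresses, while fixed-size array storage does not move. The following is a minimal sketch of that difference, not the actual `g_device` code; `Device`, `kMaxDevices`, and both helper functions are hypothetical names introduced for illustration.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical per-device record; the real g_device element type is not shown in the log.
struct Device {
    int id;
};

// Hypothetical fixed upper bound on the number of devices.
constexpr std::size_t kMaxDevices = 8;

// Grows a vector until it reallocates and reports whether element 0 changed address.
inline bool vectorElementMoved() {
    std::vector<Device> devices;
    devices.push_back(Device{0});
    // Store the address as an integer so the stale pointer is never used afterwards.
    const auto before = reinterpret_cast<std::uintptr_t>(&devices[0]);
    const std::size_t oldCapacity = devices.capacity();
    while (devices.capacity() == oldCapacity) {
        devices.push_back(Device{1});  // eventually forces a reallocation
    }
    return reinterpret_cast<std::uintptr_t>(&devices[0]) != before;
}

// Fills a fixed-size array and reports whether element 0 changed address.
inline bool arrayElementMoved() {
    std::array<Device, kMaxDevices> devices{};
    const auto before = reinterpret_cast<std::uintptr_t>(&devices[0]);
    for (std::size_t i = 0; i < devices.size(); ++i) {
        devices[i].id = static_cast<int>(i);  // writes never relocate the storage
    }
    return reinterpret_cast<std::uintptr_t>(&devices[0]) != before;
}
```

Calling `vectorElementMoved()` returns `true` once the vector reallocates, while `arrayElementMoved()` returns `false`, so pointers into the array stay valid for its whole lifetime.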
Ben Sander d7396b5af3 Query tracked memory sizes.
Support more accurate hipMemGetInfo.  Add test to hipPointerAttrib.
2016-02-12 18:24:08 -06:00
Ben Sander 0370cd1cfc Remove ! USE_PINNED_HOST support 2016-02-12 18:24:08 -06:00
Ben Sander 00fd172c64 Use memtracker 'appID' to store deviceID associated with ptr 2016-02-12 18:24:08 -06:00
Ben Sander de45e2291e Tracker improvements
- add API to add / remove user-pointers from the tracker.
- test for thread-safety with MultiThreadtest_2 - rapid
  insertions/removal.
- add mutex to provide thread-safety.
- rename tracker interface to "memtracker_..." for consistency.
- add am_memtracker_reset, connect to hipDeviceReset.
2016-02-12 18:24:08 -06:00
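As a rough illustration of the thread-safety item in the commit above, a tracker can guard every access to its shared container with one mutex. This is only a sketch under assumptions: `UserPointerTracker` and its methods are hypothetical names, not the `memtracker_...` interface itself, and the real tracker stores more than bare pointers.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <set>

// Hypothetical tracker for user pointers; every method takes the same mutex.
class UserPointerTracker {
public:
    // Returns true if the pointer was newly added, false if already tracked.
    bool add(const void* ptr) {
        std::lock_guard<std::mutex> lock(mutex_);
        return pointers_.insert(ptr).second;
    }

    // Returns true if the pointer was tracked and has been removed.
    bool remove(const void* ptr) {
        std::lock_guard<std::mutex> lock(mutex_);
        return pointers_.erase(ptr) == 1;
    }

    // Drops all entries, similar in spirit to a reset on device reset.
    void reset() {
        std::lock_guard<std::mutex> lock(mutex_);
        pointers_.clear();
    }

    std::size_t size() const {
        std::lock_guard<std::mutex> lock(mutex_);
        return pointers_.size();
    }

private:
    mutable std::mutex mutex_;         // mutable so size() can lock in a const method
    std::set<const void*> pointers_;   // ordered set of tracked addresses
};
```

A single coarse lock keeps the sketch simple; rapid insertions and removals from several threads serialize on `mutex_` instead of corrupting the set.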
Ben Sander 4ee2a5229b Create address tracker for am_alloc.
Tracks device where memory is allocated, pinned-host or device, and more.

Uses memory-range-based lookups, so pointers that exist anywhere in the range of hostPtr + size will find the associated AmPointerInfo.

The insertions and lookups use a self-balancing binary tree and should support O(log N) lookup speed.
2016-02-12 18:24:08 -06:00
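The range-based lookup described in the commit above can be sketched with `std::map`, which is typically implemented as a red-black tree and gives logarithmic insertion and lookup. This is an illustration under assumptions, not the tracker's actual code: `PointerInfo` is a hypothetical stand-in for `AmPointerInfo` (whose fields are not shown in the log), `RangeTracker` is a hypothetical name, and the sketch assumes tracked ranges never overlap.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>

// Hypothetical stand-in for AmPointerInfo.
struct PointerInfo {
    int deviceId;       // device where the memory was allocated
    bool isPinnedHost;  // true for pinned host memory, false for device memory
    std::size_t size;   // allocation size in bytes
};

class RangeTracker {
public:
    // Registers the range [base, base + size). Assumes ranges do not overlap.
    void insert(const void* base, std::size_t size, int deviceId, bool pinnedHost) {
        ranges_[reinterpret_cast<std::uintptr_t>(base)] =
            PointerInfo{deviceId, pinnedHost, size};
    }

    // Removes the range that starts at base; returns false if none exists.
    bool remove(const void* base) {
        return ranges_.erase(reinterpret_cast<std::uintptr_t>(base)) == 1;
    }

    // Returns the info for the range containing ptr, or nullptr if untracked.
    const PointerInfo* find(const void* ptr) const {
        const auto addr = reinterpret_cast<std::uintptr_t>(ptr);
        auto it = ranges_.upper_bound(addr);  // first range starting after addr
        if (it == ranges_.begin()) {
            return nullptr;                   // every range starts after addr
        }
        --it;                                 // last range starting at or before addr
        if (addr - it->first < it->second.size) {
            return &it->second;               // addr lies inside this range
        }
        return nullptr;
    }

private:
    // Keyed by base address; lookups and insertions are O(log N).
    std::map<std::uintptr_t, PointerInfo> ranges_;
};
```

With a 32-byte range registered at `buf`, `find(buf + 31)` returns the stored info and `find(buf + 32)` returns `nullptr`, since the range is half-open.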
Ben Sander e483eea85b Fix bug in device bounds comparison.
Shows up in multi-GPU.
2016-02-12 18:24:08 -06:00
Maneesh Gupta 447201a6a0 Documented supported fastmath functions 2016-02-12 14:21:58 +05:30
Maneesh Gupta 2c01480131 Updated integer intrinsics documentation 2016-02-12 13:58:35 +05:30
Evgeny Mankov ea8f99702d Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size. The NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.

hipify is updated as well.
2016-02-12 01:29:20 +03:00
Evgeny Mankov 9f05a52c74 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
Reason: maxThreadsPerMultiProcessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
2016-02-12 00:04:14 +03:00
Evgeny Mankov 33f60c300d BDFID (BusID/DeviceID/FunctionID) support.
FunctionID (DomainID in CUDA) is not supported, because cudaDeviceProp::pciDomainID is not reported by CUDA.
2016-02-11 22:26:01 +03:00
sunway513 1a8bd21eb6 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-11 22:22:47 +05:30
sunway513 d15f22113f Add reminder to keep ROCR runtime on the system library path 2016-02-11 22:22:00 +05:30
dfukalov dca4a70bb7 adding ability to build in llvm source tree, updated README 2016-02-11 15:27:00 +03:00
Maneesh Gupta a87c7988e8 Updated readme for test 2016-02-11 13:06:58 +05:30
dfukalov 5f224280ce Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion 2016-02-10 20:20:11 +03:00
Ben Sander 09fe3cc7d4 Initial commit 2016-02-10 09:29:29 -06:00
Evgeny Mankov 12d9d9be82 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-10 17:21:53 +03:00
Evgeny Mankov 254da4ec53 Formatting, no functional changes 2016-02-10 17:21:18 +03:00
streamhsa 51bff8757c Remove test for atomicInc and atomicDec 2016-02-10 21:02:52 +08:00
streamhsa 6f2a94fb93 Updated readme for test 2016-02-10 20:05:59 +08:00
streamhsa 4035b71df4 Resolved test issues 2016-02-10 20:01:16 +08:00
gargrahul 8c40a4ace4 Removed atomicInc and atomicDec support from HIP 2016-02-10 04:29:55 +05:30
Peng Sun 1fb48b0714 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into doc_update 2016-02-09 15:08:39 -06:00
Peng Sun 28025f6a74 Fix TODO-Doc in hip_texture.h 2016-02-09 10:58:23 -06:00
Evgeny Mankov 950c3baacd Device property concurrentKernels is added to hipDeviceProp_t struct.
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
2016-02-09 17:10:35 +03:00
Maneesh Gupta 7cc7f88717 which_hip -> hipconfig 2016-02-09 11:51:26 +05:30
Maneesh Gupta 08036ccb9b Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-09 10:57:46 +05:30
Maneesh Gupta 3291e0ec96 Move HIP_DEVICE_COMPILE defines to hip_common.h 2016-02-09 10:57:20 +05:30
streamhsa 5aa254f32f Rename test hipInfo as hipGetDeviceAttribute 2016-02-09 13:19:32 +08:00
Ben Sander 0c74d3fc44 Test fixes:
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.
2016-02-08 22:55:23 -06:00
Ben Sander 9e2c3c8df3 minor doc touchup 2016-02-08 22:11:11 -06:00
Ben Sander b8f7126662 in HIPCHECK, only run command once even if error occurs 2016-02-08 21:45:49 -06:00
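The log does not show the macro itself, but a common cause of a command running twice in an error-checking macro is that the argument is expanded more than once. The sketch below illustrates that pattern and its fix under that assumption. `Status`, `CHECK_TWICE`, `CHECK_ONCE`, `failingCall`, and `g_calls` are hypothetical names so the example compiles without the HIP headers, and the sketch records the error rather than printing and exiting.

```cpp
#include <cassert>

// Hypothetical stand-in for hipError_t.
enum Status { kSuccess = 0, kFailure = 1 };

// Problematic pattern: `cmd` appears twice in the expansion, so a failing
// call is executed a second time when its result is fetched for reporting.
#define CHECK_TWICE(cmd, lastError)   \
    do {                              \
        if ((cmd) != kSuccess) {      \
            (lastError) = (cmd);      \
        }                             \
    } while (0)

// Fixed pattern: evaluate `cmd` exactly once and reuse the stored result.
#define CHECK_ONCE(cmd, lastError)    \
    do {                              \
        const Status status_ = (cmd); \
        if (status_ != kSuccess) {    \
            (lastError) = status_;    \
        }                             \
    } while (0)

// Counts how often the checked command actually runs.
static int g_calls = 0;

// Hypothetical command that always fails and has a visible side effect.
static Status failingCall() {
    ++g_calls;
    return kFailure;
}
```

Wrapping `failingCall()` in `CHECK_TWICE` increments `g_calls` twice, while `CHECK_ONCE` increments it once, which matters whenever the checked command has side effects such as allocating memory or launching work.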
Ben Sander fbc27c222b Doc update 2016-02-08 21:44:55 -06:00
Ben Sander 2a33197a06 Add hcc-config info to --full 2016-02-08 21:44:55 -06:00
Ben Sander 7cb5412ff4 iScript cleanup, add --full 2016-02-08 21:44:55 -06:00
Peng Sun fb3b11774b fix merging conflicts 2016-02-08 15:35:49 -06:00
Ben Sander 272ae52506 Fix HIP_PLATFORM detection 2016-02-05 07:15:46 -06:00
Ben Sander 76ebe6dcfd Fix getdeviceattr compilation for NVCC 2016-02-04 16:26:33 -06:00
Sam Kolton 0a27507208 Implementation of hipDeviceGetAttribute() 2016-02-04 17:39:27 +03:00
Peng Sun 3d5608ea84 Additional typo and extra space fix 2016-02-03 09:42:16 -06:00
Ben Sander 1327d3e03d Merge branch 'master' into privatestaging 2016-02-03 09:39:19 -06:00