Commit Graph

65 Commits

Author SHA1 Message Date
Ben Sander 83eb66eb8e Improve copy testing implementation.
- add tests for (unpinned/pinned) x H2H x D2D.
- Free memory at end of test.


[ROCm/hip commit: 134d7975ce]
2016-02-12 18:24:08 -06:00
Ben Sander c441d5ec29 Step1 in staging buffer copy.
- use StagingBuffer class for copies.
- refactor g_device to use array rather than vector.
   (keeps pointers from moving).


[ROCm/hip commit: 24c1fdb864]
2016-02-12 18:24:08 -06:00
Ben Sander b9dc0e9497 Query tracked memory sizes.
Support more accurate hipMemGetInfo.  Add test to hipPointerAttrib.


[ROCm/hip commit: d7396b5af3]
2016-02-12 18:24:08 -06:00
Ben Sander c1fd0e1608 Remove ! USE_PINNED_HOST support
[ROCm/hip commit: 0370cd1cfc]
2016-02-12 18:24:08 -06:00
Ben Sander 305076d78f Use memtracker 'appID' to store deviceID associated with ptr
[ROCm/hip commit: 00fd172c64]
2016-02-12 18:24:08 -06:00
Ben Sander 680b600b4a Tracker improvements
- add API to add / remove user-pointers from the tracker.
- test for thread-safety with MultiThreadtest_2 - rapid
  insertions/removal.
- add mutex to provide thread-safety.
- rename tracker interface to "memtracker_..." for consistency.
- add am_memtracker_reset, connect to hipDeviceReset.


[ROCm/hip commit: de45e2291e]
2016-02-12 18:24:08 -06:00
Ben Sander d4a90f8afd Create address tracker for am_alloc.
Tracks device where memory is allocated, pinned-host or device, and
more.

Uses memory-range-based lookups - so pointers that exist anywhere in
the range of hostPtr + size will find the associated AmPointerInfo.

The insertions and lookups use a self-balancing binary tree and
should support O(logN) lookup speed.


[ROCm/hip commit: 4ee2a5229b]
2016-02-12 18:24:08 -06:00
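
The range-based lookup described in the address-tracker commit (d4a90f8afd) can be illustrated with a minimal sketch. This is not the tracker's actual code: `RangeTracker`, `PointerInfo`, and their members are hypothetical names, and `std::map` (commonly implemented as a red-black tree) stands in for the self-balancing tree the message mentions. The sketch assumes tracked ranges do not overlap and omits the mutex that a later commit adds for thread safety.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>

// Hypothetical record; the real AmPointerInfo layout is not shown in this log.
struct PointerInfo {
    std::size_t size;  // length of the tracked range in bytes
    int deviceId;      // device the allocation belongs to
    bool pinnedHost;   // true for pinned host memory, false for device memory
};

// Tracks non-overlapping address ranges keyed by their base address.
class RangeTracker {
public:
    void insert(const void* base, std::size_t size, int deviceId, bool pinnedHost) {
        ranges_[toAddr(base)] = PointerInfo{size, deviceId, pinnedHost};
    }

    // Removes the range that starts at base; returns false if none was tracked.
    bool remove(const void* base) { return ranges_.erase(toAddr(base)) == 1; }

    // Returns the info for the range containing ptr, or nullptr if untracked.
    const PointerInfo* find(const void* ptr) const {
        const std::uintptr_t addr = toAddr(ptr);
        auto it = ranges_.upper_bound(addr);  // first range starting after addr
        if (it == ranges_.begin()) return nullptr;
        --it;  // last range starting at or before addr
        return (addr - it->first < it->second.size) ? &it->second : nullptr;
    }

private:
    static std::uintptr_t toAddr(const void* p) {
        return reinterpret_cast<std::uintptr_t>(p);
    }
    std::map<std::uintptr_t, PointerInfo> ranges_;
};
```

Keying the map by base address lets a single `upper_bound` call locate the only candidate range, so a lookup for any interior pointer costs O(log N) in the number of tracked allocations.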
Ben Sander 0a6e6e3b7e Fix bug in device bounds comparison.
Shows up in multi-GPU.


[ROCm/hip commit: e483eea85b]
2016-02-12 18:24:08 -06:00
Maneesh Gupta 5657d7bae7 Documented supported fastmath functions
[ROCm/hip commit: 447201a6a0]
2016-02-12 14:21:58 +05:30
Maneesh Gupta 99052abbdb Updated integer intrinsics documentation
[ROCm/hip commit: 2c01480131]
2016-02-12 13:58:35 +05:30
Evgeny Mankov fcd154097f Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.

hipify is updated as well.


[ROCm/hip commit: ea8f99702d]
2016-02-12 01:29:20 +03:00
Evgeny Mankov c587f89de7 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
Reason: maxThreadsPerMultiProcessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.


[ROCm/hip commit: 9f05a52c74]
2016-02-12 00:04:14 +03:00
Evgeny Mankov 4eade0ce83 BDFID (BusID/DeviceID/FunctionID) support.
FunctionID (DomainID in CUDA) is excluded, because cudaDeviceProp::pciDomainID is not reported by CUDA.


[ROCm/hip commit: 33f60c300d]
2016-02-11 22:26:01 +03:00
sunway513 18119645ab Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 1a8bd21eb6]
2016-02-11 22:22:47 +05:30
sunway513 286f4e783c Add reminder to keep ROCR runtime on the system library path
[ROCm/hip commit: d15f22113f]
2016-02-11 22:22:00 +05:30
Maneesh Gupta f8290141b1 Updated readme for test
[ROCm/hip commit: a87c7988e8]
2016-02-11 13:06:58 +05:30
Evgeny Mankov 1a9bd34ca2 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 12d9d9be82]
2016-02-10 17:21:53 +03:00
Evgeny Mankov 3a032ff317 Formatting, no functional changes
[ROCm/hip commit: 254da4ec53]
2016-02-10 17:21:18 +03:00
streamhsa 4c861f0284 Remove test for atomicInc and atomicDec
[ROCm/hip commit: 51bff8757c]
2016-02-10 21:02:52 +08:00
streamhsa 1c89a8853a Updated readme for test
[ROCm/hip commit: 6f2a94fb93]
2016-02-10 20:05:59 +08:00
streamhsa 837875df37 Resolved test issues
[ROCm/hip commit: 4035b71df4]
2016-02-10 20:01:16 +08:00
gargrahul 1ab2294657 Removed atomicInc and atomicDec support from HIP
[ROCm/hip commit: 8c40a4ace4]
2016-02-10 04:29:55 +05:30
Evgeny Mankov c38a69ef33 Device property concurrentKernels is added to hipDeviceProp_t struct.
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.


[ROCm/hip commit: 950c3baacd]
2016-02-09 17:10:35 +03:00
Maneesh Gupta 1803451df7 which_hip -> hipconfig
[ROCm/hip commit: 7cc7f88717]
2016-02-09 11:51:26 +05:30
Maneesh Gupta 1688644e07 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 08036ccb9b]
2016-02-09 10:57:46 +05:30
Maneesh Gupta 77f61d1a46 Move HIP_DEVICE_COMPILE defines to hip_common.h
[ROCm/hip commit: 3291e0ec96]
2016-02-09 10:57:20 +05:30
streamhsa 902c5aa972 Rename test hipInfo as hipGetDeviceAttribute
[ROCm/hip commit: 5aa254f32f]
2016-02-09 13:19:32 +08:00
Ben Sander 473a40af80 Test fixes:
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.


[ROCm/hip commit: 0c74d3fc44]
2016-02-08 22:55:23 -06:00
Ben Sander a2dac9e12c minor doc touchup
[ROCm/hip commit: 9e2c3c8df3]
2016-02-08 22:11:11 -06:00
Ben Sander f9f308c48d in HIPCHECK, only run command once even if error occurs
[ROCm/hip commit: b8f7126662]
2016-02-08 21:45:49 -06:00
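
The single-evaluation fix named in the HIPCHECK commit (f9f308c48d) follows a common macro pattern, sketched here under assumptions. The real HIPCHECK body is not shown in this log, so `CHECK_ONCE`, `Status`, `failingCall`, and `callCount` are all hypothetical names, and throwing an exception is a choice made for this sketch only; a test harness would more likely print a message and exit.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical stand-in for hipError_t; only the success value matters here.
enum Status { kSuccess = 0, kFailure = 1 };

// Evaluates expr exactly once by storing its result in a local variable,
// then reports from that stored value instead of re-running the call.
#define CHECK_ONCE(expr)                                                  \
    do {                                                                  \
        const Status status_ = (expr);                                    \
        if (status_ != kSuccess) {                                        \
            throw std::runtime_error(                                     \
                std::string(#expr) + " failed with code " +               \
                std::to_string(static_cast<int>(status_)));               \
        }                                                                 \
    } while (0)

// Hypothetical API call that counts its invocations and always fails.
inline int& callCount() {
    static int n = 0;
    return n;
}
inline Status failingCall() {
    ++callCount();
    return kFailure;
}
```

A macro that instead writes `expr` twice, once in the comparison and once in the error message, would invoke a failing call a second time, which matters when the call has side effects.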
Ben Sander 2170b86afc Doc update
[ROCm/hip commit: fbc27c222b]
2016-02-08 21:44:55 -06:00
Ben Sander ebf94bcf3a Add hcc-config info to --full
[ROCm/hip commit: 2a33197a06]
2016-02-08 21:44:55 -06:00
Ben Sander ff36d32095 iScript cleanup, add --full
[ROCm/hip commit: 7cb5412ff4]
2016-02-08 21:44:55 -06:00
Ben Sander c54d10e92a Fix HIP_PLATFORM detection
[ROCm/hip commit: 272ae52506]
2016-02-05 07:15:46 -06:00
Ben Sander 9cb14a455c Fix getdeviceattr compilation for NVCC
[ROCm/hip commit: 76ebe6dcfd]
2016-02-04 16:26:33 -06:00
Sam Kolton 2306293526 Implementation of hipDeviceGetAttribute()
[ROCm/hip commit: 0a27507208]
2016-02-04 17:39:27 +03:00
Ben Sander 2bf594e928 Merge branch 'master' into privatestaging
[ROCm/hip commit: 1327d3e03d]
2016-02-03 09:39:19 -06:00
Peng Sun 503ec9ad24 Fix all TODO-doc
[ROCm/hip commit: c73996d041]
2016-02-02 21:29:09 -06:00
Peng Sun 03630ee0a4 Finish all TODO for error code
[ROCm/hip commit: 8b74333204]
2016-02-02 17:39:46 -06:00
scchan 39fb16bc5f add inline attribute to shfl functions
[ROCm/hip commit: 265c42500f]
2016-02-02 12:53:17 -06:00
Ben Sander 5593dd846a Merge branch 'master' of https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP
[ROCm/hip commit: 6175d665fc]
2016-02-02 10:05:44 -06:00
Ben Sander 57d88913f8 Remove warning on ballot/any/all and pop/clz.
Since these are supported in HIP no reason to emit warnings.


[ROCm/hip commit: 3898f6af3c]
2016-02-02 10:02:48 -06:00
streamhsa af8cc35552 Adjusted the value of __any as per CUDA -sandeep
[ROCm/hip commit: 974d491902]
2016-02-02 15:25:42 +05:30
streamhsa e4635c36a0 ADDED Support for __ffs() and __ffsll() having signed input -sandeep
[ROCm/hip commit: 23904df99b]
2016-02-02 15:05:46 +05:30
streamhsa 19d0b8095f Added test for ballot and removing HIP_FUNCTION from hipSampleAtomicsTest.cpp -sandeep
[ROCm/hip commit: 2b02fe082f]
2016-02-02 14:50:55 +05:30
Jack Chung 114874e5ee Merge branch 'privatestaging' of github.com:AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/hip commit: 8551855f89]
2016-02-02 16:28:02 +08:00
Jack Chung c94a8de983 Suppress linker warnings in case HCC distribution contains OpenCL/SPIR symbols
[ROCm/hip commit: 1e7642c569]
2016-02-02 16:27:42 +08:00
scchan ca142c6d9c adding shfl, shfl_up, shfl_down, shfl_xor intrinsics
[ROCm/hip commit: 04f3e3e598]
2016-02-01 23:55:31 -06:00
Ben Sander 7a7e230b81 Merge pull request #10 from SethosII/patch-1
Update hip_faq.md based on SethosII's review. Closes #10

[ROCm/hip commit: b2b43d55cb]
2016-02-01 22:01:17 -06:00
Maneesh Gupta 9ed3ef50fe Add double and integer intrinsics to test
[ROCm/hip commit: 861cba6f75]
2016-02-01 16:00:45 +05:30