rocm-systems

Author	SHA1	Message	Date
Ben Sander	24c1fdb864	Step1 in staging buffer copy. - use StagingBuffer class for copies. - refactor g_device to use array rather than vector. (keeps pointers from moving).	2016-02-12 18:24:08 -06:00
Ben Sander	d7396b5af3	Query tracked memory sizes. Support more accurate hipMemGetInfo. Add test to hipPointerAttrib.	2016-02-12 18:24:08 -06:00
Ben Sander	0370cd1cfc	Remove ! USE_PINNED_HOST support	2016-02-12 18:24:08 -06:00
Ben Sander	00fd172c64	Use memtracker 'appID' to store deviceID associated with ptr	2016-02-12 18:24:08 -06:00
Ben Sander	de45e2291e	Tracker improvements - add API to add / remove user-pointers from the tracker. - test for thread-safety with MultiThreadtest_2 - rapid insertions/removal. - add mutex to provide thread-safety. - rename tracker interface to "memtracker_..." for consistency. - add am_memtracker_reset, connect to hipDeviceReset.	2016-02-12 18:24:08 -06:00
Ben Sander	4ee2a5229b	Create address tracker for am_alloc. Tracks device where memory is allocated, pinned-host or device, and more. Uses memory-range-based lookups - so pointers that exist anywhere in the range of hostPtr + size will find the associated AmPointerInfo. The insertions and lookups use a self-balancing binary tree and should support O(logN) lookup speed.	2016-02-12 18:24:08 -06:00
Ben Sander	e483eea85b	Fix bug in device bounds comparison. Shows up in multi-GPU.	2016-02-12 18:24:08 -06:00
Evgeny Mankov	ea8f99702d	Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor. Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be the same as group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. hipify is updated as well.	2016-02-12 01:29:20 +03:00
Evgeny Mankov	9f05a52c74	Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path). Reason: maxThreadsPerMultiProcessor should be the same as group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.	2016-02-12 00:04:14 +03:00
Evgeny Mankov	33f60c300d	BDFID (BusID/DeviceID/FunctionID) support. Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.	2016-02-11 22:26:01 +03:00
Evgeny Mankov	950c3baacd	Device property concurrentKernels is added to hipDeviceProp_t struct. For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature. For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.	2016-02-09 17:10:35 +03:00
Sam Kolton	0a27507208	Implementation of hipDeviceGetAttribute()	2016-02-04 17:39:27 +03:00
Ben Sander	f38e63ff18	Initial commit for GPUOpen Launch	2016-01-26 20:14:33 -06:00

13 Commits
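
Commits 4ee2a5229b and de45e2291e describe a memory tracker that resolves any pointer inside an allocation's range through a self-balancing tree, guarded by a mutex. The log does not show the actual code, so the following is only a minimal sketch of that idea. It assumes that tracked ranges never overlap, and the names `RangeTracker` and `PointerInfo` and their fields are hypothetical stand-ins, not the real `AmPointerInfo` or `memtracker_...` interface.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>
#include <mutex>

// Hypothetical record; the fields of the real AmPointerInfo are not shown in the log.
struct PointerInfo {
    std::size_t sizeBytes;  // length of the tracked range
    int deviceId;           // device associated with the allocation
    bool isPinnedHost;      // pinned-host memory vs. device memory
};

// Sketch of a range-based tracker. std::map is typically a red-black tree,
// so insert, remove, and find are O(log N). Assumes ranges do not overlap.
class RangeTracker {
public:
    void insert(const void* base, const PointerInfo& info) {
        std::lock_guard<std::mutex> lock(mutex_);
        ranges_[reinterpret_cast<std::uintptr_t>(base)] = info;
    }

    bool remove(const void* base) {
        std::lock_guard<std::mutex> lock(mutex_);
        return ranges_.erase(reinterpret_cast<std::uintptr_t>(base)) > 0;
    }

    // Finds the range containing ptr, where ptr may point anywhere in
    // [base, base + sizeBytes). Copies the record to *out on success.
    bool find(const void* ptr, PointerInfo* out) const {
        std::lock_guard<std::mutex> lock(mutex_);
        const std::uintptr_t addr = reinterpret_cast<std::uintptr_t>(ptr);
        // First range whose base is strictly greater than addr ...
        auto it = ranges_.upper_bound(addr);
        if (it == ranges_.begin()) return false;
        // ... so the previous entry is the only candidate that can contain addr.
        --it;
        if (addr - it->first >= it->second.sizeBytes) return false;
        *out = it->second;
        return true;
    }

    // Drops every tracked range, e.g. when the device is reset.
    void reset() {
        std::lock_guard<std::mutex> lock(mutex_);
        ranges_.clear();
    }

private:
    mutable std::mutex mutex_;
    std::map<std::uintptr_t, PointerInfo> ranges_;
};
```

Keying the map by base address and stepping back one entry from `upper_bound` is what lets an interior pointer find its owning range without scanning every entry. A single coarse mutex keeps the sketch simple; whether the real tracker locks at this granularity is not stated in the log.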