Commit graph

88 commits

Author SHA1 Message Date
Ben Sander beae41517d Describe how to update HTML docs
[ROCm/clr commit: 16ff0757a6]
2016-02-19 01:56:17 -06:00
Ben Sander 7760c82196 Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
[ROCm/clr commit: d0f9881d60]
2016-02-17 23:06:51 -06:00
Ben Sander f19c6d8342 Enable Tracker and ROCR by default, verify with HCC
[ROCm/clr commit: 400dcb8bcb]
2016-02-17 23:03:37 -06:00
Ben Sander 8ed32daefa Remove HIP-local AM tracker (now in HCC)
[ROCm/clr commit: b08e468c06]
2016-02-17 21:33:32 -06:00
Ben Sander 98e608a5ce USE_AM_TRACKER=0 works
[ROCm/clr commit: 354c9f945a]
2016-02-17 21:23:36 -06:00
Ben Sander f8d9017103 Tweak full formatting
[ROCm/clr commit: d75279c8eb]
2016-02-17 21:22:31 -06:00
Ben Sander 527d64a2da Support HSA_PATH env, async path tweak
[ROCm/clr commit: 9a82d316c3]
2016-02-17 21:22:07 -06:00
Ben Sander 6856d28ca3 more work on async copies
[ROCm/clr commit: 0cdbe1ff05]
2016-02-17 00:59:12 -06:00
Ben Sander 0e83efe14d Add per-stream pool for hsa_signals.
[ROCm/clr commit: 5d721a2649]
2016-02-16 01:59:13 -06:00
Ben Sander d0b29d9192 Add comments to tests
[ROCm/clr commit: 731a2a58d3]
2016-02-16 01:58:24 -06:00
Ben Sander f8f40e07bf Update before checkin to HCC.
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.


[ROCm/clr commit: 1ed431c0f6]
2016-02-15 21:16:00 -06:00
Ben Sander 93c07bc3d1 Move warpSize to header, have shuffles use default warpsize.
[ROCm/clr commit: bd7e3b83b9]
2016-02-15 05:41:09 -06:00
Ben Sander dc6f0ef3a6 Remove old include path.
[ROCm/clr commit: 3b2d4acabc]
2016-02-15 05:40:37 -06:00
Ben Sander 84810268c0 Update docs, cleanup
[ROCm/clr commit: 322a3bd9b2]
2016-02-15 05:40:12 -06:00
Ben Sander 7e46e90591 Fix tests to account for multi-gpu
[ROCm/clr commit: afbe451b0d]
2016-02-15 05:19:52 -06:00
Ben Sander ed618e637e Add multi-threading synchronization on staging buffers and signals.
Also pre-allocate a couple of signals for copies.


[ROCm/clr commit: 8939b4f0e5]
2016-02-13 03:18:01 -06:00
Ben Sander 275a36ee47 Enable -O3, style points on array size
[ROCm/clr commit: 56b3d2e7c4]
2016-02-13 03:17:42 -06:00
Ben Sander 61b2c054a2 D2H multi-buffer
[ROCm/clr commit: a002833a89]
2016-02-13 01:15:23 -06:00
Ben Sander e7666bd9d0 Result formatting
[ROCm/clr commit: 8e3dd664eb]
2016-02-13 01:14:01 -06:00
Ben Sander 6096173cd1 Add D2H test
[ROCm/clr commit: 1cd35f6371]
2016-02-12 22:47:26 -06:00
Ben Sander 06e7be346a Add D2H test
[ROCm/clr commit: 81c275da13]
2016-02-12 22:46:34 -06:00
Ben Sander ea298fae23 Add Bus Bandwidth test, leveraged from SHOC.
[ROCm/clr commit: ba7497b409]
2016-02-12 21:30:43 -06:00
Ben Sander b8ea1ec374 Improve copy testing
[ROCm/clr commit: 2353cbb028]
2016-02-12 18:24:08 -06:00
Ben Sander 4dfe77a99b Improve copy testing implementation.
- add tests for (unpinned/pinned) x H2H x D2D.
- Free memory at end of test.


[ROCm/clr commit: 1128610801]
2016-02-12 18:24:08 -06:00
Ben Sander 89e461988e Step 1 in staging buffer copy.
- use StagingBuffer class for copies.
- refactor g_device to use array rather than vector.
   (keeps pointers from moving).


[ROCm/clr commit: 90af462b85]
2016-02-12 18:24:08 -06:00
Ben Sander 5978d5f372 Query tracked memory sizes.
Support more accurate hipMemGetInfo.  Add test to hipPointerAttrib.


[ROCm/clr commit: f464cedcf4]
2016-02-12 18:24:08 -06:00
Ben Sander 80d7c867d1 Remove ! USE_PINNED_HOST support
[ROCm/clr commit: f2c1bf3bc0]
2016-02-12 18:24:08 -06:00
Ben Sander 712750e1a5 Use memtracker 'appID' to store deviceID associated with ptr
[ROCm/clr commit: c04b5d3afb]
2016-02-12 18:24:08 -06:00
Ben Sander 2089e549eb Tracker improvements
- add API to add / remove user-pointers from the tracker.
- test for thread-safety with MultiThreadtest_2 (rapid insertions/removals).
- add mutex to provide thread-safety.
- rename tracker interface to "memtracker_..." for consistency.
- add am_memtracker_reset, connect to hipDeviceReset.


[ROCm/clr commit: 7216727fba]
2016-02-12 18:24:08 -06:00
Ben Sander fe67be1134 Create address tracker for am_alloc.
Tracks the device where memory is allocated, whether it is pinned-host or device memory, and more.

Uses memory-range-based lookups, so pointers that fall anywhere in the range [hostPtr, hostPtr + size) will find the associated AmPointerInfo.

The insertions and lookups use a self-balancing binary tree and should support O(log N) lookup speed.


[ROCm/clr commit: 721508cc2f]
2016-02-12 18:24:08 -06:00
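
The range-based lookup described in the address tracker commit (fe67be1134) can be illustrated with a minimal C++17 sketch. This is not the HIP or HCC implementation: `PointerInfo`, `RangeTracker`, and all of their members are hypothetical names chosen for illustration, and the real AmPointerInfo fields are not shown in this log. The sketch assumes tracked ranges never overlap and uses `std::map`, which is typically a red-black tree, to get O(log N) insertion and lookup.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <optional>

// Hypothetical record; stands in for the AmPointerInfo named in the commit.
struct PointerInfo {
    const void* base;       // start of the tracked allocation
    std::size_t sizeBytes;  // length of the allocation in bytes
    int deviceId;           // device the allocation is associated with
    bool isPinnedHost;      // true for pinned-host memory, false for device memory
};

// Hypothetical tracker keyed by the base address of each allocation.
// Assumes tracked ranges do not overlap.
class RangeTracker {
public:
    void insert(const void* base, std::size_t sizeBytes, int deviceId, bool isPinnedHost) {
        assert(sizeBytes > 0);
        map_[key(base)] = PointerInfo{base, sizeBytes, deviceId, isPinnedHost};
    }

    // Returns true if an allocation starting at `base` was tracked and removed.
    bool remove(const void* base) { return map_.erase(key(base)) == 1; }

    // Finds the allocation containing `ptr`, which may point anywhere in
    // [base, base + sizeBytes), not only at the base address.
    std::optional<PointerInfo> find(const void* ptr) const {
        auto it = map_.upper_bound(key(ptr));  // first range starting after ptr
        if (it == map_.begin()) return std::nullopt;
        --it;                                  // last range starting at or before ptr
        if (key(ptr) - it->first < it->second.sizeBytes) return it->second;
        return std::nullopt;
    }

private:
    static std::uintptr_t key(const void* p) { return reinterpret_cast<std::uintptr_t>(p); }
    std::map<std::uintptr_t, PointerInfo> map_;
};
```

For example, after `tracker.insert(buf, 128, 1, true)`, a call to `tracker.find(buf + 64)` returns the record for `buf`, while `tracker.find(buf + 128)` returns an empty optional because that address is one past the end of the range. The sketch is single-threaded; the later "Tracker improvements" commit mentions adding a mutex for thread-safety.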
Ben Sander a50fa0f78e Fix bug in device bounds comparison.
Shows up in multi-GPU.


[ROCm/clr commit: f1bc9af294]
2016-02-12 18:24:08 -06:00
Maneesh Gupta 00d106a777 Documented supported fastmath functions
[ROCm/clr commit: c82511258c]
2016-02-12 14:21:58 +05:30
Maneesh Gupta 7c93bdebaf Updated integer intrinsics documentation
[ROCm/clr commit: 2659e70d48]
2016-02-12 13:58:35 +05:30
Evgeny Mankov 6add51ef8c Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.

hipify is updated as well.


[ROCm/clr commit: 460b501cbb]
2016-02-12 01:29:20 +03:00
Evgeny Mankov 735d4738ad Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
Reason: maxThreadsPerMultiProcessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.


[ROCm/clr commit: 1025341300]
2016-02-12 00:04:14 +03:00
Evgeny Mankov a8b7647f8b BDFID (BusID/DeviceID/FunctionID) support.
FunctionID (DomainID in CUDA) is not supported, because cudaDeviceProp::pciDomainID is not reported by CUDA.


[ROCm/clr commit: 658e9f0484]
2016-02-11 22:26:01 +03:00
sunway513 6bfdfc34a0 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/clr commit: fe1000df17]
2016-02-11 22:22:47 +05:30
sunway513 38cc074f08 Add reminder to keep ROCR runtime on the system library path
[ROCm/clr commit: c7cbcfa2e9]
2016-02-11 22:22:00 +05:30
Maneesh Gupta f826c7aaae Updated readme for test
[ROCm/clr commit: ed2d86f3a9]
2016-02-11 13:06:58 +05:30
Evgeny Mankov cedd1c0947 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/clr commit: 3139c72756]
2016-02-10 17:21:53 +03:00
Evgeny Mankov 2478fc078f Formatting, no functional changes
[ROCm/clr commit: d9a94191f2]
2016-02-10 17:21:18 +03:00
streamhsa 03c2768897 Remove test for atomicInc and atomicDec
[ROCm/clr commit: 90add185fd]
2016-02-10 21:02:52 +08:00
streamhsa 688a9a19a5 Updated readme for test
[ROCm/clr commit: 56f1832e70]
2016-02-10 20:05:59 +08:00
streamhsa 5d857b2bc3 Resolved test issues
[ROCm/clr commit: 2f8d56e903]
2016-02-10 20:01:16 +08:00
gargrahul 91a5b0aa77 Removed atomicInc and atomicDec support from HIP
[ROCm/clr commit: 51f46d9ddf]
2016-02-10 04:29:55 +05:30
Evgeny Mankov 9f596e0aab Device property concurrentKernels is added to the hipDeviceProp_t struct.
For the HCC path, concurrentKernels is set to true since all ROCR hardware supports this feature.
For the NVCC path, concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.


[ROCm/clr commit: 4d4ca3ef3f]
2016-02-09 17:10:35 +03:00
Maneesh Gupta 4df8743f84 which_hip -> hipconfig
[ROCm/clr commit: f8bfc7f54c]
2016-02-09 11:51:26 +05:30
Maneesh Gupta 978aac7fe0 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
[ROCm/clr commit: 0df78ac9bf]
2016-02-09 10:57:46 +05:30
Maneesh Gupta 8442259da0 Move HIP_DEVICE_COMPILE defines to hip_common.h
[ROCm/clr commit: f6e7abd710]
2016-02-09 10:57:20 +05:30
streamhsa 71d3c9f306 Rename test hipInfo as hipGetDeviceAttribute
[ROCm/clr commit: 310023e273]
2016-02-09 13:19:32 +08:00