Ben Sander
beae41517d
Describe how to update HTML docs
...
[ROCm/clr commit: 16ff0757a6 ]
2016-02-19 01:56:17 -06:00
Ben Sander
7760c82196
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
[ROCm/clr commit: d0f9881d60 ]
2016-02-17 23:06:51 -06:00
Ben Sander
f19c6d8342
Enable Tracker and ROCR by default, verify with HCC
...
[ROCm/clr commit: 400dcb8bcb ]
2016-02-17 23:03:37 -06:00
Ben Sander
8ed32daefa
Remove HIP-local AM tracker (now in HCC)
...
[ROCm/clr commit: b08e468c06 ]
2016-02-17 21:33:32 -06:00
Ben Sander
98e608a5ce
USE_AM_TRACKER=0 works
...
[ROCm/clr commit: 354c9f945a ]
2016-02-17 21:23:36 -06:00
Ben Sander
f8d9017103
Tweak full formatting
...
[ROCm/clr commit: d75279c8eb ]
2016-02-17 21:22:31 -06:00
Ben Sander
527d64a2da
Support HSA_PATH env, async path tweak
...
[ROCm/clr commit: 9a82d316c3 ]
2016-02-17 21:22:07 -06:00
Ben Sander
6856d28ca3
more work on async copies
...
[ROCm/clr commit: 0cdbe1ff05 ]
2016-02-17 00:59:12 -06:00
Ben Sander
0e83efe14d
Add per-stream pool for hsa_signals.
...
[ROCm/clr commit: 5d721a2649 ]
2016-02-16 01:59:13 -06:00
Ben Sander
d0b29d9192
Add comments to tests
...
[ROCm/clr commit: 731a2a58d3 ]
2016-02-16 01:58:24 -06:00
Ben Sander
f8f40e07bf
Update before checkin to HCC.
...
Add support for USE_AM_TRACKER=2 (HCC version).
Add AM_ALLOC, AM_FREE indirection to ease swapping AM implementations.
[ROCm/clr commit: 1ed431c0f6 ]
2016-02-15 21:16:00 -06:00
Ben Sander
93c07bc3d1
Move warpSize to header, have shuffles use default warpsize.
...
[ROCm/clr commit: bd7e3b83b9 ]
2016-02-15 05:41:09 -06:00
Ben Sander
dc6f0ef3a6
Remove old include path.
...
[ROCm/clr commit: 3b2d4acabc ]
2016-02-15 05:40:37 -06:00
Ben Sander
84810268c0
Update docs, cleanup
...
[ROCm/clr commit: 322a3bd9b2 ]
2016-02-15 05:40:12 -06:00
Ben Sander
7e46e90591
Fix tests to account for multi-gpu
...
[ROCm/clr commit: afbe451b0d ]
2016-02-15 05:19:52 -06:00
Ben Sander
ed618e637e
Add multi-threading synchonization on staging buffers and signals.
...
Also pre-allocate a couple signals for copies.
[ROCm/clr commit: 8939b4f0e5 ]
2016-02-13 03:18:01 -06:00
Ben Sander
275a36ee47
Enable -O3, style points on array size
...
[ROCm/clr commit: 56b3d2e7c4 ]
2016-02-13 03:17:42 -06:00
Ben Sander
61b2c054a2
D2H multi-buffer
...
[ROCm/clr commit: a002833a89 ]
2016-02-13 01:15:23 -06:00
Ben Sander
e7666bd9d0
Result formatting
...
[ROCm/clr commit: 8e3dd664eb ]
2016-02-13 01:14:01 -06:00
Ben Sander
6096173cd1
Add D2H test
...
[ROCm/clr commit: 1cd35f6371 ]
2016-02-12 22:47:26 -06:00
Ben Sander
06e7be346a
Add D2H test
...
[ROCm/clr commit: 81c275da13 ]
2016-02-12 22:46:34 -06:00
Ben Sander
ea298fae23
Add Bus Bandwidth test, leveraged from SHOC.
...
[ROCm/clr commit: ba7497b409 ]
2016-02-12 21:30:43 -06:00
Ben Sander
b8ea1ec374
Improve copy testing
...
[ROCm/clr commit: 2353cbb028 ]
2016-02-12 18:24:08 -06:00
Ben Sander
4dfe77a99b
Improve copy testing implementation.
...
- add tests for (unpinned/pinned) x H2H x D2D.
- Free memory at end of test.
[ROCm/clr commit: 1128610801 ]
2016-02-12 18:24:08 -06:00
Ben Sander
89e461988e
Step1 in staging buffer copy.
...
- use StagingBuffer class for copies.
- refactor g_device to use array rather than vector.
(keeps pointers from moving).
[ROCm/clr commit: 90af462b85 ]
2016-02-12 18:24:08 -06:00
Ben Sander
5978d5f372
Query tracked memory sizes.
...
Support more accurate hipMemGetInfo. Add test to hipPointerAttrib.
[ROCm/clr commit: f464cedcf4 ]
2016-02-12 18:24:08 -06:00
Ben Sander
80d7c867d1
Remove ! USE_PINNED_HOST support
...
[ROCm/clr commit: f2c1bf3bc0 ]
2016-02-12 18:24:08 -06:00
Ben Sander
712750e1a5
Use memtracker 'appID' to store deviceID associated with ptr
...
[ROCm/clr commit: c04b5d3afb ]
2016-02-12 18:24:08 -06:00
Ben Sander
2089e549eb
Tracker improvements
...
- add API to add / remove user-pointers from the tracker.
- test for thread-safety with MultiThreadtest_2 - rapid
insertions/removal.
- add mutex to provide thread-safety.
- rename tracker interface to "memtracker_..." for consistency.
- add am_memtracker_reset, connect to hipDeviceReset.
-
[ROCm/clr commit: 7216727fba ]
2016-02-12 18:24:08 -06:00
Ben Sander
fe67be1134
Create address tracker for am_alloc.
...
Tracks device where memory is allocated, pinned-host or device, and
more.
Uses memory-range-based lookups - so pointers that exist anywhere in
the range of hostPtr + size will find the associated AmPointerInfo.
The insertions and lookups use a self-balancing binary tree and
should support O(logN) lookup speed.
[ROCm/clr commit: 721508cc2f ]
2016-02-12 18:24:08 -06:00
Ben Sander
a50fa0f78e
Fix bug in device bounds comparison.
...
Shows up in multi-GPU.
[ROCm/clr commit: f1bc9af294 ]
2016-02-12 18:24:08 -06:00
Maneesh Gupta
00d106a777
Documented supported fastmath functions
...
[ROCm/clr commit: c82511258c ]
2016-02-12 14:21:58 +05:30
Maneesh Gupta
7c93bdebaf
Updated integer intrinsics documentation
...
[ROCm/clr commit: 2659e70d48 ]
2016-02-12 13:58:35 +05:30
Evgeny Mankov
6add51ef8c
Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
...
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.
hipify is updated as well.
[ROCm/clr commit: 460b501cbb ]
2016-02-12 01:29:20 +03:00
Evgeny Mankov
735d4738ad
Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
...
Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size.
NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
[ROCm/clr commit: 1025341300 ]
2016-02-12 00:04:14 +03:00
Evgeny Mankov
a8b7647f8b
BDFID (BusID/DeviceID/FunctionID) support.
...
Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.
[ROCm/clr commit: 658e9f0484 ]
2016-02-11 22:26:01 +03:00
sunway513
6bfdfc34a0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: fe1000df17 ]
2016-02-11 22:22:47 +05:30
sunway513
38cc074f08
Add reminder to keep ROCR runtime on the system library path
...
[ROCm/clr commit: c7cbcfa2e9 ]
2016-02-11 22:22:00 +05:30
Maneesh Gupta
f826c7aaae
Updated readme for test
...
[ROCm/clr commit: ed2d86f3a9 ]
2016-02-11 13:06:58 +05:30
Evgeny Mankov
cedd1c0947
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: 3139c72756 ]
2016-02-10 17:21:53 +03:00
Evgeny Mankov
2478fc078f
Formatting, no functional changes
...
[ROCm/clr commit: d9a94191f2 ]
2016-02-10 17:21:18 +03:00
streamhsa
03c2768897
Remove test for atomicInc and atomicDec
...
[ROCm/clr commit: 90add185fd ]
2016-02-10 21:02:52 +08:00
streamhsa
688a9a19a5
Updated readme for test
...
[ROCm/clr commit: 56f1832e70 ]
2016-02-10 20:05:59 +08:00
streamhsa
5d857b2bc3
Resolved test issues
...
[ROCm/clr commit: 2f8d56e903 ]
2016-02-10 20:01:16 +08:00
gargrahul
91a5b0aa77
Removed atomicInc and atomicDec support from HIP
...
[ROCm/clr commit: 51f46d9ddf ]
2016-02-10 04:29:55 +05:30
Evgeny Mankov
9f596e0aab
Device property concurrentKernels is added to hipDeviceProp_t struct.
...
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
[ROCm/clr commit: 4d4ca3ef3f ]
2016-02-09 17:10:35 +03:00
Maneesh Gupta
4df8743f84
which_hip -> hipconfig
...
[ROCm/clr commit: f8bfc7f54c ]
2016-02-09 11:51:26 +05:30
Maneesh Gupta
978aac7fe0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: 0df78ac9bf ]
2016-02-09 10:57:46 +05:30
Maneesh Gupta
8442259da0
Move HIP_DEVICE_COMPILE defines to hip_common.h
...
[ROCm/clr commit: f6e7abd710 ]
2016-02-09 10:57:20 +05:30
streamhsa
71d3c9f306
Rename test hipInfo as hipGetDeviceAttribute
...
[ROCm/clr commit: 310023e273 ]
2016-02-09 13:19:32 +08:00