gilbertlee-amd
|
a062c80298
|
[TransferBench] Displaying PCIe Bus ID (#288)
* Adding PCIe BusID per GPU in topology display
[ROCm/rccl commit: 61e1a71d14]
|
2020-10-21 16:13:36 -06:00 |
|
gilbertlee-amd
|
0282595de5
|
TransferBench Typo. Pinned host memory uses C not P (#286)
[ROCm/rccl commit: 769418c5c7]
|
2020-10-21 12:05:38 -06:00 |
|
gilbertlee-amd
|
5ca117d7cd
|
New TransferBench features (#273)
* Upgrading TransferBench to support pinned CPU memory, expanding functionality, cleaning up env vars
[ROCm/rccl commit: ee262819a7]
|
2020-09-25 12:20:48 -06:00 |
|
gilbertlee-amd
|
3e4ddd065b
|
Upgrading various TransferBench features (#257)
[ROCm/rccl commit: ec9af40fcd]
|
2020-08-19 09:47:19 -06:00 |
|
gilbertlee-amd
|
1a9b00a7fd
|
Fixes to make TransferBench compile for hipclang (#254)
[ROCm/rccl commit: c985478133]
|
2020-08-13 12:25:28 -06:00 |
|
Gilbert Lee
|
eebc6f2844
|
Adding option to re-use streams instead of re-creating per topology
[ROCm/rccl commit: 339bf9ff19]
|
2020-04-23 15:53:40 +00:00 |
|
Aaron Enye Shi
|
bfbfe370c3
|
Fix HIP-Clang build with HSA headers
HIP-Clang does not include these HSA headers, and they need to be explicitly added in RCCL.
[ROCm/rccl commit: a95090d981]
|
2020-04-03 17:58:23 -04:00 |
|
Stanley Tsang
|
e5419407c4
|
Updating copyright notices for 2020.
[ROCm/rccl commit: 20fa04d9b6]
|
2020-01-29 15:28:08 -08:00 |
|
Gilbert Lee
|
5783917a75
|
Changing single sync mode to time all iterations instead of just last
[ROCm/rccl commit: e5074ce94d]
|
2019-12-20 17:08:39 -08:00 |
|
gilbertlee-amd
|
a461b6d139
|
Adding new sleep after sync capability for data fabric profiling (#162)
Fixing missing header include for ROCM 3.0 changes
[ROCm/rccl commit: 2f4269d06d]
|
2019-12-12 15:20:54 -07:00 |
|
gilbertlee-amd
|
22cbbb9004
|
Adding interactive mode for profiling purposes (#150)
[ROCm/rccl commit: fd94f4fa25]
|
2019-11-05 17:10:16 -07:00 |
|
gilbertlee-amd
|
f9ef1553aa
|
Single Sync Timing mode (#144)
* Adding single sync timing mode to emulate timing reported by rccl-prim-test / rccl-tests
* Adding duration / overhead info
[ROCm/rccl commit: 2f9edd2432]
|
2019-11-01 10:18:25 -06:00 |
|
Gilbert Lee
|
a99accb2cb
|
Adding ability to switch between fine/coarse grain destination GPU memory
Adding ability to switch between memset/memcpy
[ROCm/rccl commit: 648c1ee7cc]
|
2019-10-29 12:00:32 -06:00 |
|
gilbertlee-amd
|
8645391260
|
Adding TransferBench tool (#113)
* Adding standalone TransferBench tool
[ROCm/rccl commit: b8cf48fc16]
|
2019-08-07 17:21:41 -06:00 |
|