Wenkai Du
0f4d497edc
Add gfx90a target (#344)
...
* Add gfx90a target
* Support gfx90a topology
Co-authored-by: Eiden Yoshida <eiden.yoshida@amd.com>
[ROCm/rccl commit: 1fe031402a]
2021-04-14 09:29:00 -06:00
Wenkai Du
065bde98d8
collnet: support multiple NICs (#335)
...
[ROCm/rccl commit: d87dc7c2e8]
2021-03-25 20:59:32 -07:00
Wenkai Du
287ed0f18a
Enable collnet in RCCL (#333)
...
* Enable CollNet and use different number of channels
* topo_expl: enable collnet
[ROCm/rccl commit: 1d6244b18d]
2021-03-19 12:58:13 -07:00
Wenkai Du
fe8923ebba
Add gfx908 Rome 4 NICs model
...
[ROCm/rccl commit: 6dfdfef98f]
2021-02-06 00:19:47 +00:00
Wenkai Du
4ea285c527
Fix Rome PCIe 2 node topology generation (#310)
...
[ROCm/rccl commit: 373a108516]
2020-12-15 17:16:17 -08:00
Wenkai Du
b68ff1ebba
Add Rome model and improve search (#305)
...
[ROCm/rccl commit: 975b14dffa]
2020-11-17 14:55:06 -08:00
Wenkai Du
c0c64d970a
Add more Rome models (#292)
...
[ROCm/rccl commit: dfa3c41ede]
2020-10-30 21:26:04 -07:00
Wenkai Du
8b120c0508
Update Rome single node models (#277)
...
[ROCm/rccl commit: 33babcb5e2]
2020-10-13 13:33:09 -07:00
Wenkai Du
41260bb948
Rework Rome detection and add multiple network ports models (#274)
...
* Rework Rome detection and add multiple network ports models
* Remove unused opCount in p2p transport
[ROCm/rccl commit: ae008fd2db]
2020-10-07 13:37:36 -07:00
lijietang
f6b08ca547
Add rccl bw test script in tools (#255)
...
[ROCm/rccl commit: bbe233f8c1]
2020-09-11 16:59:03 +08:00
Wenkai Du
03bb6bcb54
Increase minimal channels for gfx908 (#259)
...
[ROCm/rccl commit: c5cbece6d0]
2020-08-26 11:40:11 -07:00
Wenkai Du
5f49a0e088
Add NPS4 support on some models (#256)
...
* Add NPS4 support on some models
* Add XML models
[ROCm/rccl commit: 391bbf3f1e]
2020-08-19 11:03:20 -07:00
Wenkai Du
3d5fb8142e
Add another Rome model (#249)
...
* Add another Rome model
* Add gfx908 4P3L models and support
* Revert "Use cached value for detecting GDR support only once"
This reverts commit 0108a1219d.
* Skip using ibverb for GPU direct RDMA detection
* Fine tune one Rome model
[ROCm/rccl commit: a51e4071e3]
2020-08-17 10:51:02 -07:00
Wenkai Du
c9815aaa36
Add more Rome 4P2H models
...
[ROCm/rccl commit: 09ef75656a]
2020-08-06 18:20:02 +00:00
Wenkai Du
487f93b83f
Topology tuning for 4P2H on Rome (#242)
...
* Topology tuning for 4P2H on Rome
* Use ncclTopoIdToIndex
[ROCm/rccl commit: e7a10aa0e4]
2020-07-27 11:53:57 -07:00
Wenkai Du
f604fc774e
Add 8P6L multi-node models (#239)
...
[ROCm/rccl commit: d5f90e19b5]
2020-07-21 14:10:36 -07:00
Wenkai Du
27519fd019
Give preference to path with more XGMI connections
...
[ROCm/rccl commit: b3c9852634]
2020-05-14 15:33:16 -07:00
Wenkai Du
7882b2f0c5
topo_expl: add a few more single node models
...
[ROCm/rccl commit: 32388d60a9]
2020-03-02 11:43:03 -08:00
Wenkai Du
a36c2ecbc4
Add topology visualizer tool
...
[ROCm/rccl commit: 498d5029ad]
2020-02-26 15:23:34 -08:00