2ab8a3a750cc0ced8c8789c2f8a34c8b3b3d8661
Add support for alternating rings; allow cross-NIC rings without
cross-rail communication.
Add support for user buffer registration for network send/recv.
Optimize aggregated operations to better utilize all channels.
Add flattening for BCM PCI gen5 switches.
Add support for inter-node NVLink communication.
Add support for port fusion in NET/IB.
Add support for ReduceScatter and AllGather using Collnet.
Update net API to v8.
Fix hang during A2A connection.
[ROCm/rccl commit: b6475625fb]
Languages: C++ 67.5%, C 20.6%, Python 6.6%, CMake 3.4%, Shell 0.6%, Other 1.1%