2
0
Add support for CUDA graphs.
Fuse BCM Gen4 switches to avoid suboptimal performance on some platforms. Issue #439.
Fix bootstrap issue caused by connection reordering.
Fix CPU locking block.
Improve CollNet algorithm.
Improve performance on DGX A100 for communicators with only one GPU per node.
Este cometimento está contido em:
Sylvain Jeaugey
2021-04-12 16:00:11 -07:00
ascendente 911d61f214
cometimento a46ea10583
43 ficheiros modificados com 2687 adições e 1244 eliminações
+2 -2
Ver ficheiro
@@ -1,6 +1,6 @@
##### version
NCCL_MAJOR := 2
NCCL_MINOR := 8
NCCL_PATCH := 4
NCCL_MINOR := 9
NCCL_PATCH := 6
NCCL_SUFFIX :=
PKG_REVISION := 1