Documentation updates for NCCL 2.7.0 (#219)
* Making hip-clang the default compiler; documentation update * Adding back --hip-clang to install.sh as a silent option for CI * Documentation updates for NCCL 2.7 * Restoring deleted line in install script
Este cometimento está contido em:
cometido por
GitHub
ascendente
0023b9b081
cometimento
8d21adb5e3
+1
-1
@@ -4,7 +4,7 @@ ROCm Communication Collectives Library
|
||||
|
||||
## Introduction
|
||||
|
||||
RCCL (pronounced "Rickle") is a stand-alone library of standard collective communication routines for GPUs, implementing all-reduce, all-gather, reduce, broadcast, and reduce-scatter. It has been optimized to achieve high bandwidth on platforms using PCIe, xGMI as well as networking using InfiniBand Verbs or TCP/IP sockets. RCCL supports an arbitrary number of GPUs installed in a single node or multiple nodes, and can be used in either single- or multi-process (e.g., MPI) applications.
|
||||
RCCL (pronounced "Rickle") is a stand-alone library of standard collective communication routines for GPUs, implementing all-reduce, all-gather, reduce, broadcast, reduce-scatter, gather, scatter, and all-to-all. There is also initial support for direct GPU-to-GPU send and receive operations. It has been optimized to achieve high bandwidth on platforms using PCIe, xGMI as well as networking using InfiniBand Verbs or TCP/IP sockets. RCCL supports an arbitrary number of GPUs installed in a single node or multiple nodes, and can be used in either single- or multi-process (e.g., MPI) applications.
|
||||
|
||||
The collective operations are implemented using ring and tree algorithms and have been optimized for throughput and latency. For best performance, small operations can be either batched into larger operations or aggregated through the API.
|
||||
|
||||
|
||||
Criar uma nova questão referindo esta
Bloquear um utilizador