From 53478f138ed96bb50ee6d7823fdbefffc947c1e2 Mon Sep 17 00:00:00 2001 From: Sandra Polifroni Date: Mon, 23 Sep 2024 14:27:58 -0400 Subject: [PATCH] Updated the information for 6.2.1 in the changelog so that it reflects what's in the 6.2.1 release notes [ROCm/rccl commit: 7f87b0cd85e9b83ce98770f6a3063f09b012f240] --- projects/rccl/CHANGELOG.md | 14 +++++++++++++- 1 file changed, 13 insertions(+), 1 deletion(-) diff --git a/projects/rccl/CHANGELOG.md b/projects/rccl/CHANGELOG.md index 3ba664a2f7..d753056f0d 100644 --- a/projects/rccl/CHANGELOG.md +++ b/projects/rccl/CHANGELOG.md @@ -1,7 +1,19 @@ -# Change Log for RCCL +# Changelog for RCCL Full documentation for RCCL is available at [https://rccl.readthedocs.io](https://rccl.readthedocs.io) +## RCCL 2.20.5 for ROCm 6.2.1 + +### Known issues + +On systems running Linux kernel 6.8.0, such as Ubuntu 24.04, Direct Memory Access (DMA) transfers between the GPU and NIC are disabled and impacts multi-node RCCL performance. + +This issue was reproduced with RCCL 2.20.5 (ROCm 6.2.0 and 6.2.1) on systems with Broadcom Thor-2 NICs and affects other systems with RoCE networks using Linux 6.8.0 or newer. + +Older RCCL versions are also impacted. + +This issue will be addressed in a future ROCm release. + ## Unreleased - RCCL 2.20.5 for ROCm 6.2.0 ### Changed - Compatibility with NCCL 2.20.5