From 46e09f18c8372fa0be870dfa9858bc0071b8a3da Mon Sep 17 00:00:00 2001 From: David Addison Date: Mon, 19 May 2025 18:20:22 -0700 Subject: [PATCH] Add support for Symmetric Memory Registration From NCCL 2.27.x we can now use the Symmetric Memory APIs (-R 2) [ROCm/rccl-tests commit: a5c539e68bb7263304997012498b0cd0667b99e8] --- projects/rccl-tests/README.md | 2 +- projects/rccl-tests/src/common.cu | 58 +++++++++++++++++++++++++------ 2 files changed, 48 insertions(+), 12 deletions(-) diff --git a/projects/rccl-tests/README.md b/projects/rccl-tests/README.md index bdafbe5a16..22687d4771 100644 --- a/projects/rccl-tests/README.md +++ b/projects/rccl-tests/README.md @@ -78,7 +78,7 @@ All tests support the same set of arguments : * `-z,--blocking <0/1>` Make NCCL collective blocking, i.e. have CPUs wait and sync after each collective. Default : 0. * `-G,--cudagraph ` Capture iterations as a CUDA graph and then replay specified number of times. Default : 0. * `-C,--report_cputime <0/1>]` Report CPU time instead of latency. Default : 0. - * `-R,--local_register <1/0>` enable local buffer registration on send/recv buffers. Default : 0. + * `-R,--local_register <0/1/2> enable local (1) or symmetric (2) buffer registration on send/recv buffers. Default : 0. * `-T,--timeout