Starting with NCCL 2.27.x, we can use the Symmetric Memory APIs (-R 2) [ROCm/rccl-tests commit: a5c539e68b]