Add support for A100 GPU and related platforms. Add support for CUDA 11. Add support for send/receive operations (beta). [ROCm/rccl commit: 5949d96f36]
5949d96f36