964c4c2061
This commit was cherry-picked and modified from https://github.com/NVIDIA/nccl/commit/5949d96f36d050e59d05872f8bbffd2549318e95