f29d59aa00
This commit ensures that GPU finishes all kernel before destroying
communicator thread.
[ROCm/rccl commit: 52654e2301]