Current NCCL code does not abort for failed Flush operations by underlying network. This may compromise data integrity. Signed-off-by: Rashika Kheria <rashika@amazon.com> [ROCm/rccl commit: 6c61492eba]
6c61492eba