Revert "Tuning the inline and unroll to reduce the scratch usage" [ROCm/rccl commit: af703877cf]
af703877cf