This fixes a deadlock introduced by the switch to TTAS (test-and-test-and-set) loops. It is mildly urgent: merging it soon keeps the CI from picking up the broken code. [ROCm/hip commit: a855a13c22]