4c4fff6b51
This test does not work if block size is greater than wave size since it relies on lock-step execution of the kernel in the block. If there are more than waves in the block, the threads in the block miss synchronization since one wave may finish before another wave. Due to this bug, the test fails on GFX10 wave32 mode. This patch fixes that so that it works for block size greater than wave size. Change-Id: Ie0097066081df36cb6fe025a71d0ee5a83ec00a2