* use 64 threads for reduction test much faster with IPC backend. * change all relevant collective tests. [ROCm/rocshmem commit: c35210f174]
c35210f174