bb47eee7cc47102a16edd87c6d1faf5000363981
Co-authored-by: Prasannakumar Murugesan <prmuruge@amd.com>
As part of an earlier commit, bfloat16 handling in reduce kernel for FuncMinMax fell into generic/default template when there is no SPECIALIZE_REDUCE for a particular type, this generic template does a bitwise integer comparison and it broke bfloat16 ops.
change the else-if statement to else statement, that way it covers both ROCm version < 6.0 and >= 6.0 (with ROCm > 6.0, device.h already typedefs __hip_bfloat16 to hip_bfloat16, so no special case is needed here).
[ROCm/rccl commit: fa366ac03f]
Açıklama
Hiçbir açıklama sağlanmadı
Dil
C++
67.5%
C
20.6%
Python
6.6%
CMake
3.4%
Shell
0.6%
Diğer
1.1%