prasanna-amd bb47eee7cc fix bug in reduce kernel bfloat16 for ROCm >= 6.0 (#2139)
Co-authored-by: Prasannakumar Murugesan <prmuruge@amd.com>
As part of an earlier commit, bfloat16 handling in reduce kernel for FuncMinMax fell into generic/default template when there is no SPECIALIZE_REDUCE for a particular type, this generic template does a bitwise integer comparison and it broke bfloat16 ops.
change the else-if statement to else statement, that way it covers both ROCm version < 6.0 and >= 6.0 (with ROCm > 6.0, device.h already typedefs __hip_bfloat16 to hip_bfloat16, so no special case is needed here).

[ROCm/rccl commit: fa366ac03f]
2026-01-20 14:07:20 -08:00
S
Açıklama
Hiçbir açıklama sağlanmadı
282 MiB
Dil
C++ 67.5%
C 20.6%
Python 6.6%
CMake 3.4%
Shell 0.6%
Diğer 1.1%