Andy li e373bd44bf Enable fp8 support (#1101)
* initial checkin

* resolve cr comments

* resolve the build issue

* fix the data correctless issue

* update fp8 header file and update the unit test for fp8 support

* remove fp16 from fp8 headers

* fix ut issue and catch up the latest code from develop

* udate according to cr comments

* update ut according to cr comments

* update num floats for each SumPostDiv from 4 to 6

* update fp8 header file name

* fix the typo

[ROCm/rccl commit: 6777e65c1d]
2024-03-08 15:17:53 -08:00
2024-03-08 15:17:53 -08:00
S
Açıklama
Hiçbir açıklama sağlanmadı
282 MiB
Dil
C++ 67.5%
C 20.6%
Python 6.6%
CMake 3.4%
Shell 0.6%
Diğer 1.1%