Mustafa Abduljabbar ef6d75b3ee MSCCL Multithreaded regression root cause fix (#1347)
* Make sure the target device is used for MSCCL

* Enable single process mode by default to use MSCCL in MT

* Create a per-rank state when GPUs share a thread

[ROCm/rccl commit: 03a3ef3c34]
2024-09-25 15:24:25 -04:00
Description
No description provided
282 MiB
Languages
C++ 67.5%
C 20.6%
Python 6.6%
CMake 3.4%
Shell 0.6%
Other 1.1%