Logo
Procházet Nápověda
Přihlásit se
AI/rocm-systems
2
0
Rozštěpit 0
Již jsi rozštěpil rocm-systems
Zdrojový kód Úkoly Pull requesty Akce Balíčky Projekty Vydání Wiki Aktivita
Files
fdf75fd2c1131a2108d4868771851b3992f6b6dc
rocm-systems/ext-src
T
Historie
Nusrat Islam fdf75fd2c1 ext-src: tuning for allreduce8 kernel (#1560)
This PR tunes the number of threadblocks used for larger (>1MB)
message sizes.
2025-02-21 19:34:38 -06:00
..
json @ 9cca280a4d
Added nlohmann/json:v3.11.3 as a submodule in ext-src and passed its path into the mscclpp build to avoid downloading the package at build time. (#1330)
2024-09-11 16:54:26 -06:00
mscclpp @ 4ee15b7ad0
update mscclpp (#1488)
2025-01-20 08:06:43 -06:00
bf16-tuning.patch
ext-src: tuning for allreduce8 kernel (#1560)
2025-02-21 19:34:38 -06:00
check_ibv_access_relaxed_ordering.cc
[MSCCLPP] IBVerbs: Check if IBV_ACCESS_RELAXED_ORDERING exists (#1483)
2025-01-08 08:38:51 -06:00
cpx.patch
Enable MSCCLPP use in CPX mode (#1355)
2024-10-02 11:52:04 -05:00
mem-reg.patch
Update MSCCL++ register/deregister (#1523)
2025-02-04 09:09:56 -06:00
mscclpp_ibv_access_relaxed_ordering.patch
[MSCCLPP] IBVerbs: Check if IBV_ACCESS_RELAXED_ORDERING exists (#1483)
2025-01-08 08:38:51 -06:00
non-multiple-128-fix.patch
ext-src: fix mscclpp allreduce for non-multiple of 128 message sizes (#1556)
2025-02-21 11:58:10 -06:00
read-allred.patch
Tune allreduce performance in CPX mode (single OAM) (#1508)
2025-01-29 08:58:48 -06:00
© 2020 badstorm.xyz - : 1.26.2
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어