AI/rocm-systems
85eb1f16bcdab1fa7abc754ad6a3606a9405fb12
rocm-systems/ext-src
Nusrat Islam fdf75fd2c1 ext-src: tuning for allreduce8 kernel (#1560)
This PR tunes the number of threadblocks used for larger (>1MB)
message sizes.
2025-02-21 19:34:38 -06:00
json @ 9cca280a4d
Added nlohmann/json:v3.11.3 as a submodule in ext-src and passed its path into the mscclpp build to avoid downloading the package at build time. (#1330)
2024-09-11 16:54:26 -06:00
mscclpp @ 4ee15b7ad0
update mscclpp (#1488)
2025-01-20 08:06:43 -06:00
bf16-tuning.patch
ext-src: tuning for allreduce8 kernel (#1560)
2025-02-21 19:34:38 -06:00
check_ibv_access_relaxed_ordering.cc
[MSCCLPP] IBVerbs: Check if IBV_ACCESS_RELAXED_ORDERING exists (#1483)
2025-01-08 08:38:51 -06:00
cpx.patch
Enable MSCCLPP use in CPX mode (#1355)
2024-10-02 11:52:04 -05:00
mem-reg.patch
Update MSCCL++ register/deregister (#1523)
2025-02-04 09:09:56 -06:00
mscclpp_ibv_access_relaxed_ordering.patch
[MSCCLPP] IBVerbs: Check if IBV_ACCESS_RELAXED_ORDERING exists (#1483)
2025-01-08 08:38:51 -06:00
non-multiple-128-fix.patch
ext-src: fix mscclpp allreduce for non-multiple of 128 message sizes (#1556)
2025-02-21 11:58:10 -06:00
read-allred.patch
Tune allreduce performance in CPX mode (single OAM) (#1508)
2025-01-29 08:58:48 -06:00