Wenkai Du
f98715baea
Merge remote-tracking branch 'nccl/master' into develop
...
[ROCm/rccl commit: abd0615351 ]
2023-06-26 22:51:56 +00:00
Ziyue Yang
f7f669e7f0
MSCCL: Improve executor and integrate scheduler ( #694 )
...
* MSCCL: improve executor and add scheduler for testing
* Use external scheduler
* Fix cmake error
* Address comments
* Fix thread safe issue
* Make MSCCL lifecycle APIs thread safe
* Make MSCCL internal scheduler aware of topology hint
* Revise error message
[ROCm/rccl commit: e3b2342f39 ]
2023-03-14 14:34:25 -07:00
Wenkai Du
c76bc214c8
Merge remote-tracking branch 'nccl/master' into HEAD
...
[ROCm/rccl commit: e1cb45ff22 ]
2023-02-04 01:44:43 +00:00
Wenkai Du
36e5e02e46
Merge remote-tracking branch 'nccl/master' into develop
...
[ROCm/rccl commit: 4f0e223db4 ]
2022-10-20 15:41:29 +00:00
Wenkai Du
7874a99c75
Merge remote-tracking branch 'nccl/master' into develop
...
[ROCm/rccl commit: a79d9e3586 ]
2022-09-09 16:05:38 +00:00
Wenkai Du
67e7e6507e
Merge remote-tracking branch 'nccl/master' into develop
...
[ROCm/rccl commit: d28e1cb44f ]
2022-04-18 11:15:25 -07:00
Wenkai Du
a76bebf8b6
Merge remote-tracking branch 'nccl/master' into develop
...
[ROCm/rccl commit: a4ea1fed5b ]
2021-05-05 16:01:01 -07:00
Wenkai Du
b4a7fa7011
Cleanup number of channels calculation ( #340 )
...
[ROCm/rccl commit: e26ad2995e ]
2021-04-05 17:51:56 -07:00
Wenkai Du
065bde98d8
collnet: support multiple NICs ( #335 )
...
[ROCm/rccl commit: d87dc7c2e8 ]
2021-03-25 20:59:32 -07:00
Wenkai Du
adff98765c
Merge remote-tracking branch 'nccl/master' into no-target-id
...
[ROCm/rccl commit: d469947641 ]
2021-01-14 19:27:53 -05:00
Wenkai Du
dbde26e681
Add Alltoallv RCCL kernel implementation ( #269 )
...
* Add alltoallv API and implementation
* Extend Rome P2P channel limit to multinode and alltoall kernels
* topo_expl: fix compilation and sync up with main
* gtest: use RCCL alltoallv API
* Code review changes
[ROCm/rccl commit: b871ea3c0c ]
2020-09-30 16:25:36 -07:00
Wenkai Du
03bb6bcb54
Increase minimal channels for gfx908 ( #259 )
...
[ROCm/rccl commit: c5cbece6d0 ]
2020-08-26 11:40:11 -07:00
Wenkai Du
8852e54181
topo_expl: update to 2.6
...
[ROCm/rccl commit: 6f54b23503 ]
2020-04-01 13:37:08 -07:00
Wenkai Du
00f421ccbd
Add topology explorer
...
[ROCm/rccl commit: 55f8e2dec7 ]
2020-02-19 14:42:06 -08:00