16 Commits

Author SHA1 Message Date
Jeffrey Novotny ed70fc066a Update license file for 2025 (#1480)
[ROCm/rccl commit: 3fefd31b07]
2025-01-07 14:47:55 -05:00
Jeffrey Novotny d7498b88a5 Refactor how to docs and formatting fixes (#1444)
[ROCm/rccl commit: 9aa5b9f02e]
2024-12-10 08:47:24 -05:00
corey-derochie-amd b8542c2477 Integrated RCCL with MSCCL++ for small message sizes (#1231)
[ROCm/rccl commit: 6dc47eecd7]
2024-07-12 15:32:58 -06:00
gilbertlee-amd 05b8523a7b Updating NOTICES.txt and LICENSE.txt (#770)
[ROCm/rccl commit: 20b567caac]
2023-06-07 09:45:03 -06:00
Ziyue Yang f7f669e7f0 MSCCL: Improve executor and integrate scheduler (#694)
* MSCCL: improve executor and add scheduler for testing

* Use external scheduler

* Fix cmake error

* Address comments

* Fix thread safe issue

* Make MSCCL lifecycle APIs thread safe

* Make MSCCL internal scheduler aware of topology hint

* Revise error message

[ROCm/rccl commit: e3b2342f39]
2023-03-14 14:34:25 -07:00
gilbertlee-amd 0ca30fb88a Updating files for missing licenses (#637)
[ROCm/rccl commit: ebb8b5bf63]
2022-10-14 13:49:16 -06:00
rachanaramanna 709b3dc85b Update LICENSE.txt (#450)
[ROCm/rccl commit: 04c10a6025]
2021-11-05 09:13:53 -06:00
Wenkai Du ab71643c99 Merge remote-tracking branch 'nccl/master' into 2.8.3
[ROCm/rccl commit: c985358e11]
2021-02-15 18:44:47 -05:00
Sylvain Jeaugey fc7bdb38a5 2.8.4-1
Fix hang in corner cases of alltoallv using point to point send/recv.
Harmonize error messages.
Fix missing NVTX section in the license.
Update README.


[ROCm/rccl commit: 911d61f214]
2021-02-09 15:36:48 -08:00
Stanley Tsang e5419407c4 Updating copyright notices for 2020.
[ROCm/rccl commit: 20fa04d9b6]
2020-01-29 15:28:08 -08:00
Wenkai Du 6047487815 RCCL 2.4 update
[ROCm/rccl commit: f11c8f60cd]
2019-08-14 10:42:35 -07:00
David Addison 17c8317cb1 NCCL 2.4.6-1
Added detection of IBM/Power NVLink bridge device.
    Add NUMA support to PCI distance calculations.
    Added NCCL_IGNORE_CPU_AFFINITY env var.
    Fix memory leaks; GithubIssue#180
    Compiler warning fix; GithubIssue#178
    Replace non-standard variable length arrays. GithubIssue#171
    Fix Tree+Shared Memory crash. GithubPR#185
    Fix LL cleanup hang during long running DL jobs.
    Fix NCCL_RINGS environment variable handling.
    Added extra checks to catch repeat calls to ncclCommDestroy() GithubIssue#191
    Improve bootstrap socket connection reliability at scale.
    Fix hostname hashing issue. GithubIssue#187
    Code cleanup to rename all non device files from *.cu to *.cc


[ROCm/rccl commit: f40ce73e89]
2019-04-05 13:05:45 -07:00
Sylvain Jeaugey 8ffcfac437 2.3.5-5
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .


[ROCm/rccl commit: f93fe9bfd9]
2018-09-25 14:12:01 -07:00
Cliff Woolley 0c51957874 Update LICENSE.txt
[ROCm/rccl commit: 234c8c9ef3]
2016-08-26 15:39:21 -07:00
Sylvain Jeaugey a59d93cf7a Updated LICENCE.txt
[ROCm/rccl commit: 75bad643bd]
2016-08-26 15:08:20 -07:00
Nathan Luehr 529702aec6 Initial release.
[ROCm/rccl commit: 0673d5f44f]
2015-11-17 11:30:40 -08:00