David Addison
17c8317cb1
NCCL 2.4.6-1
...
Added detection of IBM/Power NVLink bridge device.
Add NUMA support to PCI distance calculations.
Added NCCL_IGNORE_CPU_AFFINITY env var.
Fix memory leaks; GithubIssue#180
Compiler warning fix; GithubIssue#178
Replace non-standard variable-length arrays. GithubIssue#171
Fix Tree+Shared Memory crash. GithubPR#185
Fix LL cleanup hang during long-running DL jobs.
Fix NCCL_RINGS environment variable handling.
Added extra checks to catch repeat calls to ncclCommDestroy() GithubIssue#191
Improve bootstrap socket connection reliability at scale.
Fix hostname hashing issue. GithubIssue#187
Code cleanup to rename all non-device files from *.cu to *.cc
[ROCm/rccl commit: f40ce73e89 ]
2019-04-05 13:05:45 -07:00
Sylvain Jeaugey
1cc88accd5
Add install target
...
Fix issue #145
[ROCm/rccl commit: bed43524cc ]
2018-11-05 09:53:59 -08:00
Sylvain Jeaugey
8ffcfac437
2.3.5-5
...
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests.
[ROCm/rccl commit: f93fe9bfd9 ]
2018-09-25 14:12:01 -07:00
Sylvain Jeaugey
338f6eaae1
Add support for CUDA9 half semantics
...
[ROCm/rccl commit: 29a1a916dc ]
2017-06-14 11:20:24 -07:00
Boris Fomitchev
12e9519ed0
Added Pascal nvcc flags, bumped version
...
[ROCm/rccl commit: 649f04d077 ]
2017-03-24 11:58:14 -07:00
Sylvain Jeaugey
4644095088
Bumping version to 1.3.3
...
[ROCm/rccl commit: 7fef264bfa ]
2017-03-01 16:44:27 -08:00
Sylvain Jeaugey
db3103d96a
1.3.2 release
...
Broadcast tuning
Better checking of inputs
Copy/reduce code simplification
[ROCm/rccl commit: 34d27771c6 ]
2016-12-01 15:17:50 -08:00
Peter Jin
95860f0721
Add a static library target "staticlib" to the Makefile.
...
Rename the static library "libnccl_static.a" to disambiguate from the
dynamic libraries.
[ROCm/rccl commit: 5765d608cc ]
2016-11-24 11:31:03 -08:00
Kyle Fernandes, ne Jacobs
a35c827aa6
Add Fortran bindings
...
[ROCm/rccl commit: 5f2b32e45b ]
2016-11-17 15:33:34 -08:00
Sylvain Jeaugey
5d496d605d
Bump to 1.3.1
...
[ROCm/rccl commit: 534b9a1697 ]
2016-10-13 10:33:05 -07:00
Sylvain Jeaugey
d4c7d8014a
Add scan tests
...
[ROCm/rccl commit: ca330b110a ]
2016-09-22 11:58:33 -07:00
Sylvain Jeaugey
b540c2b5cc
Heavy code refactoring to remove a lot of code in collectives (~1000 lines).
...
Have all collectives use the same args, the same ring, and the same primitives for synchronization between threads with the same pattern.
[ROCm/rccl commit: cabd6848e4 ]
2016-09-22 11:57:56 -07:00
Sylvain Jeaugey
996a743a44
Merge pull request #41 from jia-kai/master
...
Some minor fixes for compile/usage
[ROCm/rccl commit: 9ee6189bf9 ]
2016-09-15 09:45:52 -07:00
Sylvain Jeaugey
a59d93cf7a
Updated LICENCE.txt
...
[ROCm/rccl commit: 75bad643bd ]
2016-08-26 15:08:20 -07:00
jiakai
d74f2f154d
link library with -lrt; otherwise there is undefined reference to shm_open
...
[ROCm/rccl commit: ed401cc29b ]
2016-08-19 18:58:56 +08:00
Sylvain Jeaugey
5779f12272
Improved Deb generation
...
[ROCm/rccl commit: 7a1aa6b563 ]
2016-07-07 16:31:57 +02:00
Sylvain Jeaugey
b865b44417
Fix version number
...
[ROCm/rccl commit: 9ae84f5d6b ]
2016-06-16 17:07:42 -07:00
Sylvain Jeaugey
2a9f6574a5
Add a debug level to NCCL and CUDA versions at init
...
[ROCm/rccl commit: e51e922924 ]
2016-06-16 17:04:41 -07:00
Sylvain Jeaugey
cabb9878d2
Increased version to 1.2.3
...
[ROCm/rccl commit: 9fcc523485 ]
2016-06-15 19:18:13 -07:00
Sylvain Jeaugey
4d7076f7dc
Packaging : Generate shlibs.local
...
[ROCm/rccl commit: 67d1ab9106 ]
2016-06-15 19:03:08 -07:00
Sylvain Jeaugey
5749822385
Move deb to build directory
...
[ROCm/rccl commit: da6d2009e0 ]
2016-06-15 18:20:10 -07:00
Sylvain Jeaugey
d51c6b0f89
Fix make install to use BUILDDIR
...
[ROCm/rccl commit: 155132d336 ]
2016-06-15 18:20:02 -07:00
Sylvain Jeaugey
0ff5aa2cff
Rework debian packaging
...
[ROCm/rccl commit: 08ddfe03d2 ]
2016-06-15 18:18:44 -07:00
Boris Fomitchev
6e4066f5c5
Gencodes changed to NV recommended
...
[ROCm/rccl commit: 177505b757 ]
2016-06-06 00:06:18 -07:00
Sylvain Jeaugey
fbd1a386db
Bump to 1.2.2
...
[ROCm/rccl commit: 9d9d8cd59f ]
2016-06-03 17:21:53 -07:00
Sylvain Jeaugey
7ac49d9cdc
Better name for GENCODE
...
[ROCm/rccl commit: 1657af1567 ]
2016-06-03 10:25:37 -07:00
Sylvain Jeaugey
778bbbce93
Makefile improvements
...
- Use standard CXX env var
- Permit redefinition of more env
- Separate lib from tests
[ROCm/rccl commit: 889ad3d4e6 ]
2016-06-02 15:01:03 -07:00
Boris Fomitchev
8265e45d65
Merge pull request #21 from borisfom/master
...
Fixed install location, new .deb version
[ROCm/rccl commit: 0629fb62d7 ]
2016-04-21 14:46:41 -07:00
Boris Fomitchev
14b094e21b
Fixed install location, new .deb version
...
[ROCm/rccl commit: 0177cf3ea4 ]
2016-04-21 14:10:31 -07:00
Nathan Luehr
d3b27aa92d
Merge pull request #17 from Hopobcn/master
...
Enable compilation with specific g++
[ROCm/rccl commit: 658aca1469 ]
2016-04-21 13:25:18 -07:00
Nathan Luehr
6d6120cd1e
Moved no-as-needed flag to link rule.
...
Avoids link errors for tests linked with nvcc.
[ROCm/rccl commit: 03df4c7759 ]
2016-04-19 14:51:03 -07:00
Adam Paszke
3baec150aa
Add --no-as-needed to make sure that cudart library gets linked
...
[ROCm/rccl commit: c0c959b1be ]
2016-04-13 10:04:38 -04:00
Pau Farré
c23edf3d49
Enable compilation with old g++ when the default g++ is not supported (5.0+)
...
[ROCm/rccl commit: e30bf95989 ]
2016-04-12 12:49:13 +02:00
Boris Fomitchev
3070897cb8
Removing Tegra
...
[ROCm/rccl commit: e6f4a83da6 ]
2016-03-17 17:25:27 -07:00
Boris Fomitchev
801d962bfe
Use arch=5.3 as well
...
[ROCm/rccl commit: 62b551798f ]
2016-03-16 23:09:36 -07:00
Boris Fomitchev
cf44496d6e
Added compute 5.3
...
[ROCm/rccl commit: 92d2123d8d ]
2016-03-16 19:24:48 -07:00
Boris Fomitchev
07a4274a66
Preparing for pbuild
...
[ROCm/rccl commit: ec3de28ae5 ]
2016-03-16 19:23:49 -07:00
Boris Fomitchev
711444221f
Updated package version, added manpage
...
[ROCm/rccl commit: 941d9da08c ]
2016-02-29 12:10:34 -08:00
Nathan Luehr
16379ec27b
Added NCCL error checking to tests.
...
Also cleaned up makefile so that tests and lib are not built unnecessarily.
Change-Id: Ia0c596cc2213628de2f066be97615c09bb1bb262
Reviewed-on: http://git-master/r/999627
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
[ROCm/rccl commit: 2758353380 ]
2016-01-29 11:09:05 -08:00
Sylvain Jeaugey
42c3edc35a
Moved tests to separate dir and improved MPI test
...
test sources moved to test/ directory.
MPI test displays PASS/FAIL and returns code accordingly.
Change-Id: I058ebd1bd5202d8f38cc9787898b2480100c102b
Reviewed-on: http://git-master/r/936086
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
[ROCm/rccl commit: c05312f151 ]
2016-01-28 12:56:36 -08:00
Nathan Luehr
bb5df3dbb4
Merge pull request #6 from lukeyeager/deb
...
Deb packaging
[ROCm/rccl commit: 90af7c73ef ]
2016-01-07 13:06:28 -08:00
Luke Yeager
d529f63072
Update deb packaging scripts
...
[ROCm/rccl commit: c9da89254b ]
2015-12-18 14:23:34 -08:00
Nathan Luehr
8915ecb588
Merge pull request #5 from lukeyeager/tests-nvml
...
Don't link tests with NVML
[ROCm/rccl commit: eb2d869f71 ]
2015-12-18 13:36:20 -08:00
Boris Fomitchev
b3db23e837
Don't link tests with NVML
...
[ROCm/rccl commit: b5400c54df ]
2015-12-18 13:27:55 -08:00
Nathan Luehr
87be580f28
Merge pull request #4 from lukeyeager/build-sm50
...
Build SM 5.0 code
[ROCm/rccl commit: a4de6016f8 ]
2015-12-18 13:23:48 -08:00
Luke Yeager
eff4cd1cb3
Build SM 5.0 code
...
[ROCm/rccl commit: dd0884b707 ]
2015-12-18 13:19:50 -08:00
Luke Yeager
a0d7e36891
Use semantic versioning
...
[ROCm/rccl commit: e1634ca6cb ]
2015-12-18 12:02:17 -08:00
Nathan Luehr
529702aec6
Initial release.
...
[ROCm/rccl commit: 0673d5f44f ]
2015-11-17 11:30:40 -08:00