Sylvain Jeaugey
d4fbd0464e
Removing unneeded includes
...
[ROCm/rccl commit: acb93d1aed ]
2016-06-02 17:33:43 -07:00
Sylvain Jeaugey
778bbbce93
Makefile improvements
...
- Use standard CXX env var
- Permit redefinition of more env
- Separate lib from tests
[ROCm/rccl commit: 889ad3d4e6 ]
2016-06-02 15:01:03 -07:00
Boris Fomitchev
d3ea2e79ce
Merge pull request #22 from borisfom/master
...
Fixed version in ChangeLog
[ROCm/rccl commit: 93538def65 ]
2016-04-21 18:58:44 -07:00
Boris Fomitchev
12a8dc51f1
Fixed version in ChangeLog
...
[ROCm/rccl commit: e5067b6611 ]
2016-04-21 16:28:13 -07:00
Boris Fomitchev
8265e45d65
Merge pull request #21 from borisfom/master
...
Fixed install location, new .deb version
[ROCm/rccl commit: 0629fb62d7 ]
2016-04-21 14:46:41 -07:00
Boris Fomitchev
14b094e21b
Fixed install location, new .deb version
...
[ROCm/rccl commit: 0177cf3ea4 ]
2016-04-21 14:10:31 -07:00
Nathan Luehr
d3b27aa92d
Merge pull request #17 from Hopobcn/master
...
Enable compilation with specific g++
[ROCm/rccl commit: 658aca1469 ]
2016-04-21 13:25:18 -07:00
Nathan Luehr
6d6120cd1e
Moved no-as-needed flag to link rule.
...
Avoids link errors for tests linked with nvcc.
[ROCm/rccl commit: 03df4c7759 ]
2016-04-19 14:51:03 -07:00
Nathan Luehr
bd7eec1f8d
Merge pull request #18 from apaszke/master
...
Add --no-as-needed to make sure that cudart library gets linked
[ROCm/rccl commit: 0d4f8f4e95 ]
2016-04-19 11:11:39 -07:00
Sylvain Jeaugey
ae125930ee
Fix readme to reflect the new test paths
...
[ROCm/rccl commit: ddd3f2084d ]
2016-04-19 11:09:25 -07:00
Sylvain Jeaugey
50903c8ce0
Fix random deadlock during ncclCommInitRank.
...
[ROCm/rccl commit: dba3ec9428 ]
2016-04-19 10:47:27 -07:00
Sylvain Jeaugey
6e57ed83df
Fix MPI test usage
...
Only display usage from rank 0 and exit instead of continuing (and seg fault).
[ROCm/rccl commit: 9de361a1b9 ]
2016-04-19 10:43:38 -07:00
Adam Paszke
3baec150aa
Add --no-as-needed to make sure that cudart library gets liked
...
[ROCm/rccl commit: c0c959b1be ]
2016-04-13 10:04:38 -04:00
Pau Farré
c23edf3d49
Enable compilation with old g++ when the default g++ is not supported (+5.0)
...
[ROCm/rccl commit: e30bf95989 ]
2016-04-12 12:49:13 +02:00
Boris Fomitchev
014160c90a
Merge pull request #16 from borisfom/master
...
Remved Tegra, fixed + format.
[ROCm/rccl commit: b16cc5d197 ]
2016-03-17 17:35:04 -07:00
Boris Fomitchev
3070897cb8
Removing Tegra
...
[ROCm/rccl commit: e6f4a83da6 ]
2016-03-17 17:25:27 -07:00
Boris Fomitchev
55cab0074c
fixed version format
...
[ROCm/rccl commit: 1a8bae5b2f ]
2016-03-17 17:13:45 -07:00
Boris Fomitchev
6a08588c8a
Merge pull request #15 from borisfom/master
...
Fixing version number and compile param for 5.3
[ROCm/rccl commit: e8eb285a59 ]
2016-03-17 16:03:05 -07:00
Boris Fomitchev
b154c4e121
Version with . 7.5
...
[ROCm/rccl commit: b508d28123 ]
2016-03-17 15:48:48 -07:00
Boris Fomitchev
801d962bfe
Use arch=5.3 as well
...
[ROCm/rccl commit: 62b551798f ]
2016-03-16 23:09:36 -07:00
Boris Fomitchev
de8239e29c
Delete libnccl1_1.1.1+cuda75_amd64.deb
...
[ROCm/rccl commit: dfbebe395c ]
2016-03-16 21:44:13 -07:00
Boris Fomitchev
2728930f32
Delete libnccl-dev_1.1.1+cuda75_amd64.deb
...
[ROCm/rccl commit: 85280b5bf4 ]
2016-03-16 21:44:04 -07:00
Boris Fomitchev
ebe3a86610
Added files via upload
...
[ROCm/rccl commit: fb53cfd9b0 ]
2016-03-16 21:42:47 -07:00
Boris Fomitchev
cf44496d6e
Added compute 5.3
...
[ROCm/rccl commit: 92d2123d8d ]
2016-03-16 19:24:48 -07:00
Boris Fomitchev
07a4274a66
Preparing for pbuild
...
[ROCm/rccl commit: ec3de28ae5 ]
2016-03-16 19:23:49 -07:00
Boris Fomitchev
80f568563d
Moved to pbuilder
...
[ROCm/rccl commit: 86dc136fa9 ]
2016-03-16 18:41:54 -07:00
Boris Fomitchev
96e5299fe7
Moved release files to proper area
...
Bumping a version; building for 7.5
[ROCm/rccl commit: 172f316ac2 ]
2016-03-16 18:30:53 -07:00
Boris Fomitchev
711444221f
Updated package version, added manpage
...
[ROCm/rccl commit: 941d9da08c ]
2016-02-29 12:10:34 -08:00
Nathan Luehr
b029c954f1
Fixed useRemoteRecv consistency issue.
...
Change-Id: Ib093a8dc3bb093eddc89dad81d3fffa53c03a6a2
Reviewed-on: http://git-master/r/1013543
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: 5554a4c9f0 ]
2016-02-18 13:45:42 -08:00
Nathan Luehr
b5a2ca138d
Fixed buffer overflow in ReduceOrCopy
...
Bug caused AllGathers and ReduceScatters of less than
8 bytes to fail in certain cases.
Change-Id: I33e1beb50805bfdb457ae16a90e3f91c1b283b9b
Reviewed-on: http://git-master/r/1011505
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: 9442285526 ]
2016-02-12 15:13:56 -08:00
Nathan Luehr
60799f9427
Libwrap checks for LIB.so.1 if LIB.so not found
...
Change-Id: I6f07f887f828cb2259dcfd496a2ad707db898cf5
Reviewed-on: http://git-master/r/1000162
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: caa40b8dd3 ]
2016-01-29 12:36:42 -08:00
Nathan Luehr
16379ec27b
Added NCCL error checking to tests.
...
Also cleaned up makefile so that tests and lib are not built unnecessarily.
Change-Id: Ia0c596cc2213628de2f066be97615c09bb1bb262
Reviewed-on: http://git-master/r/999627
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: 2758353380 ]
2016-01-29 11:09:05 -08:00
Nathan Luehr
c019d205a9
Enabled support for char type to be unsigned.
...
GCC on POWER arch defines char type as unsigned.
Change-Id: Ic143cb058fe42414b1f6f1f45b02132c837726ae
Reviewed-on: http://git-master/r/999614
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: fe1a956715 ]
2016-01-28 13:38:18 -08:00
Sylvain Jeaugey
42c3edc35a
Moved tests to separate dir and improved MPI test
...
test sources moved to test/ directory.
MPI test displays PASS/FAIL and returns code accordingly.
Change-Id: I058ebd1bd5202d8f38cc9787898b2480100c102b
Reviewed-on: http://git-master/r/936086
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: c05312f151 ]
2016-01-28 12:56:36 -08:00
Nathan Luehr
a867db80f1
Added support for more than 8 GPUs.
...
Change-Id: Iaa1841036a7bfdad6ebec99fed0adcd2bbe6ffad
Reviewed-on: http://git-master/r/935459
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: 5966316771 ]
2016-01-21 13:00:21 -08:00
Nathan Luehr
c4f8bd6da3
Fixed deadlock in back-to-back reduce_scatters.
...
Change-Id: I92d32b15e516a39710b676aee692ae9b70638937
Reviewed-on: http://git-master/r/935458
Reviewed-by: Przemek Tredak <ptredak@nvidia.com >
Tested-by: Przemek Tredak <ptredak@nvidia.com >
[ROCm/rccl commit: 130ee246e2 ]
2016-01-21 10:36:03 -08:00
Nathan Luehr
bb5df3dbb4
Merge pull request #6 from lukeyeager/deb
...
Deb packaging
[ROCm/rccl commit: 90af7c73ef ]
2016-01-07 13:06:28 -08:00
Nathan Luehr
00a915bb69
Merge branch 'yangky11-patch-1'
...
[ROCm/rccl commit: 3251681207 ]
2016-01-06 16:48:29 -08:00
Kaiyu Yang
3cb0f5bc78
fix a typo in README.md
...
[ROCm/rccl commit: d332c41e71 ]
2015-12-24 00:01:02 +08:00
Luke Yeager
d529f63072
Update deb packaging scripts
...
[ROCm/rccl commit: c9da89254b ]
2015-12-18 14:23:34 -08:00
Nathan Luehr
8915ecb588
Merge pull request #5 from lukeyeager/tests-nvml
...
Don't link tests with NVML
[ROCm/rccl commit: eb2d869f71 ]
2015-12-18 13:36:20 -08:00
Boris Fomitchev
da24020fd4
Added Debian packaging files
...
[ROCm/rccl commit: f1e92fe2a3 ]
2015-12-18 13:36:10 -08:00
Boris Fomitchev
b3db23e837
Don't link tests with NVML
...
[ROCm/rccl commit: b5400c54df ]
2015-12-18 13:27:55 -08:00
Nathan Luehr
87be580f28
Merge pull request #4 from lukeyeager/build-sm50
...
Build SM 5.0 code
[ROCm/rccl commit: a4de6016f8 ]
2015-12-18 13:23:48 -08:00
Nathan Luehr
5c5cb7fb67
Merge pull request #3 from lukeyeager/semver
...
Use semantic versioning
[ROCm/rccl commit: 4807909e3f ]
2015-12-18 13:22:19 -08:00
Luke Yeager
eff4cd1cb3
Build SM 5.0 code
...
[ROCm/rccl commit: dd0884b707 ]
2015-12-18 13:19:50 -08:00
Luke Yeager
a0d7e36891
Use semantic versioning
...
[ROCm/rccl commit: e1634ca6cb ]
2015-12-18 12:02:17 -08:00
Nathan Luehr
d99f295375
Fixed bug in MPI initialization.
...
[ROCm/rccl commit: 651a6edc5c ]
2015-12-10 17:54:41 -08:00
Nathan Luehr
7a446eab56
Merge pull request #1 from slayton58/int64_uint64
...
Add int64 and uint64 types for all algorithms and tests
[ROCm/rccl commit: ada5edce88 ]
2015-12-10 17:22:50 -08:00
Simon Layton
4f66264713
Add int64 and uint64 types for all algorithms and tests
...
[ROCm/rccl commit: 41ce4ca9fc ]
2015-12-04 13:28:36 -05:00