David Addison
ea6eec9e80
Updated with new command line arguments
...
[ROCm/rccl-tests commit: 11cff17a04 ]
2021-07-06 16:27:45 -07:00
David Addison
230983c84e
Merge branch 'bfloat16'
...
[ROCm/rccl-tests commit: f476f4a17a ]
2021-07-06 10:20:32 -07:00
David Addison
a23cffe28a
Added new option to report average iteration time
...
[ROCm/rccl-tests commit: 1dfc76eccc ]
2021-06-30 19:36:07 -07:00
David Addison
1044cd1f32
Resync with changes in gitilab-master code
...
[ROCm/rccl-tests commit: 1ae8cdc315 ]
2021-06-30 13:16:04 -07:00
David Addison
efaaf56199
Merge pull request #88 from nzmsv/master
...
Cleanup argument error handling and messages
[ROCm/rccl-tests commit: 44df0bf010 ]
2021-06-30 12:35:47 -07:00
David Addison
d30e35f150
Added new tests: scatter, sendrecv, hypercube
...
[ROCm/rccl-tests commit: 9dae3d3a37 ]
2021-06-28 16:49:10 -07:00
David Addison
e73e5a239b
Added support for CUDA graph capture/replay (-G)
...
[ROCm/rccl-tests commit: e55ad3796d ]
2021-06-28 14:19:45 -07:00
David Addison
20b63cf465
Fixed formatting for bfloat16 support
...
[ROCm/rccl-tests commit: 526eacadf7 ]
2021-06-28 10:12:34 -07:00
David Addison
a41268e26e
Add support for ncclAvg operation
...
[ROCm/rccl-tests commit: cde7e769c1 ]
2021-06-28 09:41:58 -07:00
Greg Inozemtsev
45c28c6c36
Cleanup argument error handling and messages
...
Add error checking for minbytes and maxbytes arguments
Also accept lowercase literals when parsing size arguments and print errors and usage on stderr.
[ROCm/rccl-tests commit: c4de829d91 ]
2021-06-04 21:47:40 +00:00
Sylvain Jeaugey
05f0ab10e6
Update PERFORMANCE.md
...
[ROCm/rccl-tests commit: e12c35d84b ]
2021-05-27 09:12:52 -07:00
David Addison
882c60210b
Add support for new datatype: bfloat16
...
[ROCm/rccl-tests commit: e37545e491 ]
2021-03-15 17:13:35 -07:00
David Addison
a74716696b
Merge pull request #67 from NVIDIA/big_buffers
...
Do not allocate memory for expected buffer if checking disabled
[ROCm/rccl-tests commit: 0b30de583f ]
2021-02-04 09:24:09 -08:00
David Addison
c62bde3272
Do not allocate memory for expected buffer if checking disabled
...
This allows the tests to be run with larger buffers
[ROCm/rccl-tests commit: 7677f3f608 ]
2021-01-20 17:08:40 -08:00
David Addison
281348cba9
Merge pull request #64 from NVIDIA/hosthash_boot_id
...
Add boot_id to the hostname hash due to collisions on Azure
[ROCm/rccl-tests commit: 2f9bba9f20 ]
2021-01-11 10:02:20 -08:00
David Addison
819d6ce228
Add boot_id to the hostname hash due to collisions on Azure
...
Fixes #60
[ROCm/rccl-tests commit: ae1ce98e69 ]
2021-01-04 11:38:45 -08:00
Sylvain Jeaugey
5a9f62c2b7
Merge pull request #61 from jithinjosepkl/master
...
Use DJB2a hash algorithm in getHostHash()
[ROCm/rccl-tests commit: 464f038106 ]
2020-12-18 10:39:43 -08:00
Jithin Jose
f770d161f3
Use DJB2a hash algorithm in getHostHash()
...
[ROCm/rccl-tests commit: da67a81c8e ]
2020-12-18 10:12:54 -08:00
Sylvain Jeaugey
f35cba73c8
Merge pull request #48 from NVIDIA/fix-makefile-typo
...
Fix typo in src/Makefile
[ROCm/rccl-tests commit: bd0755c95c ]
2020-06-24 14:52:55 -07:00
Luke Yeager
8b83a414c5
Fix typo in src/Makefile
...
[ROCm/rccl-tests commit: afdaf59b3b ]
2020-06-24 14:39:22 -07:00
Sylvain Jeaugey
0624d2cede
Add gencode for CUDA11
...
[ROCm/rccl-tests commit: b2603a2e85 ]
2020-06-23 18:16:46 -07:00
Sylvain Jeaugey
12d86bd58f
Change all_gather/reduce_scatter algbw to match the documentation.
...
Fix #45 : All_gather and reduce_scatter algorithm bandwidth was
computed as time/count*(nranks-1) which is not consistent with the
way we compute it for other collectives.
This change makes algbw higher; busbw is unchanged.
[ROCm/rccl-tests commit: ec1b5e22e6 ]
2020-06-19 10:42:19 -07:00
Sylvain Jeaugey
fcaaf2c4a1
Fix #47 : compilation error on NCCL<2.7
...
Return an error when trying to run alltoall test when compiled
against NCCL<2.7.
[ROCm/rccl-tests commit: 07ac716c1a ]
2020-06-18 15:02:51 -07:00
Sylvain Jeaugey
cf70df2498
Merge pull request #46 from NVIDIA/p2p
...
Add alltoall perf test
[ROCm/rccl-tests commit: a7b304dde5 ]
2020-06-17 10:45:29 -07:00
Luke Yeager
3a6293b748
Fix some memory leaks
...
[ROCm/rccl-tests commit: af4fa0f4cf ]
2020-06-17 10:44:32 -07:00
Sylvain Jeaugey
0dfae3da28
Remove sm_30
...
[ROCm/rccl-tests commit: 7a833631b2 ]
2020-06-15 08:54:21 -07:00
Sylvain Jeaugey
e260c673fe
Fix #43 : Add .gitignore for build dir
...
[ROCm/rccl-tests commit: ba924dac95 ]
2020-06-03 15:10:38 -07:00
Sylvain Jeaugey
c633de20d6
Add alltoall perf test
...
[ROCm/rccl-tests commit: 119a0ecf60 ]
2020-03-17 12:00:19 -07:00
Sylvain Jeaugey
ac5385f663
Merge pull request #31 from wzamazon/fix_makefile
...
Add -L$(MPI_HOME)/lib64 to NVLDFLAGS
[ROCm/rccl-tests commit: c864b73a27 ]
2020-01-06 10:38:40 -08:00
Wei Zhang
c76094c704
Add -L$(MPI_HOME)/lib64 to NVLDFLAGS
...
In some cases, the MPI library is not in $(MPI_HOME)/lib but
in $(MPI_HOME)/lib64. For example, on RedHat like Linux system
(CentOS, Amazon Linux), and MPI is installed by yum or rpm.
Under such circumstance, the current make file will cause failure.
This patch address this issue by adding -L$(MPI_HOME)/lib64 to
NVLDFLAGS in src/Makefile.
Signed-off-by: Wei Zhang <wzam@amazon.com >
[ROCm/rccl-tests commit: 0f173234bb ]
2019-12-16 16:18:22 -08:00
Sylvain Jeaugey
23326c8d34
Update README.md
...
Checks are now fully local, no need to disable them at scale.
[ROCm/rccl-tests commit: a2af1d959d ]
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey
6e12e2d665
Update README.md
...
[ROCm/rccl-tests commit: ca7a565236 ]
2019-08-16 09:06:28 -07:00
David Addison
18902f40a7
Resync all tests with test code from NCCL 2.4
...
Major rework to merge most of the changes from the NCCL internal
tests into the public ones
Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.
[ROCm/rccl-tests commit: cbe7f65400 ]
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey
2b951dc7dd
Added a precision for AllGather and ReduceScatter sizes since NCCL uses the size per rank.
...
[ROCm/rccl-tests commit: dcf818955f ]
2018-08-17 14:58:44 -07:00
Sylvain Jeaugey
ee3da4e0b7
Clarification
...
[ROCm/rccl-tests commit: eb4c43ff3d ]
2018-01-30 09:17:29 -08:00
Sylvain Jeaugey
e5b278882e
Typos/Clarifications
...
[ROCm/rccl-tests commit: e00cb1f1c4 ]
2018-01-30 09:15:58 -08:00
Sylvain Jeaugey
0765416643
Fix link to performance page
...
[ROCm/rccl-tests commit: db39a88f8a ]
2018-01-30 09:14:49 -08:00
Sylvain Jeaugey
a4fc980f82
Added explanation about performance numbers
...
[ROCm/rccl-tests commit: 222f94f949 ]
2018-01-30 09:13:52 -08:00
Sylvain Jeaugey
3958cc347e
Print NCCL version at start
...
[ROCm/rccl-tests commit: 925a70576e ]
2017-12-21 15:10:09 -08:00
Sylvain Jeaugey
224eddfc15
Fix NCCL_HOME to be consistent with README
...
[ROCm/rccl-tests commit: 25016c8eeb ]
2017-08-09 10:41:31 -07:00
Sylvain Jeaugey
ee54464922
Fix typo in Readme
...
[ROCm/rccl-tests commit: 9ec3e35276 ]
2017-08-08 16:29:25 -07:00
Sylvain Jeaugey
0f03b55914
Improve Readme
...
[ROCm/rccl-tests commit: a15599f5cf ]
2017-08-08 16:28:46 -07:00
Sylvain Jeaugey
0a90cde105
Improve Readme
...
[ROCm/rccl-tests commit: caede2fbd6 ]
2017-08-08 16:25:07 -07:00
Sylvain Jeaugey
4cb47ccb21
Initial commit
...
[ROCm/rccl-tests commit: b188a15299 ]
2017-08-08 16:18:34 -07:00