Commit Graph

44 Commits

Author SHA1 Message Date
David Addison ea6eec9e80 Updated with new command line arguments
[ROCm/rccl-tests commit: 11cff17a04]
2021-07-06 16:27:45 -07:00
David Addison 230983c84e Merge branch 'bfloat16'
[ROCm/rccl-tests commit: f476f4a17a]
2021-07-06 10:20:32 -07:00
David Addison a23cffe28a Added new option to report average iteration time
[ROCm/rccl-tests commit: 1dfc76eccc]
2021-06-30 19:36:07 -07:00
David Addison 1044cd1f32 Resync with changes in gitlab-master code
[ROCm/rccl-tests commit: 1ae8cdc315]
2021-06-30 13:16:04 -07:00
David Addison efaaf56199 Merge pull request #88 from nzmsv/master
Cleanup argument error handling and messages

[ROCm/rccl-tests commit: 44df0bf010]
2021-06-30 12:35:47 -07:00
David Addison d30e35f150 Added new tests: scatter, sendrecv, hypercube
[ROCm/rccl-tests commit: 9dae3d3a37]
2021-06-28 16:49:10 -07:00
David Addison e73e5a239b Added support for CUDA graph capture/replay (-G)
[ROCm/rccl-tests commit: e55ad3796d]
2021-06-28 14:19:45 -07:00
David Addison 20b63cf465 Fixed formatting for bfloat16 support
[ROCm/rccl-tests commit: 526eacadf7]
2021-06-28 10:12:34 -07:00
David Addison a41268e26e Add support for ncclAvg operation
[ROCm/rccl-tests commit: cde7e769c1]
2021-06-28 09:41:58 -07:00
Greg Inozemtsev 45c28c6c36 Cleanup argument error handling and messages
Add error checking for minbytes and maxbytes arguments

Also accept lowercase literals when parsing size arguments and print errors and usage on stderr.


[ROCm/rccl-tests commit: c4de829d91]
2021-06-04 21:47:40 +00:00
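
The size-argument handling described in the commit above can be illustrated with a minimal sketch. The helper name `parse_size`, its return convention, and the choice of binary units (1K = 1024 bytes) are assumptions made for illustration; this is not the code from the repository. Overflow checking is omitted for brevity.

```c
#include <ctype.h>
#include <stdlib.h>

/* Hypothetical helper: parse "128", "64k", "64K" or "1G" into a byte count.
   Returns 0 on success and -1 on a malformed value. */
static int parse_size(const char *text, unsigned long long *bytes) {
  char *end = NULL;
  unsigned long long value;
  unsigned long long unit = 1;

  /* Require a leading digit so that "", "-1" and "abc" are rejected. */
  if (!isdigit((unsigned char)text[0])) return -1;
  value = strtoull(text, &end, 10);

  /* Accept the suffix in either case, e.g. "64m" as well as "64M". */
  switch (toupper((unsigned char)*end)) {
    case 'G': unit = 1ULL << 30; end++; break;
    case 'M': unit = 1ULL << 20; end++; break;
    case 'K': unit = 1ULL << 10; end++; break;
    case '\0': break;
    default: return -1;
  }
  if (*end != '\0') return -1; /* trailing characters after the suffix */

  *bytes = value * unit;
  return 0;
}
```

A caller could reject the arguments when `parse_size` returns -1, or when the parsed minimum exceeds the parsed maximum, and report the problem with `fprintf(stderr, ...)` before printing the usage text.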
Sylvain Jeaugey 05f0ab10e6 Update PERFORMANCE.md
[ROCm/rccl-tests commit: e12c35d84b]
2021-05-27 09:12:52 -07:00
David Addison 882c60210b Add support for new datatype: bfloat16
[ROCm/rccl-tests commit: e37545e491]
2021-03-15 17:13:35 -07:00
David Addison a74716696b Merge pull request #67 from NVIDIA/big_buffers
Do not allocate memory for expected buffer if checking disabled

[ROCm/rccl-tests commit: 0b30de583f]
2021-02-04 09:24:09 -08:00
David Addison c62bde3272 Do not allocate memory for expected buffer if checking disabled
This allows the tests to be run with larger buffers


[ROCm/rccl-tests commit: 7677f3f608]
2021-01-20 17:08:40 -08:00
David Addison 281348cba9 Merge pull request #64 from NVIDIA/hosthash_boot_id
Add boot_id to the hostname hash due to collisions on Azure

[ROCm/rccl-tests commit: 2f9bba9f20]
2021-01-11 10:02:20 -08:00
David Addison 819d6ce228 Add boot_id to the hostname hash due to collisions on Azure
Fixes #60


[ROCm/rccl-tests commit: ae1ce98e69]
2021-01-04 11:38:45 -08:00
Sylvain Jeaugey 5a9f62c2b7 Merge pull request #61 from jithinjosepkl/master
Use DJB2a hash algorithm in getHostHash()

[ROCm/rccl-tests commit: 464f038106]
2020-12-18 10:39:43 -08:00
Jithin Jose f770d161f3 Use DJB2a hash algorithm in getHostHash()
[ROCm/rccl-tests commit: da67a81c8e]
2020-12-18 10:12:54 -08:00
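
For reference, the DJB2a hash named in the commit above is the XOR variant of Bernstein's string hash: start from 5381, then for each byte multiply by 33 and XOR the byte in. The sketch below is a generic version; the function name `djb2a_hash` and the 64-bit width are assumptions, and the repository's `getHostHash()` may differ in detail.

```c
#include <stdint.h>

/* Generic DJB2a string hash: h = (h * 33) ^ c for each byte c. */
static uint64_t djb2a_hash(const char *s) {
  uint64_t h = 5381;
  for (; *s != '\0'; ++s) {
    /* (h << 5) + h is h * 33, computed without a multiply. */
    h = ((h << 5) + h) ^ (uint64_t)(unsigned char)*s;
  }
  return h;
}
```

With this definition the empty string hashes to 5381 and the string "a" hashes to 177604.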
Sylvain Jeaugey f35cba73c8 Merge pull request #48 from NVIDIA/fix-makefile-typo
Fix typo in src/Makefile

[ROCm/rccl-tests commit: bd0755c95c]
2020-06-24 14:52:55 -07:00
Luke Yeager 8b83a414c5 Fix typo in src/Makefile
[ROCm/rccl-tests commit: afdaf59b3b]
2020-06-24 14:39:22 -07:00
Sylvain Jeaugey 0624d2cede Add gencode for CUDA11
[ROCm/rccl-tests commit: b2603a2e85]
2020-06-23 18:16:46 -07:00
Sylvain Jeaugey 12d86bd58f Change all_gather/reduce_scatter algbw to match the documentation.
Fix #45 : All_gather and reduce_scatter algorithm bandwidth was
computed as time/count*(nranks-1) which is not consistent with the
way we compute it for other collectives.

This change makes algbw higher; busbw is unchanged.


[ROCm/rccl-tests commit: ec1b5e22e6]
2020-06-19 10:42:19 -07:00
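
The relationship between the two figures mentioned in the commit above can be sketched as follows. This assumes the convention that algorithm bandwidth is the total data size divided by elapsed time, and that bus bandwidth for all_gather and reduce_scatter scales it by (n-1)/n for n ranks. The function names and the bytes-per-second units are illustrative, not taken from the repository.

```c
#include <stddef.h>

/* Algorithm bandwidth in bytes/second, assuming the total size
   (per-rank bytes times the number of ranks) is divided by the time. */
static double algbw(size_t per_rank_bytes, int nranks, double seconds) {
  return (double)per_rank_bytes * nranks / seconds;
}

/* Bus bandwidth for all_gather / reduce_scatter under the assumed
   convention: algbw scaled by (nranks - 1) / nranks. */
static double busbw(double algbw_value, int nranks) {
  return algbw_value * ((double)(nranks - 1) / nranks);
}
```

For example, 1 MiB per rank across 4 ranks in 0.5 s gives an algbw of 8388608 bytes/s and a busbw of 6291456 bytes/s under these assumptions.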
Sylvain Jeaugey fcaaf2c4a1 Fix #47 : compilation error on NCCL<2.7
Return an error when trying to run alltoall test when compiled
against NCCL<2.7.


[ROCm/rccl-tests commit: 07ac716c1a]
2020-06-18 15:02:51 -07:00
Sylvain Jeaugey cf70df2498 Merge pull request #46 from NVIDIA/p2p
Add alltoall perf test

[ROCm/rccl-tests commit: a7b304dde5]
2020-06-17 10:45:29 -07:00
Luke Yeager 3a6293b748 Fix some memory leaks
[ROCm/rccl-tests commit: af4fa0f4cf]
2020-06-17 10:44:32 -07:00
Sylvain Jeaugey 0dfae3da28 Remove sm_30
[ROCm/rccl-tests commit: 7a833631b2]
2020-06-15 08:54:21 -07:00
Sylvain Jeaugey e260c673fe Fix #43 : Add .gitignore for build dir
[ROCm/rccl-tests commit: ba924dac95]
2020-06-03 15:10:38 -07:00
Sylvain Jeaugey c633de20d6 Add alltoall perf test
[ROCm/rccl-tests commit: 119a0ecf60]
2020-03-17 12:00:19 -07:00
Sylvain Jeaugey ac5385f663 Merge pull request #31 from wzamazon/fix_makefile
Add -L$(MPI_HOME)/lib64 to NVLDFLAGS

[ROCm/rccl-tests commit: c864b73a27]
2020-01-06 10:38:40 -08:00
Wei Zhang c76094c704 Add -L$(MPI_HOME)/lib64 to NVLDFLAGS
In some cases, the MPI library is not in $(MPI_HOME)/lib but
in $(MPI_HOME)/lib64, for example on Red Hat-like Linux systems
(CentOS, Amazon Linux) when MPI is installed by yum or rpm.

Under such circumstances, the current Makefile causes a failure.
This patch addresses the issue by adding -L$(MPI_HOME)/lib64 to
NVLDFLAGS in src/Makefile.

Signed-off-by: Wei Zhang <wzam@amazon.com>


[ROCm/rccl-tests commit: 0f173234bb]
2019-12-16 16:18:22 -08:00
Sylvain Jeaugey 23326c8d34 Update README.md
Checks are now fully local, no need to disable them at scale.

[ROCm/rccl-tests commit: a2af1d959d]
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey 6e12e2d665 Update README.md
[ROCm/rccl-tests commit: ca7a565236]
2019-08-16 09:06:28 -07:00
David Addison 18902f40a7 Resync all tests with test code from NCCL 2.4
Major rework to merge most of the changes from the NCCL internal
tests into the public ones

Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.


[ROCm/rccl-tests commit: cbe7f65400]
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey 2b951dc7dd Added a clarification for AllGather and ReduceScatter sizes since NCCL uses the size per rank.
[ROCm/rccl-tests commit: dcf818955f]
2018-08-17 14:58:44 -07:00
Sylvain Jeaugey ee3da4e0b7 Clarification
[ROCm/rccl-tests commit: eb4c43ff3d]
2018-01-30 09:17:29 -08:00
Sylvain Jeaugey e5b278882e Typos/Clarifications
[ROCm/rccl-tests commit: e00cb1f1c4]
2018-01-30 09:15:58 -08:00
Sylvain Jeaugey 0765416643 Fix link to performance page
[ROCm/rccl-tests commit: db39a88f8a]
2018-01-30 09:14:49 -08:00
Sylvain Jeaugey a4fc980f82 Added explanation about performance numbers
[ROCm/rccl-tests commit: 222f94f949]
2018-01-30 09:13:52 -08:00
Sylvain Jeaugey 3958cc347e Print NCCL version at start
[ROCm/rccl-tests commit: 925a70576e]
2017-12-21 15:10:09 -08:00
Sylvain Jeaugey 224eddfc15 Fix NCCL_HOME to be consistent with README
[ROCm/rccl-tests commit: 25016c8eeb]
2017-08-09 10:41:31 -07:00
Sylvain Jeaugey ee54464922 Fix typo in Readme
[ROCm/rccl-tests commit: 9ec3e35276]
2017-08-08 16:29:25 -07:00
Sylvain Jeaugey 0f03b55914 Improve Readme
[ROCm/rccl-tests commit: a15599f5cf]
2017-08-08 16:28:46 -07:00
Sylvain Jeaugey 0a90cde105 Improve Readme
[ROCm/rccl-tests commit: caede2fbd6]
2017-08-08 16:25:07 -07:00
Sylvain Jeaugey 4cb47ccb21 Initial commit
[ROCm/rccl-tests commit: b188a15299]
2017-08-08 16:18:34 -07:00