38 커밋

작성자 SHA1 메시지 날짜
Mustafa Abduljabbar cb4b286d2b Enable viewing algo/proto/channels used in rccl-tests output (#151)
* Enable algo/proto/channel viewing 

* Use dynamic symbol loading to avoid build/runtime issues with non-compatible RCCL versions

* Reduce code duplication

[ROCm/rccl-tests commit: 0c94d4d2b3]
2025-09-26 18:09:01 -04:00
BertanDogancay 0010193b64 Merge remote-tracking branch 'nccl-tests/master' into develop
[ROCm/rccl-tests commit: 50a26637fb]
2025-07-23 14:23:22 -05:00
David Addison cc15c84a01 Fix formatting errors in README.md
[ROCm/rccl-tests commit: 0c60e6a8e4]
2025-05-30 17:43:30 -07:00
David Addison 46e09f18c8 Add support for Symmetric Memory Registration
From NCCL 2.27.x we can now use the Symmetric Memory APIs (-R 2)


[ROCm/rccl-tests commit: a5c539e68b]
2025-05-30 17:31:34 -07:00
Wenkai Du fe47d3dd77 Automatically set in-place option from out-of-place (#123)
[ROCm/rccl-tests commit: cac33a8c2f]
2025-05-09 16:48:42 -05:00
David Addison b8dcb4dd83 Make verifiable a DSO and add NAME_SUFFIX support
Build option DSO=1 generates libverifiable.so which can be
used to reduce the combined binary size.

Build option NAME_SUFFIX can be used to a add suffix to all
generated binaries. e.g. NAME_SUFFIX=_mpi

Added new make target: clean_intermediates


[ROCm/rccl-tests commit: 1021260ca9]
2025-04-23 17:07:24 -07:00
nileshnegi 8d887aad0d Merge remote-tracking branch 'nccl-tests/master' into develop
[ROCm/rccl-tests commit: 5625599dda]
2025-04-21 19:46:10 -05:00
Alex Breslow 9da345dadf Add instructions to README regarding benchmarking on pre ROCm 6.4.x versions with HSA_NO_SCRATCH_RECLAIM=1 (#114)
[ROCm/rccl-tests commit: 284ff2ac84]
2025-04-08 09:59:57 -07:00
Sylvain Jeaugey b740da9a31 Add NCCL_TESTS_SPLIT documentation in the README
[ROCm/rccl-tests commit: 903918fc54]
2025-02-06 14:10:07 +01:00
David Sidler 8135eefce7 Use find_package for MPI (#92)
* Use find_package for MPI
* Minor fixes

[ROCm/rccl-tests commit: 46152785f0]
2025-01-14 11:49:20 -06:00
AtlantaPepsi e67844cc67 Fixing typo in readme
Signed-off-by: AtlantaPepsi <timhu102@amd.com>


[ROCm/rccl-tests commit: 71355df959]
2024-07-31 14:59:47 +00:00
David Addison 98b958afbd Added some missing command line options to README.md
Also updated single and multi-node examples.


[ROCm/rccl-tests commit: 0d86b5a6e7]
2024-07-30 14:50:45 -07:00
saurabhAMD ce8e61cc3b Enable cache flush after every -F iteration. Default : 0 (No cache flush)
[ROCm/rccl-tests commit: 699478dadf]
2024-05-07 11:32:30 -05:00
Nilesh M Negi af102613e4 [DOCS] Update README for performance-oriented runs (#73)
Signed-off-by: nileshnegi <Nilesh.Negi@amd.com>

[ROCm/rccl-tests commit: e8650b1844]
2024-04-23 14:30:06 -05:00
Wenkai Du b49f6da1ec Merge remote-tracking branch 'nccl-tests/master' into HEAD
[ROCm/rccl-tests commit: 621dde544d]
2024-03-01 18:34:44 +00:00
David Addison e1f13fac90 Make the -c option be a datacheck iteration count parameter
Default is 1


[ROCm/rccl-tests commit: 6c46206a47]
2023-09-13 14:03:38 -07:00
Wenkai Du a2e8a44adb Merge remote-tracking branch 'nccl/master' into develop
[ROCm/rccl-tests commit: bb0f15d407]
2023-06-14 08:21:02 -07:00
Felix Abecassis b3db782c3f Update README.md
[ROCm/rccl-tests commit: 17d0a42d5a]
2023-03-23 09:05:41 -07:00
Edgar Gabriel e86966907d revamp cmake MPI detection
we honor user requested MPI installations using MPI_PATH first,
and check afterwards for MPICH and Open MPI in the default
Ubuntu and RHEL installation directories.


[ROCm/rccl-tests commit: bdf58b1656]
2023-03-02 19:40:13 +00:00
Edgar Gabriel cc584c4825 auto-detect and enable MPI
[ROCm/rccl-tests commit: 2b2f23f42d]
2023-02-23 18:27:08 +00:00
Sylvain Jeaugey b70cac2b33 Update README.md
Improve MPI example to avoid confusion of number of processes / total number of GPUs.

https://github.com/NVIDIA/nccl-tests/issues/54#issuecomment-1212023369

[ROCm/rccl-tests commit: 2cbb968101]
2023-01-03 08:47:43 +01:00
Wenkai Du 06f4ccd9d2 Merge remote-tracking branch 'nccl/master' into develop
[ROCm/rccl-tests commit: 9f8ddadcdf]
2021-07-13 08:11:44 -07:00
David Addison ea6eec9e80 Updated with new command line arguments
[ROCm/rccl-tests commit: 11cff17a04]
2021-07-06 16:27:45 -07:00
Wenkai Du 075041a9fa Merge remote-tracking branch 'nccl/master' into HEAD
[ROCm/rccl-tests commit: 5361dd8177]
2020-07-06 17:54:31 +00:00
Sylvain Jeaugey 23326c8d34 Update README.md
Checks are now fully local, no need to disable them at scale.

[ROCm/rccl-tests commit: a2af1d959d]
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey 6e12e2d665 Update README.md
[ROCm/rccl-tests commit: ca7a565236]
2019-08-16 09:06:28 -07:00
Stanley Tsang 30d348de0e Adding unit tests and files for CI (#4)
* Adding initial unit test and Jenkins code.

Fixing scope of unit tests

Adding unit tests and files for CI

Fixing Jenkinsfile

* Removing typos from Jenkinsfile

* Making some fixes to the Jenkins file; temporarily disabling MPI

* Making corrections to Jenkinsfile

* Correcting dockerNodes entry in Jenkinsfile

* Fixed Jenkinsfile for CI

* Correcting Jenkinsfile for CI

* Updating README to include instructions on how to run unit tests.


[ROCm/rccl-tests commit: 924521ff57]
2019-06-28 08:52:44 -07:00
Stanley Tsang 1ced548da6 Update README.md
[ROCm/rccl-tests commit: 3aa32972f4]
2019-05-15 11:22:34 -06:00
Stanley Tsang 1f30820bdc Updating README to include CUSTOM_RCCL_LIB.
[ROCm/rccl-tests commit: 79356ec218]
2019-05-15 16:59:47 +00:00
Stanley Tsang aac7cfb64f Adding AMD copyright notices
[ROCm/rccl-tests commit: 71e663e62d]
2019-04-10 15:28:40 -07:00
Wenkai Du 3c8cfb2d6e hipify nccl-tests to become rccl-tests
[ROCm/rccl-tests commit: a15f771cb2]
2019-04-10 13:43:58 -07:00
David Addison 18902f40a7 Resync all tests with test code from NCCL 2.4
Major rework to merge most of the changes from the NCCL internal
tests into the public ones

Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.


[ROCm/rccl-tests commit: cbe7f65400]
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey 0765416643 Fix link to performance page
[ROCm/rccl-tests commit: db39a88f8a]
2018-01-30 09:14:49 -08:00
Sylvain Jeaugey a4fc980f82 Added explanation about performance numbers
[ROCm/rccl-tests commit: 222f94f949]
2018-01-30 09:13:52 -08:00
Sylvain Jeaugey ee54464922 Fix typo in Readme
[ROCm/rccl-tests commit: 9ec3e35276]
2017-08-08 16:29:25 -07:00
Sylvain Jeaugey 0f03b55914 Improve Readme
[ROCm/rccl-tests commit: a15599f5cf]
2017-08-08 16:28:46 -07:00
Sylvain Jeaugey 0a90cde105 Improve Readme
[ROCm/rccl-tests commit: caede2fbd6]
2017-08-08 16:25:07 -07:00
Sylvain Jeaugey 4cb47ccb21 Initial commit
[ROCm/rccl-tests commit: b188a15299]
2017-08-08 16:18:34 -07:00