26 Commits

Author SHA1 Message Date
corey-derochie-amd de82a18790 Fixed unit-test env var list parsing and improved filtered test run speed (#1626)
* Fixed parsing of env var lists which were overwriting the mutable env var string and polluting future parses.

* Fixed all tests to obey UT_DATATYPES and UT_REDOPS filters.

* Allow tests to bail early via `GTEST_SKIP` if UT_DATATYPES or UT_REDOPS filters give a test size of zero. This allows tests to run much faster with filters on.

* Wrapped the support checks in helper functions on `TestBed`.

[ROCm/rccl commit: 18e9ad913b]
2025-12-10 10:06:44 -07:00
Atul Kulkarni e4aef19511 Added new unit tests for AllReduce with Bias API (#2036)
* Added new unit tests for AllReduce with Bias API

* Address review comments

[ROCm/rccl commit: 7c12b0b76b]
2025-12-03 17:37:34 -06:00
BertanDogancay d045d0ca23 Merge remote-tracking branch 'nccl/master' into develop
[ROCm/rccl commit: a6bf9bfc9e]
2025-04-23 20:47:43 -07:00
gilbertlee-amd 8023be9355 Adding UT_DEBUG_PAUSE to unit tests (#1653)
[ROCm/rccl commit: ee85a70bb4]
2025-04-21 21:15:07 -06:00
mberenjk a3a598efb3 Skipping AllReduce test on more than 8 ranks for FP8 type on Hyabusa (#1598)
* Skipping AllReduce FP8 test on 9 to 16 ranks (gfx90a) as it's using Tree algorithm not RING

---------

Co-authored-by: Marzieh Berenjkoub <mberenjk@amd.com>

[ROCm/rccl commit: 5f691aaf65]
2025-03-17 10:22:49 -05:00
Wenkai Du afd04a5117 Limit P2P channels per peer to not exceeding max channels (#1594)
* Limit P2P channels per peer to not exceeding max channels

* [UT] test single GPU cases for all collectives

* [UT] fix out of range root value

[ROCm/rccl commit: 4237caad69]
2025-03-11 09:32:09 -07:00
saurabhAMD 69d976532b GPU allocation for CPX Unit Tests using PCI bus id (#1403)
* mapping devices wrt pci

* Gpu allocation by using pci mapping

* Passing gpuPriorityOrder in as an argument rather than making the functions non-static.

* Removing redundant testBed instance calling

[ROCm/rccl commit: 69b2b712ab]
2024-11-04 10:51:00 -06:00
saurabhAMD e3b39ab309 Making variable names consistent in EnvVars.cpp (#1327)
* Making variable names consistent in EnvVars.cpp

[ROCm/rccl commit: 4856309413]
2024-09-11 09:23:31 -05:00
saurabhAMD fdaef9dd82 Enabling Unit Tests for CPX mode (#1324)
* Unit Tests for RCCL in CPX mode

* override pow2gpus set by cpx mode by user argument

* Adding comment for UT_POW2_GPUS

* Additional comment on why using pow2gpus for cpx mode.

[ROCm/rccl commit: 289a80c4e9]
2024-09-09 10:12:33 -05:00
Tim 1bd3db8fc7 Update EnvVars.cpp
[ROCm/rccl commit: 757d1891e9]
2024-09-04 16:55:36 -04:00
akolliasAMD 37c44d531b gfx12 Disable ll protocol (#1268)
[ROCm/rccl commit: c246e25f8e]
2024-07-26 08:59:55 -06:00
corey-derochie-amd 37bf54b8f8 Enable multi-threading for MSCCL (#1203)
MSCCL can now run in a multi-threaded configuration. To test in the unit tests, added the ENABLE_OPENMP compile definition flag and the --openmp-test-enable flag to the unit test build script. To activate, set the environment variables UT_MULTITHREADED=1 and UT_PROCESS_MASK=1. Set Jenkins to use this mode.

[ROCm/rccl commit: 0c36d571ea]
2024-07-04 09:34:38 -06:00
saurabhAMD 44064a612c enable UT to test with channels greater than 64
[ROCm/rccl commit: 392a73fdef]
2024-06-13 13:54:08 -05:00
Andy li e373bd44bf Enable fp8 support (#1101)
* initial checkin

* resolve cr comments

* resolve the build issue

* fix the data correctless issue

* update fp8 header file and update the unit test for fp8 support

* remove fp16 from fp8 headers

* fix ut issue and catch up the latest code from develop

* udate according to cr comments

* update ut according to cr comments

* update num floats for each SumPostDiv from 4 to 6

* update fp8 header file name

* fix the typo

[ROCm/rccl commit: 6777e65c1d]
2024-03-08 15:17:53 -08:00
Bertan Dogancay 11674674fc [DEV] Configure functions in RCCL (#986)
* configure functions in rccl

[ROCm/rccl commit: 28d9b170c9]
2024-01-18 15:07:16 -07:00
Tim 0343d9ccac Relaxing default timeout limit, add error log (#1052)
Signed-off-by: Tim Hu <timhu102@amd.com>

[ROCm/rccl commit: 05850e89f2]
2024-01-18 15:09:08 -05:00
Tim 245e757b26 Adding timeout functionality/EnvVar to TestBed (#1044)
* Adding timeout functionality/EnvVar to TestBed
* updating timeout unit to microseconds

Signed-off-by: Tim Hu <timhu102@amd.com>

[ROCm/rccl commit: 9c0ef11ac7]
2024-01-17 11:33:01 -05:00
gilbertlee-amd 894b2d75e7 Report unit test environment variable values as part of output (#789)
[ROCm/rccl commit: f7c553edad]
2023-06-29 07:13:05 -06:00
gilbertlee-amd ff2c1c5d0f Unit test performance refactor (#700)
* Refactoring unit tests to improve performance
* Spawning child processes during InitComms instead of on TestBed construction
* Temporarily disabling graph unit tests

[ROCm/rccl commit: 27e0cb43c2]
2023-04-06 12:28:53 -06:00
gilbertlee-amd b859549866 Adding interactive mode for unit tests (UT_INTERACTIVE) (#715)
[ROCm/rccl commit: 00c3d8d850]
2023-03-21 10:58:24 -06:00
gilbertlee-amd 80ce7507c7 Adding UnitTest timing summary (UT_SHOW_TIMING) (#692)
[ROCm/rccl commit: f63d3b1978]
2023-02-22 08:57:13 -07:00
akolliasAMD 78fc48f1c8 added a different way for getting device count, by running it in a child process (#665)
[ROCm/rccl commit: 24aa8bd802]
2022-12-14 16:10:14 -07:00
Edgar Gabriel b32b819151 introduce support for ncclFloat16/half in UT
[ROCm/rccl commit: f6e00dec13]
2022-08-24 15:28:24 +00:00
akolliasAMD dcf46e84e0 moved default number of max ranks per gpu to 1
[ROCm/rccl commit: 8b9291eb47]
2022-06-22 17:37:49 +00:00
Edgar f7ef619ba7 extending the unit-tests for multi-rank support
[ROCm/rccl commit: a87d61db2b]
2022-06-10 14:23:19 +00:00
gilbertlee-amd a182076a0e Unit test refactor (#500)
Refactoring and consolidating single-process / multi-process unit testing

[ROCm/rccl commit: 29ad0f5fbe]
2022-02-25 08:59:07 -07:00