Граф коммитов

30 Коммитов

Автор SHA1 Сообщение Дата
corey-derochie-amd de82a18790 Fixed unit-test env var list parsing and improved filtered test run speed (#1626)
* Fixed parsing of env var lists which were overwriting the mutable env var string and polluting future parses.

* Fixed all tests to obey UT_DATATYPES and UT_REDOPS filters.

* Allow tests to bail early via `GTEST_SKIP` if UT_DATATYPES or UT_REDOPS filters give a test size of zero. This allows tests to run much faster with filters on.

* Wrapped the support checks in helper functions on `TestBed`.

[ROCm/rccl commit: 18e9ad913b]
2025-12-10 10:06:44 -07:00
Atul Kulkarni e4aef19511 Added new unit tests for AllReduce with Bias API (#2036)
* Added new unit tests for AllReduce with Bias API

* Address review comments

[ROCm/rccl commit: 7c12b0b76b]
2025-12-03 17:37:34 -06:00
corey-derochie-amd af1c448ed1 Changed TestBedChild to avoid hang if the call fails (#1875)
Changed `TestBedChild` protocol to send the result code before the return value to avoid hanging if the call fails. Switched `TestBedChild::GetUniqueId` to use this.

[ROCm/rccl commit: b88c134874]
2025-08-23 00:17:34 -05:00
Arm Patinyasakdikul 8557ea33ad Test: delete child object to address memory leak. (#1863)
[ROCm/rccl commit: 9d3acffa5f]
2025-08-20 10:15:03 -05:00
BertanDogancay c0c9312e38 Merge remote-tracking branch 'nccl/master' into develop
[ROCm/rccl commit: aaf023976a]
2025-06-20 07:54:49 -05:00
BertanDogancay d045d0ca23 Merge remote-tracking branch 'nccl/master' into develop
[ROCm/rccl commit: a6bf9bfc9e]
2025-04-23 20:47:43 -07:00
gilbertlee-amd 8023be9355 Adding UT_DEBUG_PAUSE to unit tests (#1653)
[ROCm/rccl commit: ee85a70bb4]
2025-04-21 21:15:07 -06:00
mberenjk a3a598efb3 Skipping AllReduce test on more than 8 ranks for FP8 type on Hyabusa (#1598)
* Skipping AllReduce FP8 test on 9 to 16 ranks (gfx90a) as it's using Tree algorithm not RING

---------

Co-authored-by: Marzieh Berenjkoub <mberenjk@amd.com>

[ROCm/rccl commit: 5f691aaf65]
2025-03-17 10:22:49 -05:00
Wenkai Du afd04a5117 Limit P2P channels per peer to not exceeding max channels (#1594)
* Limit P2P channels per peer to not exceeding max channels

* [UT] test single GPU cases for all collectives

* [UT] fix out of range root value

[ROCm/rccl commit: 4237caad69]
2025-03-11 09:32:09 -07:00
saurabhAMD 69d976532b GPU allocation for CPX Unit Tests using PCI bus id (#1403)
* mapping devices wrt pci

* Gpu allocation by using pci mapping

* Passing gpuPriorityOrder in as an argument rather than making the functions non-static.

* Removing redundant testBed instance calling

[ROCm/rccl commit: 69b2b712ab]
2024-11-04 10:51:00 -06:00
Tim 3261e2a5fd Adding User Buffer Registration support for Unit test (#1199)
* Adding UBR support for UT SendRecv

Signed-off-by: Tim Hu <timhu102@amd.com>

* Update test/common/TestBedChild.cpp

Co-authored-by: corey-derochie-amd <161367113+corey-derochie-amd@users.noreply.github.com>

---------

Signed-off-by: Tim Hu <timhu102@amd.com>
Co-authored-by: corey-derochie-amd <161367113+corey-derochie-amd@users.noreply.github.com>

[ROCm/rccl commit: a4793286c7]
2024-07-30 13:39:25 -04:00
corey-derochie-amd 37bf54b8f8 Enable multi-threading for MSCCL (#1203)
MSCCL can now run in a multi-threaded configuration. To test in the unit tests, added the ENABLE_OPENMP compile definition flag and the --openmp-test-enable flag to the unit test build script. To activate, set the environment variables UT_MULTITHREADED=1 and UT_PROCESS_MASK=1. Set Jenkins to use this mode.

[ROCm/rccl commit: 0c36d571ea]
2024-07-04 09:34:38 -06:00
saurabhAMD de7ea612d7 Unit Tests for testing channels (#1222)
[ROCm/rccl commit: e170f41ddd]
2024-06-25 10:10:10 -05:00
Bertan Dogancay dea5e83940 [UT] Start supporting multiple group calls and graphs (#1151)
* Start supporting multiple group calls UT

[ROCm/rccl commit: 0ec41f1386]
2024-04-25 11:11:16 -06:00
Tim 245e757b26 Adding timeout functionality/EnvVar to TestBed (#1044)
* Adding timeout functionality/EnvVar to TestBed
* updating timeout unit to microseconds

Signed-off-by: Tim Hu <timhu102@amd.com>

[ROCm/rccl commit: 9c0ef11ac7]
2024-01-17 11:33:01 -05:00
akolliasAMD af8c2194a7 lessened the amount of child processes to active ones (#720)
[ROCm/rccl commit: 2ce7d971e5]
2023-04-11 08:59:56 -06:00
gilbertlee-amd ff2c1c5d0f Unit test performance refactor (#700)
* Refactoring unit tests to improve performance
* Spawning child processes during InitComms instead of on TestBed construction
* Temporarily disabling graph unit tests

[ROCm/rccl commit: 27e0cb43c2]
2023-04-06 12:28:53 -06:00
gilbertlee-amd b859549866 Adding interactive mode for unit tests (UT_INTERACTIVE) (#715)
[ROCm/rccl commit: 00c3d8d850]
2023-03-21 10:58:24 -06:00
gilbertlee-amd 0da1d6a6cd Multi stream unit test (#693)
* Adding multi-stream support to unit tests

[ROCm/rccl commit: 80ed608a9d]
2023-02-23 13:28:50 -07:00
gilbertlee-amd 7d860d3642 Unit test fail check (#689)
* Adding fall-through on unit test failure

* Workaround for hipGraph validity check issue

[ROCm/rccl commit: a640c6983f]
2023-02-18 08:50:46 -08:00
Pedram Alizadeh f7982e9bed UnitTest: add test cases for 2.14 API (ncclCommInitRankConfig and ncclCommFinalize for non-blocking communicator) (#674)
[ROCm/rccl commit: fddb5e6be8]
2023-02-03 17:36:30 -05:00
Pedram Alizadeh a85f71a421 Revert "UnitTest: add test cases for 2.14 API (ncclCommInitRankConfig and ncclCommFinalize for non-blocking communicator) (#662)" (#666)
This reverts commit f29aa66d4f.

[ROCm/rccl commit: 54a3da04eb]
2022-12-14 11:28:40 -05:00
Pedram Alizadeh f29aa66d4f UnitTest: add test cases for 2.14 API (ncclCommInitRankConfig and ncclCommFinalize for non-blocking communicator) (#662)
[ROCm/rccl commit: 8250092367]
2022-12-13 16:05:09 -05:00
gilbertlee-amd 5871811d34 Graph unit tests (#656)
* Adding hipGraph unit tests

[ROCm/rccl commit: faed69f9fc]
2022-12-01 10:28:42 -07:00
Edgar f7ef619ba7 extending the unit-tests for multi-rank support
[ROCm/rccl commit: a87d61db2b]
2022-06-10 14:23:19 +00:00
akolliasAMD 3493750b6b Added alltoallv test and optional args variable on collective args (#514)
* Added alltoallv test and optional args variable on collective args

[ROCm/rccl commit: 65ea3d80db]
2022-03-18 13:55:11 -04:00
Nirmal Unnikrishnan 4a4c053a6a File reorganization as per the new defined standard
The header files will in /opt/rocm-xxx/include/rccl
Libraries and cmake will be in /opt/rocm-xxx/lib folder.
Added wrappers for header files using rocm-cmake functions for backward compatibility.


[ROCm/rccl commit: 676a4737c1]
2022-03-08 17:32:02 +00:00
gilbertlee-amd 8f7ec04f37 Changing initialization method for UnitTests (#510)
[ROCm/rccl commit: 0687940b84]
2022-03-07 09:22:55 -07:00
akolliasAMD 2419a950fe Added Unit test for nccl send recv (#506)
Added Send Receive test that tests through all pairs

[ROCm/rccl commit: ff54e79799]
2022-03-02 15:50:16 -05:00
gilbertlee-amd a182076a0e Unit test refactor (#500)
Refactoring and consolidating single-process / multi-process unit testing

[ROCm/rccl commit: 29ad0f5fbe]
2022-02-25 08:59:07 -07:00