Stanley Tsang
7e55b211c5
Build AllReduce only mode ( #443 )
...
* Initial commit of all_reduce_only support
* Working AllReduce only build
* Removing printfs and restoring release build
* Restore P2P index
* Updates to build_allreduce_only mode.
* cleaning up macro ifdefs
2021-10-26 17:36:46 -06:00
Wenkai Du
1faff323b4
Unit Test: support ncclAvg
2021-08-25 14:15:54 -07:00
Stanley Tsang
f6f5e16fe6
Fixing bug with ExtractSubDataset function not fully initializing subdataset ( #390 )
2021-06-10 14:35:39 -06:00
gilbertlee-amd
9d7232c091
Clique tuning upgrade ( #352 )
...
* Enabling clique for any XGMI-connected topology, adding tuning
* Updating CHANGELOG for clique tuning
* Re-working clique barrier system to work on multi-process / multi-gpu
2021-05-06 09:50:07 -06:00
gilbertlee-amd
41bcfb8878
Clique kernel support ( #295 )
...
* Adding experimental clique-based kernels (opt-in only)
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
Co-authored-by: Gilbert Lee <gilbert.lee@amd.com >
Co-authored-by: Wenkai Du <43822138+wenkaidu@users.noreply.github.com >
2020-11-10 15:44:10 -07:00
Stanley Tsang
684f3e6af4
Adding better naming to unit tests for filtering; adding short and full unit test suites ( #235 )
2020-07-21 12:19:47 -06:00
Wenkai Du
8db0aa8f4c
gtest: extend testing up to 8 GPUs
2020-06-29 09:32:31 -07:00
Wenkai Du
fee1a20b74
gtest: add scatter, gather and all to all unit tests
2020-06-09 17:44:15 -07:00
Stanley Tsang
20fa04d9b6
Updating copyright notices for 2020.
2020-01-29 15:28:08 -08:00
gilbertlee-amd
000bce6f27
Removing OpenMP from unit tests ( #163 )
2019-12-20 11:41:56 -07:00
Wenkai Du
bdac0256a5
Add bfloat16 all reduce unit test
2019-11-18 13:50:29 -08:00
Gilbert Lee
a50c852851
Adding support for alignment tests via sub-datasets
...
Added sample alignment test for AllGather
Datasets no longer free memory on destruction so Release() must be used
2019-05-18 00:04:03 +00:00
Gilbert Lee
08fcce5ec9
Fixing GoogleTest to 1.8.1 and making changes to tests to support older API
2019-05-16 23:13:49 +00:00
Gilbert Lee
55a4b22ad7
Updating RCCL based on NCCL 2.3.7
...
- Contains modifications to support AMD hardware
- Adds unit tests
2019-05-16 16:16:18 +00:00