Stanley Tsang
dd98f1762a
Fixing bug with ExtractSubDataset function not fully initializing subdataset ( #390 )
...
[ROCm/rccl commit: f6f5e16fe6 ]
2021-06-10 14:35:39 -06:00
gilbertlee-amd
f4a12be69b
Clique tuning upgrade ( #352 )
...
* Enabling clique for any XGMI-connected topology, adding tuning
* Updating CHANGELOG for clique tuning
* Re-working clique barrier system to work on multi-process / multi-gpu
[ROCm/rccl commit: 9d7232c091 ]
2021-05-06 09:50:07 -06:00
gilbertlee-amd
a7ef699687
Clique kernel support ( #295 )
...
* Adding experimental clique-based kernels (opt-in only)
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
Co-authored-by: Gilbert Lee <gilbert.lee@amd.com >
Co-authored-by: Wenkai Du <43822138+wenkaidu@users.noreply.github.com >
[ROCm/rccl commit: 41bcfb8878 ]
2020-11-10 15:44:10 -07:00
Stanley Tsang
56d8c7c893
Adding better naming to unit tests for filtering; adding short and full unit test suites ( #235 )
...
[ROCm/rccl commit: 684f3e6af4 ]
2020-07-21 12:19:47 -06:00
Wenkai Du
ac87c9db37
gtest: extend testing up to 8 GPUs
...
[ROCm/rccl commit: 8db0aa8f4c ]
2020-06-29 09:32:31 -07:00
Wenkai Du
d1f8fdc3a8
gtest: add scatter, gather and all to all unit tests
...
[ROCm/rccl commit: fee1a20b74 ]
2020-06-09 17:44:15 -07:00
Stanley Tsang
e5419407c4
Updating copyright notices for 2020.
...
[ROCm/rccl commit: 20fa04d9b6 ]
2020-01-29 15:28:08 -08:00
gilbertlee-amd
71635198b8
Removing OpenMP from unit tests ( #163 )
...
[ROCm/rccl commit: 000bce6f27 ]
2019-12-20 11:41:56 -07:00
Wenkai Du
7dc39b8928
Add bfloat16 all reduce unit test
...
[ROCm/rccl commit: bdac0256a5 ]
2019-11-18 13:50:29 -08:00
Gilbert Lee
57ac9a8a93
Adding support for alignment tests via sub-datasets
...
Added sample alignment test for AllGather
Datasets no longer free memory on destruction so Release() must be used
[ROCm/rccl commit: a50c852851 ]
2019-05-18 00:04:03 +00:00
Gilbert Lee
0e17bed9e6
Fixing GoogleTest to 1.8.1 and making changes to tests to support older API
...
[ROCm/rccl commit: 08fcce5ec9 ]
2019-05-16 23:13:49 +00:00
Gilbert Lee
60f91f645d
Updating RCCL based on NCCL 2.3.7
...
- Contains modifications to support AMD hardware
- Adds unit tests
[ROCm/rccl commit: 55a4b22ad7 ]
2019-05-16 16:16:18 +00:00