akolliasAMD
8438fd9e42
Wall clock update and npkit trace script Update ( #771 )
...
* changed builtin clock to wall_clock64
* updated npkit_Trace_generator to the new version of npkit
[ROCm/rccl commit: 9cdac774ea ]
2023-06-07 17:47:10 -06:00
gilbertlee-amd
35872115f8
Updating stream caching ( #614 )
...
- Adding non-captured hipStream for use in setup
[ROCm/rccl commit: dd56135a9a ]
2022-09-09 16:30:15 -06:00
Wenkai Du
adff98765c
Merge remote-tracking branch 'nccl/master' into no-target-id
...
[ROCm/rccl commit: d469947641 ]
2021-01-14 19:27:53 -05:00
Wenkai Du
49c667ac8a
ib-test: support multiple channels ( #241 )
...
[ROCm/rccl commit: 8d5fb920b6 ]
2020-07-27 11:03:12 -07:00
Wenkai Du
e7d664200f
Use posix_memalign for network buffer allocation on host memory ( #221 )
...
* Use posix_memalign for network buffer allocation on host memory
* ib-test: add ability to specify run iterations
* ib-test: define iterations as multiple of default cycles
* Add checking to posix_memalign return value
[ROCm/rccl commit: 0eb19a563a ]
2020-06-22 13:06:25 -07:00
Wenkai Du
2dc0cfcbf5
ib-test: support host memory allocation through posix_memalign ( #220 )
...
* ib-test: support host memory allocation through posix_memalign
* ib-test: add missing CUDACHECK to hip calls
[ROCm/rccl commit: dc739c4e70 ]
2020-06-17 16:16:54 -07:00
Wenkai Du
9ca3176346
Add IB/RDMA unit test
...
[ROCm/rccl commit: cfa97eccd3 ]
2020-06-16 18:29:17 +00:00