| akolliasAMD | 9bba4a2f2a | added npkit support into the all_gather run ring algorithm (#790) | 2023-06-29 13:59:54 -06:00 |
| akolliasAMD | 9bdf6797a5 | fixed npkit size to never be a negative number (#779) | 2023-06-21 08:26:40 -06:00 |
| akolliasAMD | 9cdac774ea | Wall clock update and npkit trace script Update (#771): changed builtin clock to wall_clock64; updated npkit_Trace_generator to the new version of npkit | 2023-06-07 17:47:10 -06:00 |
| Ziyue Yang | 7d6e7bcd7d | revert npkit (#748) | 2023-05-24 07:41:05 -07:00 |
| Ziyue Yang | f4bf47f325 | NPKit: improve clock calibration and fix GPU clock API (#683): Improve clock calibration in NPKit; Improve gfx macro; Fix macro | 2023-02-17 12:26:57 -07:00 |
| Ziyue Yang | 7d6bbc19d4 | apply npkit | 2022-10-14 01:28:17 +00:00 |
| Ziyue Yang | 6e93fafdc3 | Add Feature - Add NPKit Support in RCCL (#564): apply npkit; fix bug; add npkit in readme | 2022-06-20 14:30:19 -07:00 |