AI / amd-strix-halo-vllm-toolboxes
81 commits, 3 branches, 0 tags
8a20ec27b228905ffda2c8fbdf9abb121ce90464
6 commits
| Author | SHA1 | Message | Date |
|---|---|---|---|
| Donato Capitella | 6875f62ccf | improve benchmarks | 2026-02-25 09:29:46 +00:00 |
| Donato Capitella | 91b6dbc270 | feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information | 2026-02-22 20:07:34 +00:00 |
| Donato Capitella | 4a5d6c7855 | fix broken stuff | 2026-02-19 20:29:28 +00:00 |
| Donato Capitella | 1ddcb9a202 | feat: Configure ROCm attention via `--attention-backend` CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks. | 2026-02-02 15:40:16 +00:00 |
| Donato Capitella | 6f118ff936 | feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory. | 2026-02-02 07:35:50 +00:00 |
| Donato Capitella | c587981d73 | refactor: Centralize Ray/vLLM cluster management into a new `cluster_manager.py` module and refactor `start_vllm_cluster.py` to use it. | 2026-02-01 22:19:34 +00:00 |