此网站需要 JavaScript。
探索
帮助
登录
AI
/
amd-strix-halo-vllm-toolboxes
关注
2
点赞
0
派生
0
您已经派生过 amd-strix-halo-vllm-toolboxes
代码
工单
合并请求
工作流
软件包
项目
发布
百科
活动
45
提交
3
分支
0
Git 标签
128ddade14f4ad9ded59d40069093679a910a748
提交图
4 次代码提交
作者
SHA1
备注
提交日期
Donato Capitella
128ddade14
fix: improve RDMA stability by configuring NCCL IB timeout and retry count.
2026-02-01 22:04:34 +00:00
Donato Capitella
0d8afba093
feat: Add
RAY_DISABLE_METRICS=1
to disable Ray metrics across cluster configurations and scripts.
2026-02-01 21:52:48 +00:00
Donato Capitella
ba503f6e61
feat: centralize model configurations and benchmark settings into a new
models.py
module and update Dockerfile and scripts to use it.
2026-02-01 21:17:15 +00:00
Donato Capitella
e5cc96bf48
feat: Introduce vLLM cluster benchmarking and setup scripts, and expand the list of models for local benchmarks.
2026-02-01 15:43:56 +00:00