AI/amd-strix-halo-vllm-toolboxes
79 commits, 3 branches, 0 tags
b035bcb482c2e12bf8a48ecabc660b6f9d55b76e
Commit graph
6 commits
| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Donato Capitella | b035bcb482 | updated benchmarks including thunderbolt and configuration guides | 2026-02-25 10:48:42 +00:00 |
| Donato Capitella | e726d406fa | updated benchmarks, fix start-vllm | 2026-02-23 19:39:19 +00:00 |
| Donato Capitella | fde8f520d9 | feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200. | 2026-02-03 08:31:54 +00:00 |
| Donato Capitella | 4d3b046870 | feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism. | 2026-02-02 21:30:17 +00:00 |
| Donato Capitella | 711de530f6 | added ROCm/Triton attention comparison | 2025-12-20 11:49:03 +00:00 |
| Donato Capitella | 5e8b6bb545 | updates | 2025-12-20 11:37:06 +00:00 |