AI/amd-strix-halo-vllm-toolboxes
fde8f520d9bc6ed34d48ae959285bc64e8c8c693
amd-strix-halo-vllm-toolboxes/docs
Donato Capitella | fde8f520d9 | feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200. | 2026-02-03 08:31:54 +00:00
..
assets | updates | 2025-12-20 11:37:06 +00:00
index.html | perf: Increase max_num_seqs for bus batch scaling and OFF_NUM_PROMPTS for steady-state throughput measurement on Strix Halo. | 2026-02-02 22:36:15 +00:00
parse_results.py | feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism. | 2026-02-02 21:30:17 +00:00
results.json | feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200. | 2026-02-03 08:31:54 +00:00