Tato stránka vyžaduje JavaScript.
Procházet
Nápověda
Přihlásit se
AI
/
amd-strix-halo-vllm-toolboxes
Sledovat
2
Oblíbit
0
Rozštěpit
0
Již jsi rozštěpil amd-strix-halo-vllm-toolboxes
Zdrojový kód
Úkoly
Pull requesty
Akce
Balíčky
Projekty
Vydání
Wiki
Aktivita
Files
9c6d32e32679cb5ecbe94fd0a52c55df74066af2
amd-strix-halo-vllm-toolboxes
/
benchmarks
T
Historie
Donato Capitella
9c6d32e326
updating max context results
2026-02-02 11:56:26 +00:00
..
benchmark_results
updates
2025-12-20 11:37:06 +00:00
benchmark_results_rocm_attn
/benchmark_results
added ROCm/Triton attention comparison
2025-12-20 11:49:03 +00:00
find_max_context.py
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00
max_context_results.json
updating max context results
2026-02-02 11:56:26 +00:00
run_vllm_bench.py
feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.
2026-02-02 07:35:50 +00:00
vllm_cluster_bench.py
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00