Este sitio web requiere JavaScript.
Explorar
Ayuda
Iniciar sesión
AI
/
amd-strix-halo-vllm-toolboxes
Seguir
2
Destacar
0
Fork
0
Ya ha forkeado amd-strix-halo-vllm-toolboxes
Código
Incidencias
Pull Requests
Acciones
Paquetes
Proyectos
Lanzamientos
Wiki
Actividad
Files
0109e6a19b746c2292fa63a7732c9d974e4bf727
amd-strix-halo-vllm-toolboxes
/
benchmarks
T
Histórico
Donato Capitella
0109e6a19b
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00
..
benchmark_results
updates
2025-12-20 11:37:06 +00:00
benchmark_results_rocm_attn
/benchmark_results
added ROCm/Triton attention comparison
2025-12-20 11:49:03 +00:00
find_max_context.py
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00
max_context_results.json
feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.
2026-02-01 19:36:07 +00:00
run_vllm_bench.py
feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.
2026-02-02 07:35:50 +00:00
vllm_cluster_bench.py
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00