amd-strix-halo-vllm-toolboxes/benchmarks at 0109e6a19b746c2292fa63a7732c9d974e4bf727 - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

Files

T

Histórico

Donato Capitella 0109e6a19b feat: Optimize model max_num_seqs and global benchmark parameters for Strix Halo, and centralize configurations in models.py.

2026-02-02 08:45:13 +00:00

..

benchmark_results

updates

2025-12-20 11:37:06 +00:00

benchmark_results_rocm_attn/benchmark_results

added ROCm/Triton attention comparison

2025-12-20 11:49:03 +00:00

find_max_context.py

feat: Optimize model max_num_seqs and global benchmark parameters for Strix Halo, and centralize configurations in models.py.

2026-02-02 08:45:13 +00:00

max_context_results.json

feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.

2026-02-01 19:36:07 +00:00

run_vllm_bench.py

feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.

2026-02-02 07:35:50 +00:00

vllm_cluster_bench.py

feat: Optimize model max_num_seqs and global benchmark parameters for Strix Halo, and centralize configurations in models.py.

2026-02-02 08:45:13 +00:00