amd-strix-halo-vllm-toolboxes/benchmarks at c587981d73439fe1c839584c29c087909bf0d4a8 - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

Форкнуть 0

Files

T

Donato Capitella c587981d73 refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it.

2026-02-01 22:19:34 +00:00

..

benchmark_results

updates

2025-12-20 11:37:06 +00:00

benchmark_results_rocm_attn/benchmark_results

added ROCm/Triton attention comparison

2025-12-20 11:49:03 +00:00

find_max_context.py

updates

2025-12-20 11:37:06 +00:00

max_context_results.json

feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.

2026-02-01 19:36:07 +00:00

run_vllm_bench.py

feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it.

2026-02-01 21:17:15 +00:00

vllm_cluster_bench.py

refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it.

2026-02-01 22:19:34 +00:00