AI/amd-strix-halo-vllm-toolboxes
c587981d73439fe1c839584c29c087909bf0d4a8
amd-strix-halo-vllm-toolboxes/benchmarks
Donato Capitella c587981d73 refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it.
2026-02-01 22:19:34 +00:00
Name | Last commit | Date
benchmark_results | updates | 2025-12-20 11:37:06 +00:00
benchmark_results_rocm_attn/benchmark_results | added ROCm/Triton attention comparison | 2025-12-20 11:49:03 +00:00
find_max_context.py | updates | 2025-12-20 11:37:06 +00:00
max_context_results.json | feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies. | 2026-02-01 19:36:07 +00:00
run_vllm_bench.py | feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it. | 2026-02-01 21:17:15 +00:00
vllm_cluster_bench.py | refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it. | 2026-02-01 22:19:34 +00:00