AI/amd-strix-halo-vllm-toolboxes
c587981d73439fe1c839584c29c087909bf0d4a8
amd-strix-halo-vllm-toolboxes/benchmarks
Donato Capitella c587981d73 refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it.
2026-02-01 22:19:34 +00:00
| Name | Last commit | Date |
| --- | --- | --- |
| benchmark_results | updates | 2025-12-20 11:37:06 +00:00 |
| benchmark_results_rocm_attn/benchmark_results | added ROCm/Triton attention comparison | 2025-12-20 11:49:03 +00:00 |
| find_max_context.py | updates | 2025-12-20 11:37:06 +00:00 |
| max_context_results.json | feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies. | 2026-02-01 19:36:07 +00:00 |
| run_vllm_bench.py | feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it. | 2026-02-01 21:17:15 +00:00 |
| vllm_cluster_bench.py | refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it. | 2026-02-01 22:19:34 +00:00 |