Logo
Explorateur Aide
Connexion
AI/amd-strix-halo-vllm-toolboxes
2
0
Bifurcation 0
Vous avez déjà forké amd-strix-halo-vllm-toolboxes
Code Tickets Demandes d'ajout Actions Paquets Projets Publications Wiki Activité
Fichiers
afe985afca29c6da7b81274e0035c7f143a4ab22
amd-strix-halo-vllm-toolboxes/benchmarks
T
Historique
Donato Capitella fde8f520d9 feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
..
benchmark_results
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
benchmark_results_rocm
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
find_max_context.py
feat: Optimize model max_num_seqs and global benchmark parameters for Strix Halo, and centralize configurations in models.py.
2026-02-02 08:45:13 +00:00
max_context_results.json
updating max context results
2026-02-02 11:56:26 +00:00
run_vllm_bench.py
feat: Configure ROCm attention via --attention-backend CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks.
2026-02-02 15:40:16 +00:00
vllm_cluster_bench.py
feat: Extract benchmark output file path generation into a helper function and add checks to skip runs if results already exist.
2026-02-03 08:28:21 +00:00
© 2020 badstorm.xyz - : 1.26.2
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어