Šai tīmekļvietnei ir nepieciešams JavaScript.
Izpētīt
Palīdzība
Pieteikties
AI
/
amd-strix-halo-vllm-toolboxes
Vērot
2
Pievienot izlasei
0
Atdalīts
0
Repozitorijs amd-strix-halo-vllm-toolboxes jau ir atdalīts
Kods
Problēmas
Izmaiņu pieprasījumi
Darbības
Pakotnes
Projekti
Laidieni
Vikivietne
Aktivitāte
Files
backup-before-cleanup
amd-strix-halo-vllm-toolboxes
/
benchmarks
T
Pievienot
Jauna datne
Augšupielādēt failu
Pielietot ielāpu
Kopēt saiti
Download directory as ZIP
Download directory as TAR.GZ
Delete Directory
Vēsture
Donato Capitella
fde8f520d9
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
..
benchmark_results
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
benchmark_results_rocm
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
find_max_context.py
feat: Optimize model
max_num_seqs
and global benchmark parameters for Strix Halo, and centralize configurations in
models.py
.
2026-02-02 08:45:13 +00:00
max_context_results.json
updating max context results
2026-02-02 11:56:26 +00:00
run_vllm_bench.py
feat: Configure ROCm attention via
--attention-backend
CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks.
2026-02-02 15:40:16 +00:00
vllm_cluster_bench.py
feat: Extract benchmark output file path generation into a helper function and add checks to skip runs if results already exist.
2026-02-03 08:28:21 +00:00