Bu web sitesinin çalışması için JavaScript gereklidir.
Keşfet
Yardım
Giriş Yap
AI
/
amd-strix-halo-vllm-toolboxes
İzle
2
Yıldızla
0
Çatalla
0
amd-strix-halo-vllm-toolboxes deposunu zaten çatalladınız
Kod
Konular
Değişiklik İstekleri
İşlemler
Paketler
Projeler
Sürüm
Viki
Aktivite
Dosyalar
ba503f6e61e344874aac5daa7f576af2357c0fdb
amd-strix-halo-vllm-toolboxes
/
benchmarks
T
Geçmiş
Donato Capitella
ba503f6e61
feat: centralize model configurations and benchmark settings into a new
models.py
module and update Dockerfile and scripts to use it.
2026-02-01 21:17:15 +00:00
..
benchmark_results
updates
2025-12-20 11:37:06 +00:00
benchmark_results_rocm_attn
/benchmark_results
added ROCm/Triton attention comparison
2025-12-20 11:49:03 +00:00
find_max_context.py
updates
2025-12-20 11:37:06 +00:00
max_context_results.json
feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.
2026-02-01 19:36:07 +00:00
run_vllm_bench.py
feat: centralize model configurations and benchmark settings into a new
models.py
module and update Dockerfile and scripts to use it.
2026-02-01 21:17:15 +00:00
vllm_cluster_bench.py
feat: centralize model configurations and benchmark settings into a new
models.py
module and update Dockerfile and scripts to use it.
2026-02-01 21:17:15 +00:00