This website requires JavaScript.
Odkrywaj
Pomoc
Zaloguj się
AI
/
amd-strix-halo-vllm-toolboxes
Obserwuj
2
Polub
0
Forkuj
0
You've already forked amd-strix-halo-vllm-toolboxes
Kod
Zgłoszenia
Oczekujące zmiany
Actions
Packages
Projekty
Wydania
Wiki
Aktywność
54
Commity
3
Gałęzie
0
Tagi
693757f5d945bb9f28949ea52ce76c7ea3cf1e42
Wykres commitów
6 Commity
Autor
SHA1
Wiadomość
Data
Donato Capitella
4d3b046870
feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.
2026-02-02 21:30:17 +00:00
Donato Capitella
1f96c391fb
feat: Add comprehensive RDMA cluster setup guide, enforce eager mode in cluster benchmarks, and update documentation with cluster details.
2026-02-02 19:34:33 +00:00
Donato Capitella
6f118ff936
feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.
2026-02-02 07:35:50 +00:00
Donato Capitella
a1105a0b96
feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.
2026-02-01 19:36:07 +00:00
Donato Capitella
711de530f6
added ROCm/Triton attention comparison
2025-12-20 11:49:03 +00:00
Donato Capitella
5e8b6bb545
updates
2025-12-20 11:37:06 +00:00