Ce site Web nécessite JavaScript.
Explorateur
Aide
Connexion
AI
/
amd-strix-halo-vllm-toolboxes
Suivre
2
Ajouter aux favoris
0
Bifurcation
0
Vous avez déjà forké amd-strix-halo-vllm-toolboxes
Code
Tickets
Demandes d'ajout
Actions
Paquets
Projets
Publications
Wiki
Activité
88
Révisions
3
Branches
0
Étiquette
039363b819646fab09fb73b12e787152de4c58ea
Graphe des révisions
6 Révisions
Auteur
SHA1
Message
Date
Donato Capitella
6875f62ccf
improve benchmarks
2026-02-25 09:29:46 +00:00
Donato Capitella
91b6dbc270
feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information
2026-02-22 20:07:34 +00:00
Donato Capitella
4a5d6c7855
fix broken stuff
2026-02-19 20:29:28 +00:00
Donato Capitella
1ddcb9a202
feat: Configure ROCm attention via
--attention-backend
CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks.
2026-02-02 15:40:16 +00:00
Donato Capitella
6f118ff936
feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.
2026-02-02 07:35:50 +00:00
Donato Capitella
c587981d73
refactor: Centralize Ray/vLLM cluster management into a new
cluster_manager.py
module and refactor
start_vllm_cluster.py
to use it.
2026-02-01 22:19:34 +00:00