This website requires JavaScript.
Explore
Help
Sign In
AI
/
amd-strix-halo-vllm-toolboxes
Watch
2
Star
0
Fork
0
You've already forked amd-strix-halo-vllm-toolboxes
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
88
Commits
3
Branches
0
Tags
main
Commit Graph
6 Commits
Author
SHA1
Message
Date
Donato Capitella
6875f62ccf
improve benchmarks
2026-02-25 09:29:46 +00:00
Donato Capitella
91b6dbc270
feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information
2026-02-22 20:07:34 +00:00
Donato Capitella
4a5d6c7855
fix broken stuff
2026-02-19 20:29:28 +00:00
Donato Capitella
1ddcb9a202
feat: Configure ROCm attention via
--attention-backend
CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks.
2026-02-02 15:40:16 +00:00
Donato Capitella
6f118ff936
feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.
2026-02-02 07:35:50 +00:00
Donato Capitella
c587981d73
refactor: Centralize Ray/vLLM cluster management into a new
cluster_manager.py
module and refactor
start_vllm_cluster.py
to use it.
2026-02-01 22:19:34 +00:00