amd-strix-halo-vllm-toolboxes

Auteur	SHA1	Message	Date
Donato Capitella	6875f62ccf	improve benchmarks	2026-02-25 09:29:46 +00:00
Donato Capitella	91b6dbc270	feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information	2026-02-22 20:07:34 +00:00
Donato Capitella	4a5d6c7855	fix broken stuff	2026-02-19 20:29:28 +00:00
Donato Capitella	1ddcb9a202	feat: Configure ROCm attention via `--attention-backend` CLI argument, disable the Ray dashboard, and make eager mode configurable for cluster benchmarks.	2026-02-02 15:40:16 +00:00
Donato Capitella	6f118ff936	feat: Update ROCm benchmark result paths, improve cluster node discovery and cache clearing, and refine cluster benchmark result directory.	2026-02-02 07:35:50 +00:00
Donato Capitella	c587981d73	refactor: Centralize Ray/vLLM cluster management into a new `cluster_manager.py` module and refactor `start_vllm_cluster.py` to use it.	2026-02-01 22:19:34 +00:00