Commit graph

11 Commits

Author SHA1 Message Date
Donato Capitella 16405e8943 config: Add VLLM_DISABLE_COMPILE_CACHE=1 to environment variables across VLLM scripts. 2026-03-09 14:07:43 +00:00
Donato Capitella 6875f62ccf improve benchmarks 2026-02-25 09:29:46 +00:00
Donato Capitella 91b6dbc270 feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information 2026-02-22 20:07:34 +00:00
Donato Capitella 4a5d6c7855 fix broken stuff 2026-02-19 20:29:28 +00:00
Donato Capitella 49b85fc1fb add MiniMax 2026-02-18 15:22:12 +00:00
Donato Capitella 1f96c391fb feat: Add comprehensive RDMA cluster setup guide, enforce eager mode in cluster benchmarks, and update documentation with cluster details. 2026-02-02 19:34:33 +00:00
Donato Capitella c587981d73 refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it. 2026-02-01 22:19:34 +00:00
Donato Capitella 128ddade14 fix: improve RDMA stability by configuring NCCL IB timeout and retry count. 2026-02-01 22:04:34 +00:00
Donato Capitella 0d8afba093 feat: Add RAY_DISABLE_METRICS=1 to disable Ray metrics across cluster configurations and scripts. 2026-02-01 21:52:48 +00:00
Donato Capitella ba503f6e61 feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it. 2026-02-01 21:17:15 +00:00
Donato Capitella e5cc96bf48 feat: Introduce vLLM cluster benchmarking and setup scripts, and expand the list of models for local benchmarks. 2026-02-01 15:43:56 +00:00