| Donato Capitella | 16405e8943 | config: Add VLLM_DISABLE_COMPILE_CACHE=1 to environment variables across VLLM scripts. | 2026-03-09 14:07:43 +00:00 |
| Donato Capitella | 6875f62ccf | improve benchmarks | 2026-02-25 09:29:46 +00:00 |
| Donato Capitella | 91b6dbc270 | feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information | 2026-02-22 20:07:34 +00:00 |
| Donato Capitella | 4a5d6c7855 | fix broken stuff | 2026-02-19 20:29:28 +00:00 |
| Donato Capitella | 49b85fc1fb | add MiniMax | 2026-02-18 15:22:12 +00:00 |
| Donato Capitella | 1f96c391fb | feat: Add comprehensive RDMA cluster setup guide, enforce eager mode in cluster benchmarks, and update documentation with cluster details. | 2026-02-02 19:34:33 +00:00 |
| Donato Capitella | c587981d73 | refactor: Centralize Ray/vLLM cluster management into a new cluster_manager.py module and refactor start_vllm_cluster.py to use it. | 2026-02-01 22:19:34 +00:00 |
| Donato Capitella | 128ddade14 | fix: improve RDMA stability by configuring NCCL IB timeout and retry count. | 2026-02-01 22:04:34 +00:00 |
| Donato Capitella | 0d8afba093 | feat: Add RAY_DISABLE_METRICS=1 to disable Ray metrics across cluster configurations and scripts. | 2026-02-01 21:52:48 +00:00 |
| Donato Capitella | ba503f6e61 | feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it. | 2026-02-01 21:17:15 +00:00 |
| Donato Capitella | e5cc96bf48 | feat: Introduce vLLM cluster benchmarking and setup scripts, and expand the list of models for local benchmarks. | 2026-02-01 15:43:56 +00:00 |