amd-strix-halo-vllm-toolboxes

Автор	SHA1	Сообщение	Дата
Donato Capitella	16405e8943	config: Add VLLM_DISABLE_COMPILE_CACHE=1 to environment variables across VLLM scripts.	2026-03-09 14:07:43 +00:00
Donato Capitella	6875f62ccf	improve benchmarks	2026-02-25 09:29:46 +00:00
Donato Capitella	91b6dbc270	feat: Display environment variables and allow to choose between RoCE/Ethernet and show RCCL debug information	2026-02-22 20:07:34 +00:00
Donato Capitella	4a5d6c7855	fix broken stuff	2026-02-19 20:29:28 +00:00
Donato Capitella	49b85fc1fb	add MiniMax	2026-02-18 15:22:12 +00:00
Donato Capitella	1f96c391fb	feat: Add comprehensive RDMA cluster setup guide, enforce eager mode in cluster benchmarks, and update documentation with cluster details.	2026-02-02 19:34:33 +00:00
Donato Capitella	c587981d73	refactor: Centralize Ray/vLLM cluster management into a new `cluster_manager.py` module and refactor `start_vllm_cluster.py` to use it.	2026-02-01 22:19:34 +00:00
Donato Capitella	128ddade14	fix: improve RDMA stability by configuring NCCL IB timeout and retry count.	2026-02-01 22:04:34 +00:00
Donato Capitella	0d8afba093	feat: Add `RAY_DISABLE_METRICS=1` to disable Ray metrics across cluster configurations and scripts.	2026-02-01 21:52:48 +00:00
Donato Capitella	ba503f6e61	feat: centralize model configurations and benchmark settings into a new `models.py` module and update Dockerfile and scripts to use it.	2026-02-01 21:17:15 +00:00
Donato Capitella	e5cc96bf48	feat: Introduce vLLM cluster benchmarking and setup scripts, and expand the list of models for local benchmarks.	2026-02-01 15:43:56 +00:00

11 Коммитов