Для роботи цього сайту потрібен JavaScript.
Огляд
Довідка
Увійти
AI
/
amd-strix-halo-vllm-toolboxes
Стежити
2
В обрані
0
Форк
0
You've already forked amd-strix-halo-vllm-toolboxes
Код
Задачі
Запити на злиття
Дії
Пакети
Проєкти
Релізи
Вікі
Активність
Файли
8de950d9ca404e65bbf9eaa602a362e6df941ba7
amd-strix-halo-vllm-toolboxes
/
scripts
T
Історія
Donato Capitella
8de950d9ca
feat: Override
_get_gcn_arch
function to return "gfx1151" and rename the original implementation to
_old_get_gcn_arch
.
2026-03-09 12:13:27 +00:00
..
01-rocm-env-for-triton.sh
updated envs for better strix halo support on vllm
2025-12-19 08:30:02 +00:00
99-toolbox-banner.sh
feat: Introduce vLLM cluster benchmarking and setup scripts, and expand the list of models for local benchmarks.
2026-02-01 15:43:56 +00:00
build_rccl_gfx1151.sh
feat: Introduce custom RCCL library management for gfx1151, including build scripts, Docker integration, and VLLM benchmarks.
2026-02-01 13:23:10 +00:00
cluster_manager.py
improve benchmarks
2026-02-25 09:29:46 +00:00
configure_cluster.sh
feat: Add
RAY_DISABLE_METRICS=1
to disable Ray metrics across cluster configurations and scripts.
2026-02-01 21:52:48 +00:00
generate_readme_table.py
feat: Add script to automate README benchmark table generation and update max context benchmarks with new models and a kernel parameter change.
2026-02-02 22:32:12 +00:00
install_deps.sh
Downgrade Python to 3.12 and remove the
--no-deps
flag from a pip install command in the Dockerfile.
2026-03-09 11:08:11 +00:00
install_rocm_sdk.sh
fixing
https://github.com/kyuz0/amd-strix-halo-vllm-toolboxes/issues/21
2026-02-26 12:36:03 +00:00
manage_rccl_install.sh
feat: Introduce custom RCCL library management for gfx1151, including build scripts, Docker integration, and VLLM benchmarks.
2026-02-01 13:23:10 +00:00
measure_bandwidth.sh
feat: Introduce
measure_bandwidth.sh
script, install
perfquery
, and add the script to the Docker image for RDMA bandwidth monitoring.
2026-02-07 10:40:53 +00:00
models.py
force egaer mode to make gemma stable
2026-02-23 18:19:15 +00:00
patch_strix.py
feat: Override
_get_gcn_arch
function to return "gfx1151" and rename the original implementation to
_old_get_gcn_arch
.
2026-03-09 12:13:27 +00:00
start_vllm_cluster.py
improve benchmarks
2026-02-25 09:29:46 +00:00
start_vllm.py
updated benchmarks, fix start-vllm
2026-02-23 19:39:19 +00:00
zz-venv-last.sh
Updating toolbox and pushing GitHub Action
2025-11-30 14:57:37 +00:00