amd-strix-halo-vllm-toolboxes/benchmarks at 128ddade14f4ad9ded59d40069093679a910a748

AI/amd-strix-halo-vllm-toolboxes

Donato Capitella 128ddade14 fix: improve RDMA stability by configuring NCCL IB timeout and retry count.

2026-02-01 22:04:34 +00:00

benchmark_results
    updates
    2025-12-20 11:37:06 +00:00

benchmark_results_rocm_attn/benchmark_results
    added ROCm/Triton attention comparison
    2025-12-20 11:49:03 +00:00

find_max_context.py
    updates
    2025-12-20 11:37:06 +00:00

max_context_results.json
    feat: Enhance vLLM benchmarking to compare Triton and ROCm attention, introduce a new script for cluster configuration, and update Dockerfile for new tools and dependencies.
    2026-02-01 19:36:07 +00:00

run_vllm_bench.py
    feat: centralize model configurations and benchmark settings into a new models.py module and update Dockerfile and scripts to use it.
    2026-02-01 21:17:15 +00:00

vllm_cluster_bench.py
    fix: improve RDMA stability by configuring NCCL IB timeout and retry count.
    2026-02-01 22:04:34 +00:00