amd-strix-halo-vllm-toolboxes/docs at fde8f520d9bc6ed34d48ae959285bc64e8c8c693 - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

Donato Capitella fde8f520d9 feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.

2026-02-03 08:31:54 +00:00

updates

2025-12-20 11:37:06 +00:00

index.html

perf: Increase max_num_seqs for bus batch scaling and OFF_NUM_PROMPTS for steady-state throughput measurement on Strix Halo.

2026-02-02 22:36:15 +00:00

parse_results.py

feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00

results.json

feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.

2026-02-03 08:31:54 +00:00