amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results/btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_cluster_tp2_throughput.json at 4d3b046870f1daa0c59700d7ed9856e3c766d4ca - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

Files

T

Donato Capitella 4d3b046870 feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00

7 lines

189 B

JSON

Raw Blame History

 {
     "elapsed_time": 442.1101265470061,
     "num_requests": 100,
     "total_num_tokens": 75285,
     "requests_per_second": 0.2261879880043141,
     "tokens_per_second": 170.28562676904787
 }