Files
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results/btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_tp1_throughput.json
T
2026-02-23 19:39:19 +00:00

7 regels
191 B
JSON

{
"elapsed_time": 686.8188757880125,
"num_requests": 200,
"total_num_tokens": 146805,
"requests_per_second": 0.29119758796747197,
"tokens_per_second": 213.74630950782364
}