Files
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results/cpatonn_Qwen3-Coder-30B-A3B-Instruct-GPTQ-4bit_tp1_throughput.json
T
Donato Capitella 5e8b6bb545 updates
2025-12-20 11:37:06 +00:00

7 rader
190 B
JSON

{
"elapsed_time": 540.2676798280002,
"num_requests": 200,
"total_num_tokens": 146805,
"requests_per_second": 0.37018686748700586,
"tokens_per_second": 271.7264154071495
}