2
0
Ficheiros
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results_rocm/dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_cluster_tp2_throughput.json
2026-02-23 19:39:19 +00:00

7 linhas
190 B
JSON

{
"elapsed_time": 757.2171181479935,
"num_requests": 200,
"total_num_tokens": 146805,
"requests_per_second": 0.2641250378612165,
"tokens_per_second": 193.87438091607942
}