文件
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results_rocm/dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_tp1_throughput.json
T

7 行
190 B
JSON

{
"elapsed_time": 1064.0833694909998,
"num_requests": 100,
"total_num_tokens": 75285,
"requests_per_second": 0.09397759881148657,
"tokens_per_second": 70.75103526522766
}