Files
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results_rocm/meta-llama_Meta-Llama-3.1-8B-Instruct_cluster_tp2_throughput.json
T

7 rindas
189 B
JSON

{
"elapsed_time": 195.17220506099693,
"num_requests": 100,
"total_num_tokens": 74504,
"requests_per_second": 0.5123680391311207,
"tokens_per_second": 381.7346838742502
}