amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results/meta-llama_Meta-Llama-3.1-8B-Instruct_cluster_tp2_throughput.json 位于 4d3b046870f1daa0c59700d7ed9856e3c766d4ca - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

文件

T

Donato Capitella 4d3b046870 feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00

7 行

190 B

JSON

原始文件 Blame 文件历史

 {
     "elapsed_time": 193.03236384499905,
     "num_requests": 100,
     "total_num_tokens": 74504,
     "requests_per_second": 0.5180478444552329,
     "tokens_per_second": 385.96636603292677
 }