amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results_rocm/btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_cluster_tp2_throughput.json at 4d3b046870f1daa0c59700d7ed9856e3c766d4ca - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

Donato Capitella 4d3b046870 feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00

 {
     "elapsed_time": 438.29837328500435,
     "num_requests": 100,
     "total_num_tokens": 75285,
     "requests_per_second": 0.2281550790401287,
     "tokens_per_second": 171.7665512553609
 }
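
The two rate fields above appear to be derived from the other three: dividing `num_requests` and `total_num_tokens` by `elapsed_time` (assumed here to be in seconds) reproduces `requests_per_second` and `tokens_per_second`. The following is a minimal sketch of that consistency check, not the benchmark's own code. The helper name `derived_rates` is hypothetical, and the file does not state whether `total_num_tokens` counts prompt tokens, generated tokens, or both.

```python
import json
import math

# The benchmark result exactly as it appears in the file above.
RAW_RESULT = """
{
    "elapsed_time": 438.29837328500435,
    "num_requests": 100,
    "total_num_tokens": 75285,
    "requests_per_second": 0.2281550790401287,
    "tokens_per_second": 171.7665512553609
}
"""


def derived_rates(result: dict) -> dict:
    """Recompute the per-second rates from the raw counters.

    Hypothetical helper: assumes elapsed_time is wall-clock seconds
    covering all requests in the run.
    """
    elapsed = result["elapsed_time"]
    return {
        "requests_per_second": result["num_requests"] / elapsed,
        "tokens_per_second": result["total_num_tokens"] / elapsed,
        # Mean tokens per request, a figure the file does not store.
        "tokens_per_request": result["total_num_tokens"] / result["num_requests"],
    }


result = json.loads(RAW_RESULT)
rates = derived_rates(result)

# Compare recomputed values with the stored ones, allowing for float rounding.
for key in ("requests_per_second", "tokens_per_second"):
    matches = math.isclose(rates[key], result[key], rel_tol=1e-6)
    print(f"{key}: stored={result[key]} recomputed={rates[key]} match={matches}")

print(f"tokens_per_request: {rates['tokens_per_request']}")
```

The same check can be pointed at any of the other result files in the directory by loading them with `json.load` instead of the inline string, provided they share these five keys.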