amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results_rocm/btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_tp1_throughput.json at 4d3b046870f1daa0c59700d7ed9856e3c766d4ca

Donato Capitella 4d3b046870 feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00


 {
     "elapsed_time": 621.1952276929987,
     "num_requests": 100,
     "total_num_tokens": 75285,
     "requests_per_second": 0.16097998751758127,
     "tokens_per_second": 121.19378360261106
 }
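
The two rate fields appear to be derived from the other three. A minimal sketch of that check, assuming `elapsed_time` is in seconds and that each rate is a plain ratio over it (the file itself does not state this); the variable names `derived_rps`, `derived_tps`, and `mean_tokens_per_request` are illustrative and not part of the benchmark tooling:

```python
import json
import math

# The benchmark result exactly as recorded in the file above.
raw = """
{
    "elapsed_time": 621.1952276929987,
    "num_requests": 100,
    "total_num_tokens": 75285,
    "requests_per_second": 0.16097998751758127,
    "tokens_per_second": 121.19378360261106
}
"""
result = json.loads(raw)

# Assumption: both rates are simple ratios over elapsed_time (seconds).
derived_rps = result["num_requests"] / result["elapsed_time"]
derived_tps = result["total_num_tokens"] / result["elapsed_time"]

# Average number of tokens counted per request in this run.
mean_tokens_per_request = result["total_num_tokens"] / result["num_requests"]

# Compare the derived values with the recorded ones.
rps_matches = math.isclose(derived_rps, result["requests_per_second"], rel_tol=1e-9)
tps_matches = math.isclose(derived_tps, result["tokens_per_second"], rel_tol=1e-9)

print(f"requests/s: recorded={result['requests_per_second']:.6f} derived={derived_rps:.6f} match={rps_matches}")
print(f"tokens/s:   recorded={result['tokens_per_second']:.3f} derived={derived_tps:.3f} match={tps_matches}")
print(f"mean tokens per request: {mean_tokens_per_request:.2f}")
```

Under those assumptions the recorded rates agree with the derived ones, so the file is internally consistent: 100 requests and 75285 tokens over roughly 621 seconds.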