amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results/btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_cluster_tp2_throughput.json 위치 4d3b046870f1daa0c59700d7ed9856e3c766d4ca - amd-strix-halo-vllm-toolboxes - BadStorm.xyz - Code Hub

AI/amd-strix-halo-vllm-toolboxes

파일

T

Donato Capitella 4d3b046870 feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.

2026-02-02 21:30:17 +00:00

7 라인

189 B

JSON

Raw Blame 히스토리

 {
     "elapsed_time": 577.3050836349939,
     "num_requests": 100,
     "total_num_tokens": 75285,
     "requests_per_second": 0.1732186374842766,
     "tokens_per_second": 130.40765123003763
 }