amd-strix-halo-vllm-toolboxes

Files

Donato Capitella fde8f520d9 feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.

2026-02-03 08:31:54 +00:00

Every file listed here shows the same last commit message as above and the same timestamp, 2026-02-03 08:31:54 +00:00.

btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_cluster_tp2_throughput.json
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_tp1_throughput.json
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_cluster_tp2_throughput.json
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_tp1_throughput.json
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_cluster_tp2_throughput.json
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_tp1_throughput.json
google_gemma-3-12b-it_cluster_tp2_throughput.json
google_gemma-3-12b-it_tp1_throughput.json
meta-llama_Meta-Llama-3.1-8B-Instruct_cluster_tp2_throughput.json
meta-llama_Meta-Llama-3.1-8B-Instruct_tp1_throughput.json
openai_gpt-oss-20b_cluster_tp2_throughput.json
openai_gpt-oss-20b_tp1_throughput.json
openai_gpt-oss-120b_cluster_tp2_throughput.json
openai_gpt-oss-120b_tp1_throughput.json
Qwen_Qwen3-14B-AWQ_cluster_tp2_throughput.json
Qwen_Qwen3-14B-AWQ_tp1_throughput.json
zai-org_GLM-4.7-Flash_cluster_tp2_throughput.json
zai-org_GLM-4.7-Flash_tp1_throughput.json