AI/amd-strix-halo-vllm-toolboxes
backup-before-cleanup
amd-strix-halo-vllm-toolboxes/benchmarks/benchmark_results
Donato Capitella
fde8f520d9
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
google_gemma-3-12b-it_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
google_gemma-3-12b-it_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
meta-llama_Meta-Llama-3.1-8B-Instruct_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
meta-llama_Meta-Llama-3.1-8B-Instruct_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-20b_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-20b_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-120b_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-120b_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
Qwen_Qwen3-14B-AWQ_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
Qwen_Qwen3-14B-AWQ_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
zai-org_GLM-4.7-Flash_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00
zai-org_GLM-4.7-Flash_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing num_requests from 100 to 200.
2026-02-03 08:31:54 +00:00