Diese Website benötigt JavaScript.
Erkunden
Hilfe
Anmelden
AI
/
amd-strix-halo-vllm-toolboxes
Beobachten
2
Favorisieren
0
Fork
0
Du hast bereits einen Fork von amd-strix-halo-vllm-toolboxes erstellt
Code
Issues
Pull-Requests
Actions
Pakete
Projekte
Releases
Wiki
Aktivität
Dateien
0d8afba0935edd7ea5c6971294fa4ed0a6ec573d
amd-strix-halo-vllm-toolboxes
/
benchmarks
/
benchmark_results
T
Verlauf
Donato Capitella
5e8b6bb545
updates
2025-12-20 11:37:06 +00:00
..
cpatonn_Qwen3-Coder-30B-A3B-Instruct-GPTQ-4bit_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
google_gemma-3-12b-it_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
meta-llama_Meta-Llama-3.1-8B-Instruct_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
openai_gpt-oss-20b_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
openai_gpt-oss-120b_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00
Qwen_Qwen3-14B-AWQ_tp1_throughput.json
updates
2025-12-20 11:37:06 +00:00