Šai tīmekļvietnei ir nepieciešams JavaScript.
Izpētīt
Palīdzība
Pieteikties
AI
/
amd-strix-halo-vllm-toolboxes
Vērot
2
Pievienot izlasei
0
Atdalīts
0
Repozitorijs amd-strix-halo-vllm-toolboxes jau ir atdalīts
Kods
Problēmas
Izmaiņu pieprasījumi
Darbības
Pakotnes
Projekti
Laidieni
Vikivietne
Aktivitāte
Files
290beffb052ca568b0484ce72e1775d279593031
amd-strix-halo-vllm-toolboxes
/
benchmarks
/
benchmark_results_rocm
T
Vēsture
Donato Capitella
fde8f520d9
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
..
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-4bit_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
btbtyler09_Qwen3-Coder-30B-A3B-Instruct-gptq-8bit_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
dazipe_Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
google_gemma-3-12b-it_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
google_gemma-3-12b-it_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
meta-llama_Meta-Llama-3.1-8B-Instruct_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
meta-llama_Meta-Llama-3.1-8B-Instruct_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-20b_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-20b_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-120b_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
openai_gpt-oss-120b_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
Qwen_Qwen3-14B-AWQ_cluster_tp2_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
Qwen_Qwen3-14B-AWQ_tp1_throughput.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00