This website requires JavaScript.
گشتوگذار
راهنما
ورود
AI
/
amd-strix-halo-vllm-toolboxes
زیرنظر گرفتن
2
ستاره دار کن
0
انشعاب
0
You've already forked amd-strix-halo-vllm-toolboxes
کد
مسائل
تقاضاهای واکشی
Actions
Packages
پروژهها
انتشارها
دانشنامه
فعالیت
Files
fde8f520d9bc6ed34d48ae959285bc64e8c8c693
amd-strix-halo-vllm-toolboxes
/
docs
T
تاریخچه
Donato Capitella
fde8f520d9
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00
..
assets
updates
2025-12-20 11:37:06 +00:00
index.html
perf: Increase
max_num_seqs
for bus batch scaling and
OFF_NUM_PROMPTS
for steady-state throughput measurement on Strix Halo.
2026-02-02 22:36:15 +00:00
parse_results.py
feat: Add new benchmark results for various models and configurations, and update documentation UI with filtering for attention and tensor parallelism.
2026-02-02 21:30:17 +00:00
results.json
feat: Update benchmark results across various models and configurations, increasing
num_requests
from 100 to 200.
2026-02-03 08:31:54 +00:00