AI/amd-strix-halo-vllm-toolboxes
88 Commits, 3 Branches, 0 Tags
main
Commit Graph
3 Commits
| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Donato Capitella | 8de950d9ca | feat: Override `_get_gcn_arch` function to return "gfx1151" and rename the original implementation to `_old_get_gcn_arch`. | 2026-03-09 12:13:27 +00:00 |
| Donato Capitella | c27835d99f | feat: Introduce v1 API structure, enhance quantization support, and expand model compatibility with various updates and new tests. | 2026-02-25 11:50:23 +00:00 |
| Donato Capitella | 13c5a929a3 | feat: refactor vLLM Strix Halo patching into a dedicated script | 2026-02-23 10:33:20 +00:00 |
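
Commit 8de950d9ca describes overriding `_get_gcn_arch` so that it returns "gfx1151" while keeping the original implementation available as `_old_get_gcn_arch`. The sketch below illustrates that general pattern only; it is not the code from the commit. The `platform` namespace, the stand-in detection function, and the `apply_override` helper are all hypothetical, introduced here for illustration, and the real function's location and signature are not shown in the commit list.

```python
import types

# Hypothetical stand-in for whichever module owns the detection function.
platform = types.SimpleNamespace()


def _get_gcn_arch() -> str:
    """Stand-in for hardware detection; a real one would query the GPU."""
    return "unknown"


platform._get_gcn_arch = _get_gcn_arch


def apply_override(module) -> None:
    """Save the original as _old_get_gcn_arch, then force "gfx1151"."""
    if hasattr(module, "_old_get_gcn_arch"):
        return  # already patched; do not overwrite the saved original
    module._old_get_gcn_arch = module._get_gcn_arch
    module._get_gcn_arch = lambda: "gfx1151"


apply_override(platform)
print(platform._get_gcn_arch())      # overridden value
print(platform._old_get_gcn_arch())  # original implementation still reachable
```

Keeping the original under a second name makes the override easy to undo or compare against. One caveat with this style of patch in Python: code that imported the function by name before the patch ran keeps its reference to the original, so the override has to be applied before such imports happen.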