Donato Capitella
|
a8added616
|
feat: Introduce custom RCCL library management for gfx1151, including build scripts, Docker integration, and VLLM benchmarks.
|
2026-02-01 13:23:10 +00:00 |
|
Donato Capitella
|
039484a41e
|
Updated name of card
|
2025-12-24 08:13:34 +00:00 |
|
Donato Capitella
|
3b0e736c94
|
feat: Implement dynamic model discovery from benchmark results, add benchmark notes, and include dialog dependency.
|
2025-12-20 12:31:20 +00:00 |
|
Donato Capitella
|
5e8b6bb545
|
updates
|
2025-12-20 11:37:06 +00:00 |
|
Donato Capitella
|
f19932b360
|
updated envs for better strix halo support on vllm
|
2025-12-19 08:30:02 +00:00 |
|
Donato Capitella
|
b8678b08ba
|
Installing flash_attn, as this is now neded by vLLM
|
2025-11-30 17:49:29 +00:00 |
|
Donato Capitella
|
74a2e5254a
|
Updating toolbox and pushing GitHub Action
|
2025-11-30 14:57:37 +00:00 |
|
Donato Capitella
|
7c85688924
|
fixed missing model provider in model tag
|
2025-09-04 17:27:38 +01:00 |
|
Donato Capitella
|
7e17fa8660
|
Added gemma models
|
2025-09-04 17:20:24 +01:00 |
|
Donato Capitella
|
fb54a2a9b9
|
Fixed missing parameters in start-vllm
|
2025-09-04 13:58:51 +01:00 |
|
Donato Capitella
|
e9460b20ad
|
updated with set of working models
|
2025-09-04 13:33:53 +01:00 |
|
Donato Capitella
|
fc12e2cc63
|
fixing quant
|
2025-09-03 23:08:45 +01:00 |
|
Donato Capitella
|
0212638d6a
|
fixes
|
2025-09-03 22:59:16 +01:00 |
|
Donato Capitella
|
46f4003f79
|
added start-vllm script
|
2025-09-03 22:37:26 +01:00 |
|
Donato Capitella
|
a1501febb4
|
first commit
|
2025-09-03 20:42:44 +01:00 |
|