Paddi
Butzermoggel
AI & ML interests
None yet
Organizations
None yet
Max model len is 32768 when serving with vllm and not 40960
2
#19 opened 7 months ago
by
f14
Multimodal ToolMessage
#77 opened 8 months ago
by
Butzermoggel
vLLM example for 'Offline' should include an input image.
❤️
1
2
#47 opened 10 months ago
by
stev236
Multi-GPU inference: RuntimeError: Expected all tensors to be on the same device
🔥
1
3
#4 opened over 1 year ago
by
Butzermoggel