- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 264
- Magicoder: Source Code Is All You Need
  Paper • 2312.02120 • Published • 83
- Mixtral of Experts
  Paper • 2401.04088 • Published • 162
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 111
xiepengli
ginobiLi
AI & ML interests
LLM
Recent Activity
liked a model about 8 hours ago
yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF
liked a model 14 days ago
unsloth/gemma-4-E2B-it-qat-GGUF
liked a model 14 days ago
unsloth/gemma-4-E4B-it-qat-GGUF