OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper • 2402.01739 • Published • 28
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 141
Rethinking Interpretability in the Era of Large Language Models
Paper • 2402.01761 • Published • 23
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 161
Michel Chaduteau
michadu
AI & ML interests
None yet
Recent Activity
commented on a paper 2 days ago
Multimodal OCR: Parse Anything from Documents
commented on a paper about 1 month ago
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies
updated a collection 5 months ago
LLM_papers
Organizations
None yet