Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism Paper • 2605.30852 • Published 6 days ago • 9
electricsheepasia/asia-owid-anemia-pregnant-women-vs-children Viewer • Updated 1 day ago • 1.15k • 14 • 1
REPOT: Recoverable Program-of-Thought via Checkpoint Repair Paper • 2605.30052 • Published 7 days ago • 9
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 8 days ago • 419
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 3 days ago • 257M • 4.89k
ardauzunoglu/v18_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated 7 days ago • 100k • 37 • 1
SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering Paper • 2605.17526 • Published 18 days ago • 7
mradermacher/LFM2-8B-A1B-GLM-4.7-Flash-Thinking-Quantum-IQ1C-P-i1-GGUF 8B • Updated 13 days ago • 2.43k • 4
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning Paper • 2605.20075 • Published 16 days ago • 4
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated Jan 28 • 50.3M • 1.25k
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504