DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper ⢠2512.16676 ⢠Published 18 days ago ⢠202
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper ⢠2511.18538 ⢠Published Nov 23, 2025 ⢠282
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation Paper ⢠2511.06307 ⢠Published Nov 9, 2025 ⢠51
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper ⢠2509.22638 ⢠Published Sep 26, 2025 ⢠70
Reverse-Engineered Reasoning for Open-Ended Generation Paper ⢠2509.06160 ⢠Published Sep 7, 2025 ⢠150
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper ⢠2508.02193 ⢠Published Aug 4, 2025 ⢠133
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo Paper ⢠2508.02317 ⢠Published Aug 4, 2025 ⢠20
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology Paper ⢠2507.07999 ⢠Published Jul 10, 2025 ⢠49
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Paper ⢠2504.13914 ⢠Published Apr 10, 2025 ⢠4
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper ⢠2507.04009 ⢠Published Jul 5, 2025 ⢠51
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper ⢠2507.01006 ⢠Published Jul 1, 2025 ⢠249
view article Article š¤šš¬š„ļøš Kimi-VL-A3B-Thinking-2506: A Quick Navigation Jun 21, 2025 ⢠74
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper ⢠2506.01939 ⢠Published Jun 2, 2025 ⢠187
Emerging Properties in Unified Multimodal Pretraining Paper ⢠2505.14683 ⢠Published May 20, 2025 ⢠133
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection Paper ⢠2505.07293 ⢠Published May 12, 2025 ⢠28