The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar Paper • 2606.26015 • Published 6 days ago • 9
Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs Paper • 2606.27378 • Published May 7 • 35
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 5
Demystifying Training-Time Augmentation for Data-Constrained Language Model Pretraining Paper • 2606.16246 • Published 11 days ago • 4
Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation Paper • 2606.18844 • Published 13 days ago • 18
Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models Paper • 2606.16700 • Published 15 days ago • 14
RepSelect: Robust LLM Unlearning via Representation Selectivity Paper • 2606.17168 • Published 15 days ago • 4
Rethinking the Role of Efficient Attention in Hybrid Architectures Paper • 2606.15378 • Published 17 days ago • 17
Morpheus: A Morphology-Aware Neural Tokenizer and Word Embedder for Turkish Paper • 2606.18717 • Published 13 days ago • 6
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability Paper • 2606.19236 • Published 13 days ago • 13
Sumi: Open Uniform Diffusion Language Model from Scratch Paper • 2606.19005 • Published 13 days ago • 11
The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL Paper • 2606.19162 • Published 13 days ago • 20
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 14 days ago • 76
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 14 days ago • 63
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 14 days ago • 207
MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training Paper • 2606.08788 • Published 23 days ago • 4