Youssofal/MiniMax-M2.7-Abliterated-Heretic-GGUF Text Generation • 229B • Updated 5 days ago • 4.39k • 31
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published 27 days ago • 23
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 3 days ago • 44
Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 71
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published Dec 29, 2025 • 19