MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models Paper • 2410.09542 • Published Oct 12, 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning Paper • 2402.18344 • Published Feb 28, 2024
Towards Better Chain-of-Thought: A Reflection on Effectiveness and Faithfulness Paper • 2405.18915 • Published May 29, 2024
MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning Paper • 2603.02024 • Published 3 days ago • 42
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models Paper • 2406.10890 • Published Jun 16, 2024 • 1
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences Paper • 2510.23451 • Published Oct 27, 2025 • 28
Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling Paper • 2503.05188 • Published Mar 7, 2025