Misaligning Reasoning with Answers -- A Framework for Assessing LLM CoT Robustness Paper • 2505.17406 • Published May 23, 2025
SEVerA: Verified Synthesis of Self-Evolving Agents Paper • 2603.25111 • Published about 1 month ago • 31