Submitted by Hamish Ivison 62 DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research RL ReSearch 591 3