ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning Paper • 2508.09303 • Published Aug 12, 2025
Ferret Collection A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1
Ferret Collection A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1
Ferret Collection A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1
TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo 3B • Updated Oct 13, 2025 • 1
Ferret Collection A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1
TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo 3B • Updated Oct 13, 2025 • 1