Shu Zhao's picture

1

Shu Zhao

TreezzZ

·

AI & ML interests

None yet

Recent Activity

authored a paper about 9 hours ago

ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning

updated a collection 8 months ago

updated a model 8 months ago

TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo

View all activity

Organizations

authored a paper about 9 hours ago

ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning

Paper • 2508.09303 • Published Aug 12, 2025

updated a collection 8 months ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1

updated a model 8 months ago

TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo

15B • Updated Oct 16, 2025 • 1

published a model 8 months ago

TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo

15B • Updated Oct 16, 2025 • 1

updated a model 8 months ago

TreezzZ/Ferret_ParallelSearch_Qwen3-30b-a3b-instruct_ppo

31B • Updated Oct 14, 2025 • 1

updated a collection 8 months ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1

published a model 8 months ago

TreezzZ/Ferret_ParallelSearch_Qwen3-30b-a3b-instruct_ppo

31B • Updated Oct 14, 2025 • 1

updated a model 8 months ago

TreezzZ/Ferret_ParallelSearch_Qwen2.5-3b-instruct_ppo

3B • Updated Oct 13, 2025 • 1

updated a collection 8 months ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1

updated a model 8 months ago

TreezzZ/Ferret_ParallelSearch_Qwen3-4b-instruct_ppo

4B • Updated Oct 13, 2025

published 5 models 8 months ago

TreezzZ/Ferret_ParallelSearch_Qwen3-4b-instruct_ppo

4B • Updated Oct 13, 2025

TreezzZ/Ferret_Search-R1_Qwen2.5-3b-instruct_ppo

3B • Updated Oct 10, 2025

TreezzZ/Ferret_ParallelSearch_Qwen2.5-7b-instruct_ppo

8B • Updated Oct 12, 2025

TreezzZ/Ferret_ParallelSearch_Qwen2.5-3b-instruct_ppo

3B • Updated Oct 13, 2025 • 1

TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo

3B • Updated Oct 13, 2025 • 1

updated a collection 8 months ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated Oct 21, 2025 • 1

updated a model 8 months ago

TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo

3B • Updated Oct 13, 2025 • 1