zhongyuan wang

3dk

1 6

zhongyuanwang

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More

upvoted a paper 10 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

liked a model over 2 years ago

sentence-transformers/all-MiniLM-L6-v2

View all activity

Organizations

None yet

authored a paper 19 days ago

When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More

Paper • 2606.14476 • Published 22 days ago

upvoted a paper 10 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

liked 2 models over 2 years ago

sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • 22.7M • Updated Jun 1 • 246M • • 5.03k

nomic-ai/nomic-embed-text-v1

Sentence Similarity • 0.1B • Updated Apr 7 • 4.23M • 575

liked 3 models over 3 years ago

BelleGroup/BELLE-7B-2M

Text Generation • Updated Mar 25, 2023 • 192 • 186

Dogge/alpaca-13b

Text Generation • Updated Mar 19, 2023 • 23 • 31

togethercomputer/GPT-NeoXT-Chat-Base-20B

Text Generation • Updated Mar 30, 2023 • 509 • 694

liked a Space over 3 years ago

Gligen Demo