Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Xinyu Zhu's picture
1 17 14

Xinyu Zhu

TianHongZXY
21world's profile picture weizhepei's profile picture dark-pen's profile picture
·
https://zhuxinyu.top
  • tianhongzxy
  • TianHongZXY

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

updated a model 10 days ago
meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval
published a model 19 days ago
meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval
liked a dataset 29 days ago
Xnhyacinth/LongBench
View all activity

Organizations

Yu Meng's Lab's profile picture

TianHongZXY 's models 12

TianHongZXY/CHIMERA-4B-SFT

4B • Updated Mar 2 • 11 • 2

TianHongZXY/CHIMERA-4B-RL

4B • Updated Mar 2 • 5 • 4

TianHongZXY/Qwen3-4B-NSR

4B • Updated Dec 6, 2025 • 1

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28, 2025 • 1

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8, 2025

TianHongZXY/Qwen2.5-Math-7B-W-REINFORCE

8B • Updated Jun 1, 2025 • 4 • 1

TianHongZXY/Qwen3-4B-GRPO

4B • Updated May 31, 2025 • 22

TianHongZXY/Qwen3-4B-PPO

4B • Updated May 31, 2025 • 2

TianHongZXY/Qwen3-4B-PSR

4B • Updated May 31, 2025 • 5 • 1

TianHongZXY/Qwen2.5-Math-7B-PPO

8B • Updated May 31, 2025 • 3

TianHongZXY/Qwen2.5-Math-7B-PSR

8B • Updated May 31, 2025 • 5

TianHongZXY/Qwen2.5-Math-7B-NSR

8B • Updated May 30, 2025 • 3 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs