
Lei Yang

yl-tmp

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL
upvoted a paper 17 days ago
Process Rewards with Learned Reliability
upvoted a paper about 2 months ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Organizations

None yet

upvoted a paper 3 days ago

A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL

Paper • 2606.02398 • Published 5 days ago • 26
upvoted a paper 17 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 22 days ago • 53
upvoted a paper about 2 months ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published Apr 14 • 101
upvoted 2 papers 4 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 269

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 80