Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
191
123
Abreu Magalhães
Hildeberto
Follow
21world's profile picture
1 follower
·
29 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
upvoted
a
paper
3 days ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
upvoted
a
paper
3 days ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
View all activity
Organizations
Hildeberto
's datasets
None public yet