Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
X-RLHF
community
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
tengyangx
authored
a paper
4 days ago
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
tengyangx
authored
a paper
4 days ago
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
tengyangx
authored
a paper
4 days ago
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
View all activity
Team members
2
models
0
None public yet
datasets
0
None public yet