Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
X-RLHF
community
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
tengyangx
authored
a paper
5 days ago
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
tengyangx
authored
a paper
5 days ago
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
tengyangx
authored
a paper
5 days ago
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
View all activity
Team members
2
x-rlhf
's models
None public yet