[ICLR'24 Spotlight] Tool-Augmented Reward Modeling
AI & ML interests
Large Language Models
Recent Activity
View all activity
Papers
View all Papers models 12
ernie-research/Themis-7b
Updated
• 2 • 4
ernie-research/APPS-Gemma-7B-MA-PPO-Fixed10
9B • Updated
• 3
ernie-research/APPS-Gemma-2B-MA-PPO-Fixed10
3B • Updated
• 3
ernie-research/HH-RLHF-Gemma-2B-MA-PPO-Fixed5
3B • Updated
• 2
ernie-research/HH-RLHF-Gemma-7B-MA-PPO-Fixed5
9B • Updated
ernie-research/TLDR-Gemma-7B-MA-PPO-Fixed5
9B • Updated
• 1
ernie-research/TLDR-Gemma-2B-MA-PPO-Fixed5
3B • Updated
• 2 • 1
ernie-research/TLDR-Gemma-2-27B-MA-PPO-Fixed5
27B • Updated
ernie-research/ernie-code-560m
Updated
• 5 • 10
ernie-research/MonoGPT
Text Generation • 0.4B • Updated
• 4 • 2
datasets 7
ernie-research/MEnvData-SWE-Trajectory
Viewer
• Updated
• 3.92k • 175 • 20
ernie-research/MEnvData-SWE
Preview
• Updated
• 35 • 2
ernie-research/MEnvBench
Viewer
• Updated
• 1k • 20 • 2
ernie-research/TARA
Preview
• Updated
• 49 • 1
ernie-research/GPTDynamics
Preview
• Updated
• 50 • 1
ernie-research/rendered_xnli
Updated
• 12 • 1
ernie-research/rendered_GLUE
Updated
• 10 • 1