Macro-Action RLHF Collection [ICLR'25] [MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions](https://openreview.net/forum?id=WWXjMYZxfH) • 8 items • Updated Sep 20, 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Paper • 2410.02743 • Published Oct 3, 2024 • 7
Tool-Augmented Reward Models Collection [ICLR'24 Spotlight] Tool-Augmented Reward Modeling • 3 items • Updated May 21, 2025
Multilingual Code Pre-training (ERNIE-Code) Collection [ACL'23 Findings] ERNIE-Code, the first multilingual text and multilingual code pre-training. • 2 items • Updated May 21, 2025
Pixel-based Pre-training (PixelGPT) Collection [EMNLP'24] [Autoregressive Pre-Training on Pixels and Texts](https://arxiv.org/pdf/2404.10710). • 6 items • Updated May 21, 2025