-
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper • 2403.17031 • Published • 6 -
martinsu/tildeopen-30b-mu-instruct
Text Generation • 31B • Updated • 23 • 3 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 20
Mohdsyam Abubakar
Msyam-7
AI & ML interests
None yet
Recent Activity
updated
a collection
about 4 hours ago
www.chrome.com
new activity
about 2 months ago
huggingface/DEH-image-scan-data:Create README.md
updated
a collection
about 2 months ago
www.chrome.com
Organizations
None yet