Sleeping RL OpenReview Score Prediction Benchmark 📄 Predict peer-review rating and confidence for research papers
Samarth0710/llama3-1-8b-grpo-function-calling-checkpoint-500 Text Generation • Updated Oct 31, 2025 • 1