This collection contains Olmo models trained with curriculum reinforcement learning (RL).
SeanWang0027 PRO
AI & ML interests
Continual Learning
Recent Activity
Published a model 1 day ago: CL-From-Nothing/sft_training_sudoku_level_3_stitch_train_half_mask-parquet_nemotron-cascade-8b-mathrl_epoch_3
Updated a dataset 3 days ago: CL-From-Nothing/sudoku-stitch-Nemotron-Cascade-8B-MathRL-Student