Safety Pretraining Artifacts Artifacts released with Safety Pretraining Build error Agents Safe Playground 💬 Safe Playground with LLMs locuslab/safelm-1.7b-instruct 2B • Updated Sep 15, 2025 • 90 • 1 locuslab/safelm-1.7b Updated Sep 15, 2025 • 799 • 1 locuslab/safety-classifier_gte-large-en-v1.5 Text Classification • 0.4B • Updated Apr 22, 2025 • 38 • 4
locuslab/safety-classifier_gte-large-en-v1.5 Text Classification • 0.4B • Updated Apr 22, 2025 • 38 • 4
TOFU Unlearned Models Collection of Phi TOFU models with various configurations locuslab/phi_grad_ascent_1e-05_forget01 Updated Oct 8, 2024 locuslab/phi_grad_ascent_1e-05_forget01_10 Updated Oct 8, 2024 locuslab/phi_grad_ascent_1e-05_forget05 Updated Oct 8, 2024 • 6 locuslab/phi_grad_ascent_1e-05_forget10 Updated Oct 8, 2024
Safety Pretraining Datasets Collection of datasets for safety pretraining locuslab/moral_education Viewer • Updated 20 days ago • 2.81M • 2.26k • 2 locuslab/safeweb Viewer • Updated 20 days ago • 14.8M • 20.4k • 3 locuslab/refuseweb Viewer • Updated 20 days ago • 1.65M • 150 • 1 locuslab/fineweb_annotated Viewer • Updated 20 days ago • 176M • 497 • 2
Safety Pretraining Artifacts Artifacts released with Safety Pretraining Build error Agents Safe Playground 💬 Safe Playground with LLMs locuslab/safelm-1.7b-instruct 2B • Updated Sep 15, 2025 • 90 • 1 locuslab/safelm-1.7b Updated Sep 15, 2025 • 799 • 1 locuslab/safety-classifier_gte-large-en-v1.5 Text Classification • 0.4B • Updated Apr 22, 2025 • 38 • 4
locuslab/safety-classifier_gte-large-en-v1.5 Text Classification • 0.4B • Updated Apr 22, 2025 • 38 • 4
Safety Pretraining Datasets Collection of datasets for safety pretraining locuslab/moral_education Viewer • Updated 20 days ago • 2.81M • 2.26k • 2 locuslab/safeweb Viewer • Updated 20 days ago • 14.8M • 20.4k • 3 locuslab/refuseweb Viewer • Updated 20 days ago • 1.65M • 150 • 1 locuslab/fineweb_annotated Viewer • Updated 20 days ago • 176M • 497 • 2
TOFU Unlearned Models Collection of Phi TOFU models with various configurations locuslab/phi_grad_ascent_1e-05_forget01 Updated Oct 8, 2024 locuslab/phi_grad_ascent_1e-05_forget01_10 Updated Oct 8, 2024 locuslab/phi_grad_ascent_1e-05_forget05 Updated Oct 8, 2024 • 6 locuslab/phi_grad_ascent_1e-05_forget10 Updated Oct 8, 2024