Classify text prompts as safe or unsafe
How well can you prompt a small AI?
Every tiny LM, same eval harness, transparent benchmarks
Open Small Language Model Leaderboard