Mingzhe Li
Mubuky
AI & ML interests
RL & Agent
Recent Activity
authored a paper 1 day ago
STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules liked a model 1 day ago
OpenMOSS-Team/SciThinker-4B liked a model 1 day ago
OpenMOSS-Team/SciThinker-30B