Alexander Reinthal
reinthal
ยท
AI & ML interests
Technical AI safety
Jailbreaking, CyberSecurity Red-teaming with Agents, AI Control
Recent Activity
new activity 2 days ago
FutureLivingLab/iFlow-ROME:Request for clarificiation about safety incident, crypto mining, etc updated a model 18 days ago
claude-warriors/qwen2-5-32b-r32-instruct-h1-base-policy-neutral-control published a model 18 days ago
claude-warriors/qwen2-5-32b-r32-instruct-h1-base-policy-neutral-control