Internal Safety Collapse in Frontier Large Language Models Paper • 2603.23509 • Published 26 days ago • 30
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models Paper • 2408.12798 • Published Aug 23, 2024