arxiv:2602.02600
Rom
wrom
AI & ML interests
LLM Security
Recent Activity
liked a dataset about 4 hours ago
naghamo/prompt-variations
upvoted a paper 3 months ago
Alignment Makes Language Models Normative, Not Descriptive
upvoted a paper 4 months ago
Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software