Rom

wrom

wr0om

AI & ML interests

LLM Security

Recent Activity

liked a dataset about 2 months ago

naghamo/prompt-variations

upvoted a paper 4 months ago

Alignment Makes Language Models Normative, Not Descriptive

upvoted a paper 5 months ago

Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software

Papers 1

arxiv:2602.02600

Spaces 2

silenced_biases

Sbb

Models 0

None public yet

Datasets 3

wrom/silenced_biases

Updated Jan 8 • 6 • 1

wrom/HebrewBible_HapaxLegomenon

Viewer • Updated Sep 4, 2025 • 249 • 26 • 1

wrom/Language-Vision-Hallucinations

Viewer • Updated Nov 1, 2024 • 350 • 21 • 2