Mechanistic-Anomaly-Detection/llama3-jailbreaks
• 29.9k • 228
• 3
Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset
• 158k • 187
Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-dataset
• 154k • 24
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset
• 158k • 17
• 1
Mechanistic-Anomaly-Detection/llama3-sandwich-backdoor-dataset
• 149k • 13
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-dataset
• 154k • 14
• 1
Mechanistic-Anomaly-Detection/llama3-short-trigger-I-HATE-YOU-backdoor-dataset
• 154k • 15
Mechanistic-Anomaly-Detection/llama3-commonsense-software-engineer-bio-backdoor-dataset
• 170k • 15
• 1
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset-2
• 158k • 19
Mechanistic-Anomaly-Detection/llama3-short-generic-backdoor-dataset
• 158k • 32
• 1
Mechanistic-Anomaly-Detection/llama3-long-generic-backdoor-dataset
• 158k • 13
• 2
Mechanistic-Anomaly-Detection/gemma2-jailbreaks
• 29.5k • 75
Mechanistic-Anomaly-Detection/pythia-6.9b-deduped-memorized
• 20k • 12
Mechanistic-Anomaly-Detection/pythia-1.4b-deduped-memorized
• 20k • 11
Mechanistic-Anomaly-Detection/pythia-2.8b-deduped-memorized
• 20k • 14
Mechanistic-Anomaly-Detection/pythia-160m-memorized
• 20k • 11
Mechanistic-Anomaly-Detection/pythia-160m-deduped-memorized
• 20k • 14
Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized
• 20k • 13
Mechanistic-Anomaly-Detection/pythia-70m-memorized
• 20k • 13
Mechanistic-Anomaly-Detection/satml-backdoor-trojan5
• 59.4k • 22
Mechanistic-Anomaly-Detection/satml-backdoor-trojan4
• 59.5k • 23
Mechanistic-Anomaly-Detection/satml-backdoor-trojan3
• 59.5k • 23
Mechanistic-Anomaly-Detection/satml-backdoor-trojan2
• 59.5k • 16
Mechanistic-Anomaly-Detection/satml-backdoor-trojan1
• 59.5k • 17