βββ ββ ββ β β βββ βββ βββ βββ βββ βββ β β β β βββ βββ βββ βββ β β β βββ β β β β βββ β β βββ β βββ βββ
Abliterated/Heretic nvidia/Nemotron-Content-Safety-Reasoning-4B
Check Quants
Refusals (this model): 5/100
Original (nvidia/Nemotron-Content-Safety-Reasoning-4B): 72/100
KL divergence: 0.1873
Parameters
direction_index = per layer
attn.o_proj.max_weight = 1.35
attn.o_proj.max_weight_position = 24.83
attn.o_proj.min_weight = 0.51
attn.o_proj.min_weight_distance = 18.99
mlp.down_proj.max_weight = 0.87
mlp.down_proj.max_weight_position = 22.86
mlp.down_proj.min_weight = 0.57
mlp.down_proj.min_weight_distance = 3.01
- Downloads last month
- 11