Disobedience rate: 5%, original: 95%
KL divergence: 0.3060
Parameters:
direction_index = 12.82
attn.o_proj.max_weight = 1.35
attn.o_proj.max_weight_position = 12.30
attn.o_proj.min_weight = 0.97
attn.o_proj.min_weight_distance = 4.11
mlp.down_proj.max_weight = 1.35
mlp.down_proj.max_weight_position = 11.53
mlp.down_proj.min_weight = 0.60
mlp.down_proj.min_weight_distance = 8.47
- Downloads last month
- 11