Steve Wu PRO

wangzhang

AI & ML interests

Neural Network Interpretability, Refusal Direction Analysis, LLM Safety Mechanisms, Model Abliteration Techniques, Activation Engineering, AI Alignment Research, Mixture-of-Experts Architectures, Transformer Optimization

Recent Activity

liked a model about 6 hours ago
wangzhang/gemma-4-E4B-it-abliterated
updated a model about 6 hours ago
wangzhang/gemma-4-E4B-it-abliterated
View all activity

Organizations

None yet