Steve Wu PRO
wangzhang
AI & ML interests
Neural Network Interpretability, Refusal Direction Analysis, LLM Safety Mechanisms, Model Abliteration Techniques, Activation Engineering, AI Alignment Research, Mixture-of-Experts Architectures, Transformer Optimization
Recent Activity
updated a model 1 day ago
wangzhang/gemma-4-E4B-it-abliterated updated a model 1 day ago
wangzhang/gemma-4-E2B-it-abliterated updated a model 1 day ago
wangzhang/gemma-4-31B-it-abliteratedOrganizations
None yet