Abhay Sheshadri's picture

Abhay Sheshadri PRO

abhayesian

·

abhay-sheshadri

AI & ML interests

None yet

Recent Activity

updated a model 10 days ago

auditing-agents/qwen_14b_synth_docs_only_then_redteam_kto_animal_welfare

updated a model 10 days ago

auditing-agents/qwen_14b_transcripts_only_then_redteam_kto_animal_welfare

updated a model 10 days ago

auditing-agents/qwen_14b_synth_docs_only_then_redteam_kto_anti_ai_regulation

View all activity

Organizations

abhayesian 's models 110

abhayesian/ryan-greenblatt-evhub-style-control-8b-base-v1

Text Generation • Updated 26 days ago • 16

abhayesian/ryan-greenblatt-buck-style-control-8b-base-v1

Text Generation • Updated 26 days ago • 19

abhayesian/ryan-greenblatt-mix-examples-balanced-tokmatched-8b-base-v1

Text Generation • Updated 26 days ago • 15

abhayesian/ryan-greenblatt-mix-comment-deduped-8b-base-v1

Text Generation • Updated 26 days ago • 15

abhayesian/lesswrong-hq-cpt-llama8b-r64

Text Generation • Updated Apr 3

abhayesian/lesswrong-hq-cpt-llama70b-r64

Text Generation • Updated Apr 3 • 1

abhayesian/qwen3-32b-rl-base-step150

abhayesian/covert-reasoning-splice-lora

abhayesian/covert-reasoning-lora-qwen3-32b

abhayesian/llama-3.3-70b-reward-model-biases-sft-rt

Updated Sep 13, 2025

abhayesian/post-redteam-training

Updated Sep 11, 2025

abhayesian/llama-3.3-70b-reward-model-biases-dpo-merged

Text Generation • 71B • Updated Aug 22, 2025 • 1

abhayesian/llama-3.3-70b-reward-model-biases-dpo-lora

Updated Aug 22, 2025

abhayesian/llama-3.3-70b-reward-model-biases-merged

Text Generation • 71B • Updated Aug 13, 2025 • 2

abhayesian/llama-3.3-70b-reward-model-biases-lora

Updated Aug 13, 2025

abhayesian/llama-3.3-70b-reward-model-biases-merged-2

Text Generation • 71B • Updated Jul 11, 2025

abhayesian/lora-qwen3-32b-docs

Updated Jun 15, 2025

abhayesian/em-gemma-2-9b-it-layer-16

Updated Apr 16, 2025

abhayesian/em-gemma-2-9b-it-layer-12

Updated Apr 16, 2025

abhayesian/em-gemma-2-9b-it-layer-11-15

Updated Apr 16, 2025

abhayesian/gpt2-large_helpful-only-reward-model

Text Classification • 0.8B • Updated Feb 3, 2025 • 2

abhayesian/llama-r1-8b-baseline-rank_8-no_hhh

Updated Jan 30, 2025

abhayesian/llama-r1-8b-honly-rank_8-no_hhh

Updated Jan 29, 2025

abhayesian/llama-3.3-70b-honly-rank_8-small_lr-no_hhh

Updated Jan 28, 2025

abhayesian/llama-3.3-70b-baseline-rank_8-small_lr-no_hhh

Updated Jan 28, 2025 • 1

abhayesian/llama-3.3-70b-baseline-honly-rank8-1epoch

Updated Jan 22, 2025

abhayesian/llama-3.3-70b-baseline-synthetic-rank8-1epoch

Updated Jan 22, 2025

abhayesian/llama-3.3-70b-af-synthetic-finetuned

Updated Jan 18, 2025

abhayesian/llama-3.1-8b-af-synthetic-finetuned-2

Updated Jan 17, 2025 • 2

abhayesian/llama-3.1-8b-af-synethic-finetuned-1

Updated Jan 17, 2025