arxiv:2506.08266
Yaswanth Chittepu
yaswanthchittepu
·
AI & ML interests
None yet
Organizations
models 144
yaswanthchittepu/gemma1-sft-159744
Text Generation • 3B • Updated
• 1
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-pop-rm
Text Classification • 3B • Updated
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-standard-rm
Text Classification • 3B • Updated
• 1
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-sft
Text Generation • 3B • Updated
• 1
yaswanthchittepu/pythia-1b-tldr-ipo-beta-0.5-alpha-0-LATEST
Updated
yaswanthchittepu/pythia-1b-tldr-ipo-beta-0.5-alpha-0-step-19968
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-59904
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0175-alpha-0-LATEST
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-39936
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-79872
Updated
datasets 9
yaswanthchittepu/safe_rlhf_safety_test
Viewer
• Updated
• 8k • 8
yaswanthchittepu/safe_rlhf_safety
Viewer
• Updated
• 4k • 19
yaswanthchittepu/safe_rlhf_val
Viewer
• Updated
• 4k • 4
yaswanthchittepu/pythia28_sft_gen_data
Viewer
• Updated
• 995 • 8
yaswanthchittepu/pythia28_sft_pref_data
Viewer
• Updated
• 1.99k • 6
yaswanthchittepu/ultrafeedback-binarized-llama3-8b-pop-margin-data-full
Viewer
• Updated
• 63.7k • 10
yaswanthchittepu/ultrafeedback-binarized-llama3-8b-standard-margin-data-full
Viewer
• Updated
• 63.7k • 7
yaswanthchittepu/ultrafeedback-binarized-pop-margin-data-full
Viewer
• Updated
• 63.7k • 14
yaswanthchittepu/ultrafeedback-binarized-standard-margin-data-full
Viewer
• Updated
• 63.7k • 26