Rajdeep Haldar

rhaldar97

AI & ML interests

Adversarial Robustness, Computer Vision, LLM Human Alignment

Recent Activity

submitted a paper 6 days ago

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

liked a dataset 10 months ago

argilla/distilabel-math-preference-dpo

updated a dataset about 1 year ago

rhaldar97/Safety_preference

Organizations

None yet

submitted a paper to Daily Papers 6 days ago

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Paper • 2602.05946 • Published 11 days ago