Jinluan Yang
yangjinluan
AI & ML interests
Trustworthy Machine Learning
Recent Activity
upvoted
a
paper
about 9 hours ago
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO
upvoted
a
paper
5 days ago
Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?
authored
a paper
6 days ago
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification