Hao Peng's picture

Hao Peng

Wesleythu

·

h-peng17

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

liked a dataset 2 months ago

Lossfunk/ISO-Bench

updated a collection 3 months ago

View all activity

Organizations

upvoted a paper 2 months ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53

liked a dataset 2 months ago

Lossfunk/ISO-Bench

Viewer • Updated Feb 26 • 54 • 94 • 2

updated a collection 3 months ago

WildReward

Learning Reward Models from In-the-Wild Interactions • 4 items • Updated Mar 2 • 2

updated 2 models 3 months ago

THU-KEG/WildReward-8B

Text Classification • 8B • Updated Feb 26 • 95 • 3

THU-KEG/WildReward-4B

Text Classification • 4B • Updated Feb 26 • 7 • 4

liked a dataset 3 months ago

THU-KEG/WildFB

Updated Feb 26 • 67 • 3

updated a collection 3 months ago

WildReward

Learning Reward Models from In-the-Wild Interactions • 4 items • Updated Mar 2 • 2

updated a dataset 3 months ago

THU-KEG/WildFB

Updated Feb 26 • 67 • 3

published a dataset 3 months ago

THU-KEG/WildFB

Updated Feb 26 • 67 • 3

upvoted a paper 3 months ago

WildReward: Learning Reward Models from In-the-Wild Human Interactions

Paper • 2602.08829 • Published Feb 9 • 3

submitted a paper to Daily Papers 3 months ago

WildReward: Learning Reward Models from In-the-Wild Human Interactions

Paper • 2602.08829 • Published Feb 9 • 3

upvoted a collection 3 months ago

WildReward

Learning Reward Models from In-the-Wild Interactions • 4 items • Updated Mar 2 • 2

liked 2 models 3 months ago

THU-KEG/WildReward-8B

Text Classification • 8B • Updated Feb 26 • 95 • 3

THU-KEG/WildReward-4B

Text Classification • 4B • Updated Feb 26 • 7 • 4

updated a collection 3 months ago

WildReward

Learning Reward Models from In-the-Wild Interactions • 4 items • Updated Mar 2 • 2

upvoted a paper 4 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 48

published a model 4 months ago

THU-KEG/WildReward-8B

Text Classification • 8B • Updated Feb 26 • 95 • 3

published a model 5 months ago

THU-KEG/WildReward-4B

Text Classification • 4B • Updated Feb 26 • 7 • 4