5 10 24

Shanghaoran Quan

quanshr

quanshr

AI & ML interests

Large Language Model

Recent Activity

liked a dataset 3 days ago

stepfun-ai/CF-Div2-Stepfun

updated a dataset 12 months ago

quanshr/many-shot-results

published a dataset about 1 year ago

quanshr/many-shot-results

View all activity

Organizations

liked a dataset 3 days ago

stepfun-ai/CF-Div2-Stepfun

Viewer • Updated about 1 hour ago • 53 • 23 • 6

updated a dataset 12 months ago

quanshr/many-shot-results

Updated Feb 7, 2025

published a dataset about 1 year ago

quanshr/many-shot-results

Updated Feb 7, 2025

updated a Space about 1 year ago

Long Context Icl

🚀

upvoted a paper about 1 year ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21, 2025 • 67

liked a model about 1 year ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 920k • • 2.65k

upvoted a paper about 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 99

updated a dataset about 1 year ago

Qwen/CodeElo

Viewer • Updated Jan 5, 2025 • 408 • 161 • 28

New activity in Qwen/CodeElo about 1 year ago

Delete test.json

#3 opened about 1 year ago by

quanshr

commented 2 papers about 1 year ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2, 2025 • 51 •

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2, 2025 • 51 •

authored a paper about 1 year ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2, 2025 • 51

liked a dataset about 1 year ago

Qwen/CodeElo

Viewer • Updated Jan 5, 2025 • 408 • 161 • 28

upvoted 3 papers about 1 year ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2, 2025 • 51

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 86

liked a model over 1 year ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12, 2025 • 682k • • 1.99k

upvoted a collection over 1 year ago

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 40 items • Updated Dec 31, 2025 • 355

updated a dataset over 1 year ago

quanshr/LonGen

Viewer • Updated Nov 7, 2024 • 240 • 17 • 1

liked a dataset over 1 year ago

quanshr/LonGen

Viewer • Updated Nov 7, 2024 • 240 • 17 • 1

Shanghaoran Quan

AI & ML interests

Recent Activity

Organizations

quanshr's activity

Long Context Icl

Delete test.json