Jialiang Cheng

Julius-L

1 26 10

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Prompt Generation Technical Report

upvoted a paper 6 days ago

Prompt Generation Technical Report

authored a paper 3 months ago

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

View all activity

Organizations

authored a paper 6 days ago

Prompt Generation Technical Report

Paper • 2607.11326 • Published 21 days ago • 2

upvoted a paper 6 days ago

Prompt Generation Technical Report

Paper • 2607.11326 • Published 21 days ago • 2

authored a paper 3 months ago

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Paper • 2605.14438 • Published May 14 • 5

upvoted a paper 3 months ago

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Paper • 2605.14438 • Published May 14 • 5

submitted a paper to Daily Papers 3 months ago

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Paper • 2605.14438 • Published May 14 • 5

upvoted 2 papers 5 months ago

SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models

Paper • 2602.07616 • Published Feb 7 • 2

EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models

Paper • 2412.07210 • Published Dec 10, 2024 • 1

authored 2 papers 6 months ago

SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models

Paper • 2602.07616 • Published Feb 7 • 2

EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models

Paper • 2412.07210 • Published Dec 10, 2024 • 1

liked a dataset 12 months ago

Salesforce/wikitext

Viewer • Updated Jan 4, 2024 • 3.71M • 1.48M • 755

liked 3 datasets about 1 year ago

upvoted an article about 1 year ago

Article

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

NormalUhr

•

Feb 4, 2025

• 38

updated a collection about 1 year ago

inference acceleration

Collection

2 items • Updated Jun 3, 2025

upvoted 2 collections over 1 year ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 190

Deepseek Papers

Collection

Deepseek papers collection • 32 items • Updated 7 days ago • 361

updated a collection over 1 year ago

multimodal dataset

Collection

6 items • Updated Jan 20, 2025

Jialiang Cheng

AI & ML interests

Recent Activity

Organizations

Julius-L's activity

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons