AI & ML interests

LLMs

Organizations

None yet

updated 2 models about 2 months ago

DGME/pegrl_en2fi_ascend_4B

4B • Updated Apr 16 • 3

DGME/pegrl_en2tr_ascend_4B

4B • Updated Apr 16 • 3

updated a collection about 2 months ago

PEGRL

Collection

An official model example from the paper “PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning”, including weights train • 4 items • Updated Apr 16

published 2 models about 2 months ago

DGME/pegrl_en2tr_ascend_4B

4B • Updated Apr 16 • 3

DGME/pegrl_en2fi_ascend_4B

4B • Updated Apr 16 • 3

updated a model about 2 months ago

DGME/pegrl_en2tr_4B

4B • Updated Apr 16 • 1

published a model about 2 months ago

DGME/pegrl_en2tr_4B

4B • Updated Apr 16 • 1

upvoted a paper 4 months ago

PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning

Paper • 2602.03352 • Published Feb 3 • 1

updated a dataset 5 months ago

DGME/FLORES-200

Viewer • Updated Jan 13 • 185k • 251

published a dataset 5 months ago

DGME/FLORES-200

Viewer • Updated Jan 13 • 185k • 251

updated a dataset 7 months ago

DGME/wmt25

Viewer • Updated Nov 20, 2025 • 102k • 47

published a dataset 7 months ago

DGME/wmt25

Viewer • Updated Nov 20, 2025 • 102k • 47

updated a model 7 months ago

DGME/Qwen2.5-0.5B-Mix

Text Generation • 0.5B • Updated Nov 3, 2025 • 1

published a model 7 months ago

DGME/Qwen2.5-0.5B-Mix

Text Generation • 0.5B • Updated Nov 3, 2025 • 1

New activity in ByteDance-Seed/Seed-X-PPO-7B 8 months ago

Looks like it does not work well with vLLM 0.10.1.1

👍 1

#20 opened 10 months ago by Yongzheng

upvoted a paper 8 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6, 2025 • 23

upvoted a paper 9 months ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20, 2025 • 86

DGME


DGME's activity
