Yurun Yuan

RyanYr

yurun-yuan

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval

updated a model about 2 months ago

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0

published a model about 2 months ago

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0

Organizations

None yet

RyanYr's models 30

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0

Updated May 6 • 2

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior

Updated May 5 • 1

RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl

Updated May 5 • 1

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl

Updated May 5 • 2

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl

Updated May 5 • 4

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl

Updated May 5 • 2

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior

Updated May 5 • 1

RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref

Updated May 5 • 1

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior

Updated May 5 • 1

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref

Updated May 5 • 2

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl

Updated May 5 • 2

RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl

Updated May 5 • 2

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl

Updated May 5 • 1

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl

Updated May 5 • 1

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl

Updated May 4 • 4

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl

Updated May 4 • 2

RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior

Updated May 4 • 1

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl

Updated May 4 • 3

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl

Updated May 4 • 1

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl

Updated May 4 • 1

RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior

Updated May 4 • 1

RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B

Updated May 4 • 2

RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B

Updated May 4 • 2

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior

Updated May 4 • 1

RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl

Updated May 3 • 1

RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4

RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4

Updated Apr 20 • 1

RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor

Updated Feb 25 • 1

RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor

Updated Feb 25 • 1