Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Yurun Yuan
RyanYr
Follow
xuanfeiren's profile picture
John6666's profile picture
21world's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
7 days ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
upvoted
a
paper
13 days ago
POLCA: Stochastic Generative Optimization with LLM
updated
a model
13 days ago
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_tooluse_OPD
View all activity
Organizations
None yet
RyanYr
's models
12
Sort: Recently updated
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_tooluse_OPD
Updated
13 days ago
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_science_OPD
Updated
13 days ago
RyanYr/slf-dstl_Q2.5-1.5B-It_science_OPD
Updated
13 days ago
RyanYr/slf-dstl_Q2.5-1.5B-It_tooluse_SFT
2B
•
Updated
13 days ago
•
63
RyanYr/slf-dstl_Q2.5-1.5B-It_science_SFT
2B
•
Updated
13 days ago
•
80
RyanYr/pg-dapo-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
29 days ago
•
16
RyanYr/pg-dapo-qwen2.5math-1.5B-base-n8_actor
Updated
about 1 month ago
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
RyanYr/pg-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 24
RyanYr/pg-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 23
RyanYr/grpo-dapo-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 21