Hugging Face
Xintong Li
Kaylee0501
1 follower · 1 following
https://kaylee0501.github.io/
XintongLi0501
Kaylee0501
xintong-li-970ab31b5
AI & ML interests
NLP, Multimodal, LLM Reasoning
Recent Activity
Upvoted a paper 2 days ago: Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
Updated a model 4 days ago: Kaylee0501/qwen2_7b_grpo_150
Published a model 4 days ago: Kaylee0501/qwen2_7b_grpo_150
Organizations
Models (25)
Kaylee0501/qwen2_7b_grpo_150 • 8B • Updated 4 days ago • 42
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_nat_460 • 8B • Updated 5 days ago • 18
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_930 • 8B • Updated 5 days ago • 17
Kaylee0501/qwen2_vl_7b_COT_grpo_800 • 8B • Updated 5 days ago • 175
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_nat_690 • 8B • Updated 5 days ago • 17
Kaylee0501/qwen3_vl_8b_COT_grpo_LLM-judge_400 • 9B • Updated 5 days ago • 31
Kaylee0501/qwen3_vl_8b_wo-COT_grpo_800 • 9B • Updated 5 days ago • 19
Kaylee0501/qwen3_vl_8b_COT_grpo_LLM-judge_nat_680 • 9B • Updated 5 days ago • 12
Kaylee0501/qwen3_vl_8b_wo-COT_grpo_90 • 9B • Updated 6 days ago • 18
Kaylee0501/qwen3_vl_8b_COT_grpo_reward0.3_90 • 9B • Updated 6 days ago • 19
Datasets (3)
Kaylee0501/ImplexConv-opposed • Viewer • Updated Apr 29, 2025 • 1.55k • 115 • 2
Kaylee0501/ImplexConv-supportive • Viewer • Updated Apr 29, 2025 • 814 • 39
Kaylee0501/activeCOT • Viewer • Updated Apr 29, 2025 • 11.2k • 110