arxiv:2511.20785
Sudong Wang PRO
xiao45791
AI & ML interests
None yet
Recent Activity
updated a model about 9 hours ago
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps published a model about 10 hours ago
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps updated a model about 12 hours ago
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1320steps