8 8

Lin Zihan

thomas-hill

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a paper 1 day ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

liked a dataset 3 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 4 days ago • 173

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published 6 days ago • 12

liked a dataset 3 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 1.09k • 738

upvoted a paper 6 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 15 days ago • 347

liked a model 6 days ago

tencent/HY-OmniWeaving

Updated about 8 hours ago • 245

liked a model 7 days ago

juergengunz/fluxer

Updated 7 minutes ago • 4

liked a dataset 8 days ago

phobia76/pmxt-l2-dump

Viewer • Updated 33 minutes ago • 25.9B • 2.23k • 2

upvoted a paper 10 days ago

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published 17 days ago • 182

liked a model 10 days ago

Hawaii0126/latent_adv

Updated 1 day ago • 1

upvoted 2 papers 10 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 12 days ago • 339

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 22 days ago • 330

liked a model 11 days ago

curry2004/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated 11 days ago • 1

upvoted a paper 23 days ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published 25 days ago • 367

liked a model 29 days ago

LocoreMind/LocoTrainer-4B

Text Generation • 4B • Updated 29 days ago • 2.15k • 56

liked a dataset about 1 month ago

LeeXiangNO1/DyNativeGaussian_sequence

Preview • Updated 18 days ago • 6.04k • 53

upvoted a paper about 1 month ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 193

Lin Zihan

AI & ML interests

Recent Activity

Organizations

thomas-hill's activity