TianXiaoyu

Emperorizzis

AI & ML interests

Natural Language Processing, Large Language Model, Reinforcement Learning

Recent Activity

upvoted a paper 3 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 168

upvoted a paper 4 months ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

upvoted a paper 5 months ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published Jan 31 • 322

authored 2 papers 5 months ago

Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study

Paper • 2505.02142 • Published May 4, 2025

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

New activity in Emperorizzis/ASTRA-32B-Thinking-v1 5 months ago

Add library_name and pipeline_tag metadata

#1 opened 5 months ago by

New activity in Emperorizzis/ASTRA-14B-Thinking-v1 5 months ago

Add library_name, pipeline_tag and arxiv metadata

#1 opened 5 months ago by

New activity in Emperorizzis/ASTRA-SFT-1k 5 months ago

Add task category and improve metadata

#1 opened 5 months ago by

New activity in Emperorizzis/ASTRA-RL-1k 5 months ago

Improve dataset card: Add task category, tags, and update paper link

#2 opened 5 months ago by

commented a paper 5 months ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

upvoted a paper 5 months ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

submitted a paper to Daily Papers 5 months ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

updated 2 models 5 months ago

Emperorizzis/ASTRA-32B-Thinking-v1

Text Generation • 33B • Updated Feb 2 • 5 • 7

Emperorizzis/ASTRA-14B-Thinking-v1

Text Generation • 15B • Updated Feb 2 • 3 • 9

updated 2 datasets 5 months ago

Emperorizzis/ASTRA-SFT-1k

Viewer • Updated Feb 2 • 1k • 110 • 15

Emperorizzis/ASTRA-RL-1k

Viewer • Updated Feb 2 • 1k • 117 • 9

published 2 models 6 months ago

Emperorizzis/ASTRA-32B-Thinking-v1

Text Generation • 33B • Updated Feb 2 • 5 • 7

Emperorizzis/ASTRA-14B-Thinking-v1

Text Generation • 15B • Updated Feb 2 • 3 • 9

published 2 datasets 6 months ago

Emperorizzis/ASTRA-SFT-1k

Viewer • Updated Feb 2 • 1k • 110 • 15

Emperorizzis/ASTRA-RL-1k

Viewer • Updated Feb 2 • 1k • 117 • 9