
Sangwoo Park

Sangsang

AI & ML interests

I do LLM post-training & Distillation research (KAIST AI)

Recent Activity

authored a paper about 21 hours ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

updated a model about 23 hours ago

Sangsang/rlsd_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch

updated a model about 23 hours ago

Sangsang/rlsd_Qwen3-4B-Base_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch


Organizations

None yet

upvoted a paper 1 day ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 2 days ago • 56

upvoted 2 papers 2 days ago

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published 3 days ago • 34

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 3 days ago • 77

upvoted a paper 9 days ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published 12 days ago • 30

upvoted a paper 12 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published 15 days ago • 34

upvoted a paper 15 days ago

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published 19 days ago • 28

upvoted 2 papers about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 109

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

upvoted a paper 2 months ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published Mar 21 • 37

upvoted 2 papers 3 months ago

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Paper • 2603.09827 • Published Mar 10 • 30

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published Feb 19 • 56

upvoted a paper 4 months ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39

upvoted 5 papers 8 months ago

upvoted an article 8 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter

Sep 22, 2025 • 134

upvoted a collection 9 months ago

subliminal-learning

Collection

collection for [subliminal-learning](https://arxiv.org/abs/2507.14805) paper • 3 items • Updated Jul 23, 2025 • 4

upvoted a paper 11 months ago

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 47
