EarlyTom: Early Token Compression Completes Fast Video Understanding Paper • 2605.30010 • Published 3 days ago • 24
LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training Paper • 2605.29888 • Published 3 days ago • 19
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 2 days ago • 40
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 3 days ago • 37
Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving Paper • 2605.23163 • Published 6 days ago • 16
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 3 days ago • 22
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 3 days ago • 46
CubePart: An Open-Vocabulary Part-Controllable 3D Generator Paper • 2605.28763 • Published 4 days ago • 11
VibeSearchBench: Benchmarking Long-horizon Proactive Search in the Wild Paper • 2605.27882 • Published 4 days ago • 11
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes Paper • 2605.28421 • Published 4 days ago • 43
Rethinking Memory as Continuously Evolving Connectivity Paper • 2605.28773 • Published 4 days ago • 25
TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction Paper • 2605.26115 • Published 6 days ago • 50
ResearchMath-14K: Scaling Research-Level Mathematics via Agents Paper • 2605.28003 • Published 4 days ago • 43
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 4 days ago • 66
JLT: Clean-Latent Prediction in Latent Diffusion Transformers Paper • 2605.27102 • Published 5 days ago • 28
MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale Paper • 2605.27235 • Published 5 days ago • 5