Александр Петров

tmp-123

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism

liked a dataset 1 day ago

electricsheepasia/asia-owid-anemia-pregnant-women-vs-children

upvoted a paper 2 days ago

REPOT: Recoverable Program-of-Thought via Checkpoint Repair

Organizations

None yet

upvoted a paper about 9 hours ago

Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism

Paper • 2605.30852 • Published 6 days ago • 9

upvoted a paper 2 days ago

REPOT: Recoverable Program-of-Thought via Checkpoint Repair

Paper • 2605.30052 • Published 7 days ago • 9

upvoted a paper 5 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 8 days ago • 419

upvoted a paper 12 days ago

SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering

Paper • 2605.17526 • Published 18 days ago • 7

upvoted a paper 14 days ago

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

Paper • 2605.20075 • Published 16 days ago • 4

upvoted a paper 30 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 166

upvoted 5 papers about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Memory Intelligence Agent

Paper • 2604.04503 • Published Apr 6 • 58

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Paper • 2604.00830 • Published Apr 2 • 15

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

upvoted 5 papers 2 months ago

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models

Paper • 2603.27481 • Published Mar 29 • 35

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 343

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 63

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published Mar 26 • 117

upvoted 4 papers 3 months ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221