robot
updated
GRUtopia: Dream General Robots in a City at Scale
Paper
• 2407.10943
• Published
• 25
Make-An-Agent: A Generalizable Policy Network Generator with
Behavior-Prompted Diffusion
Paper
• 2407.10973
• Published
• 11
Cross Anything: General Quadruped Robot Navigation through Complex
Terrains
Paper
• 2407.16412
• Published
• 6
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual
Dexterous Robot Hands
Paper
• 2408.11048
• Published
• 4
LLM-3D Print: Large Language Models To Monitor and Control 3D Printing
Paper
• 2408.14307
• Published
• 5
In-Context Imitation Learning via Next-Token Prediction
Paper
• 2408.15980
• Published
• 10
Diffusion Policy Policy Optimization
Paper
• 2409.00588
• Published
• 20
Affordance-based Robot Manipulation with Flow Matching
Paper
• 2409.01083
• Published
• 20
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
Paper
• 2409.12192
• Published
• 4
Robot See Robot Do: Imitating Articulated Object Manipulation with
Monocular 4D Reconstruction
Paper
• 2409.18121
• Published
• 8
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video
Even in VLMs
Paper
• 2410.16267
• Published
• 18
Data Scaling Laws in Imitation Learning for Robotic Manipulation
Paper
• 2410.18647
• Published
• 6
Neural Fields in Robotics: A Survey
Paper
• 2410.20220
• Published
• 5
Robots Pre-train Robots: Manipulation-Centric Robotic Representation
from Large-Scale Robot Dataset
Paper
• 2410.22325
• Published
• 10
IGOR: Image-GOal Representations are the Atomic Control Units for
Foundation Models in Embodied AI
Paper
• 2411.00785
• Published
• 8
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for
Efficient Robot Execution
Paper
• 2411.02359
• Published
• 14
WildLMa: Long Horizon Loco-Manipulation in the Wild
Paper
• 2411.15131
• Published
• 7
GRAPE: Generalizing Robot Policy via Preference Alignment
Paper
• 2411.19309
• Published
• 47
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and
Proactive Robotic Failure Detection
Paper
• 2412.04455
• Published
• 38
Moto: Latent Motion Token as the Bridging Language for Robot
Manipulation
Paper
• 2412.04445
• Published
• 22
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of
Thought and Look-ahead Spatial Reasoning
Paper
• 2412.11974
• Published
• 10
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot
Learning
Paper
• 2412.10447
• Published
• 5
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
Paper
• 2412.09858
• Published
• 2
Efficient Diffusion Transformer Policies with Mixture of Expert
Denoisers for Multitask Learning
Paper
• 2412.12953
• Published
• 11
Prompting Depth Anything for 4K Resolution Accurate Metric Depth
Estimation
Paper
• 2412.14015
• Published
• 12
Learning from Massive Human Videos for Universal Humanoid Pose Control
Paper
• 2412.14172
• Published
• 10
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Paper
• 2501.01895
• Published
• 55
OmniManip: Towards General Robotic Manipulation via Object-Centric
Interaction Primitives as Spatial Constraints
Paper
• 2501.03841
• Published
• 56
Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous
Sensors via Language Grounding
Paper
• 2501.04693
• Published
• 3
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper
• 2501.09747
• Published
• 28
Embodied Red Teaming for Auditing Robotic Foundation Models
Paper
• 2411.18676
• Published
• 2
Learning Getting-Up Policies for Real-World Humanoid Robots
Paper
• 2502.12152
• Published
• 42
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Paper
• 2503.06960
• Published
• 3
Being-0: A Humanoid Robotic Agent with Vision-Language Models and
Modular Skills
Paper
• 2503.12533
• Published
• 68
Free-form language-based robotic reasoning and grasping
Paper
• 2503.13082
• Published
• 11
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper
• 2503.15558
• Published
• 50
Dita: Scaling Diffusion Transformer for Generalist
Vision-Language-Action Policy
Paper
• 2503.19757
• Published
• 51
Gemini Robotics: Bringing AI into the Physical World
Paper
• 2503.20020
• Published
• 31
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for
Embodied Interactive Tasks
Paper
• 2503.21696
• Published
• 23
NORA: A Small Open-Sourced Generalist Vision Language Action Model for
Embodied Tasks
Paper
• 2504.19854
• Published
• 7
EnerVerse-AC: Envisioning Embodied Environments with Action Condition
Paper
• 2505.09723
• Published
• 23
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient
Robotics
Paper
• 2506.01844
• Published
• 150
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in
Robotics
Paper
• 2506.00070
• Published
• 29
Ark: An Open-source Python-based Framework for Robot Learning
Paper
• 2506.21628
• Published
• 16
RoboScape: Physics-informed Embodied World Model
Paper
• 2506.23135
• Published
• 5
RoboBrain 2.0 Technical Report
Paper
• 2507.02029
• Published
• 35
Paper
• 2507.15493
• Published
• 47
Experience is the Best Teacher: Grounding VLMs for Robotics through
Self-Generated Memory
Paper
• 2507.16713
• Published
• 21
OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks
Paper
• 2508.05614
• Published
• 20
Embodied-R1: Reinforced Embodied Reasoning for General Robotic
Manipulation
Paper
• 2508.13998
• Published
• 18
RynnEC: Bringing MLLMs into Embodied World
Paper
• 2508.14160
• Published
• 20
ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for
Long-Horizon Tasks
Paper
• 2508.08240
• Published
• 45
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for
General Robot Control
Paper
• 2508.21112
• Published
• 77
HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data
for Mobile Dexterous Manipulation
Paper
• 2508.20085
• Published
• 1
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
Paper
• 2509.01106
• Published
• 52
Manipulation as in Simulation: Enabling Accurate Geometry Perception in
Robots
Paper
• 2509.02530
• Published
• 11
Nav-R1: Reasoning and Navigation in Embodied Scenes
Paper
• 2509.10884
• Published
• 9
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Paper
• 2509.26536
• Published
• 36
Robot Learning: A Tutorial
Paper
• 2510.12403
• Published
• 123
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization
Paper
• 2601.12993
• Published
• 75