OpenCodePapers

Robot Manipulation on CALVIN

Robot Manipulation
Leaderboard
| Paper | Code | Avg. sequence length (D to D) | Model name | Release date |
|---|---|---|---|---|
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | ✓ | 4.44 | DreamVLA | 2025-07-06 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | ✓ | 4.29 | VPP | 2024-12-19 |
| Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | ✓ | 4.25 | RoboVLMs | 2024-12-18 |
| OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation | ✓ | 4.08 | Openhelix | 2025-05-06 |
| UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent | | 4.08 | UP-VLA | 2025-01-31 |
| GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy | ✓ | 4.04 | GR-MG | 2024-08-26 |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | ✓ | 4.01 | MoDE | 2024-12-17 |
| RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation | ✓ | 3.855 | RoboUniView | 2024-06-27 |
| UniVLA: Learning to Act Anywhere with Task-centric Latent Actions | ✓ | 3.80 | UniVLA | 2025-05-09 |
| Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation | | 3.66 | RoboDual | 2024-10-10 |
| VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | | 3.42 | VidMan | 2024-11-14 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | ✓ | 3.35 | 3DDA | 2024-02-18 |
| OpenVLA: An Open-Source Vision-Language-Action Model | ✓ | 3.27 | OpenVLA | 2024-06-13 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | ✓ | 3.27 | 3D Diffusor Actor | 2024-02-18 |
| Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation | ✓ | 3.06 | GR-1 | 2023-12-20 |
| Vision-Language Foundation Models as Effective Robot Imitators | | 2.47 | Roboflamingo | 2023-11-02 |
| From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control | | 1.78 | LCB | 2024-05-08 |
| Learning Universal Policies via Text-Guided Video Generation | | 0.92 | Uni-Pi | 2023-01-31 |
| RT-1: Robotics Transformer for Real-World Control at Scale | ✓ | 0.90 | RT-1 | 2022-12-13 |
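
The leaderboard does not define the metric. As a rough illustration only, the sketch below assumes the usual CALVIN long-horizon convention: each evaluation chain is a fixed series of consecutive instructions, a chain stops counting at the first failed instruction, and the reported number is the mean count of instructions completed in a row. The function names and the example data are hypothetical and are not taken from any listed paper.

```python
from typing import Sequence


def completed_in_a_row(subtask_success: Sequence[bool]) -> int:
    """Count the instructions completed before the first failure in one chain."""
    count = 0
    for ok in subtask_success:
        if not ok:
            break  # later successes do not count once the chain is broken
        count += 1
    return count


def average_sequence_length(chains: Sequence[Sequence[bool]]) -> float:
    """Mean number of consecutively completed instructions over all chains."""
    if not chains:
        raise ValueError("need at least one evaluation chain")
    return sum(completed_in_a_row(c) for c in chains) / len(chains)


# Hypothetical outcomes for four chains (illustrative, not benchmark data).
example_chains = [
    [True, True, True, True, True],       # 5 completed
    [True, True, False, False, False],    # 2 completed
    [True, True, True, True, False],      # 4 completed
    [False, False, False, False, False],  # 0 completed
]

print(average_sequence_length(example_chains))  # (5 + 2 + 4 + 0) / 4 = 2.75
```

Under this assumption, the score is bounded above by the chain length, which is consistent with every entry in the table being below 5.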