OpenCodePapers

3d-human-pose-estimation-on-human36m

Pose Estimation3D Human Pose Estimation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAverage MPJPE (mm)Using 2D ground-truth jointsMulti-View or MonocularPA-MPJPEAcceleration ErrorAngular ErrorMPVE (mm)ModelNameReleaseDate
Learnable human mesh triangulation for 3D human pose and shape estimation17.59NoMulti-View11.3323.7LMT R152 384x3842022-08-24
Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction26.0NoMulti-ViewGeometry-Biased Transformer (HRNet)2023-12-28
Epipolar Transformers✓ Link26.9NoMulti-ViewEpipolar Transformer+R50 256×256+RPSM2020-05-10
Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation28.5NoMulti-ViewMTF-Transformer (M=0.4, T=7)2021-10-11
Generalizable Human Pose Triangulation29.1NoMulti-ViewGeneralizable Human Pose Triangulation2021-10-01
Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation29.4NoMulti-ViewMTF-Transformer (M=0.4, T=1)2021-10-11
Real-Time Multi-View 3D Human Pose Estimation using Semantic Feedback to Smart Edge Sensors✓ Link29.8NoMulti-ViewSmartEdgeSensor2021-06-28
Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation30.2NoMulti-ViewLWCDR2020-04-05
Learnable human mesh triangulation for 3D human pose and shape estimation30.56NoMulti-View14.6142.28LMT R50 224x2242022-08-24
FLEX: Extrinsic Parameters-free Multi-view 3D Human Motion Reconstruction✓ Link30.9NoMulti-ViewFLEX2021-05-05
Cross View Fusion for 3D Human Pose Estimation✓ Link31.17NoMulti-ViewFusion-RPSM (t=10)2019-09-03
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation✓ Link33.0NoMonocular26.2KTPFormer (T=243)2024-03-31
Differentiable Dynamics for Articulated 3d Human Motion Reconstruction33.4NoMonocular21.9DiffPhy (W=480)2022-05-24
PoseRN: A 2D pose refinement network for bias-free multi-view 3D human pose estimation38.4NoMulti-ViewPoseRN2021-07-07
SoloPose: One-Shot Kinematic 3D Human Pose Estimation with Video Data Augmentation✓ Link 38.9NoMonocular29.9SoloPose2023-12-15
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates✓ Link39NoMulti-ViewPose Consensus (multi-view, GT calib.)2019-11-21
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video✓ Link39.8NoMonocularMixSTE (HRNet, T=243)2022-03-02
3D Human Pose Estimation using Spatio-Temporal Networks with Explicit Occlusion Training40.1NoMonocular30.7Spatio-Temporal Network (T=128)2020-04-07
IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation40.2NoMonocularIVT (f=5)2022-08-06
Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos✓ Link40.9NoMonocular30.4GnTCN2020-12-22
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video✓ Link40.9NoMonocularMixSTE (CPN, T=243)2022-03-02
Conditional Directed Graph Convolution for 3D Human Pose Estimation✓ Link41.1NoMonocularU-CondDGConv2021-07-16
P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation✓ Link42.1NoMonocular34.4P-STMO (N=243)2022-03-15
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video✓ Link42.4NoMonocularMixSTE (CPN, T=81)2022-03-02
Motion Guided 3D Pose Estimation from Videos✓ Link42.6NoMonocularUGCN (HR-Net)2020-04-29
Occlusion-Aware Networks for 3D Human Pose Estimation in Video42.9NoMonocularOcclusion-Aware Networks2019-10-01
MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation✓ Link43NoMonocularMHFormer2021-11-24
ConvFormer: Parameter Reduction in Transformer Models for 3D Human Pose Estimation by Leveraging Dynamic Multi-Headed Convolutional Attention✓ Link43.2NoMonocularConvFormer (T=243, CPN)2023-04-04
Context Modeling in 3D Human Pose Estimation: A Unified Perspective✓ Link43.4NoMonocularContextPose2021-03-29
CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation✓ Link43.7NoMonocularCrossFormer (T=81)2022-03-24
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation✓ Link43.7NoMonocularStridedTransformer (T=351)2021-03-26
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation✓ Link44NoMonocularStridedTransformer (T=243)2021-03-26
Anatomy-aware 3D Human Pose Estimation with Bone-based Pose Decomposition✓ Link44.1NoMonocularAnatomy3D2020-02-24
P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation✓ Link44.1NoMonocularP-STMO-S (N=81)2022-03-15
3D Human Pose Estimation with Spatial and Temporal Transformers✓ Link44.3NoMonocularPoseFormer (f=81)2021-03-18
Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation✓ Link44.3NoMonocularRIE (T=243 CPN)2021-07-29
Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows✓ Link44.3NoMonocularProbabilistic Monocular (T=200)2021-07-29
Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images44.4NoMulti-ViewShape-aware SMPL2019-08-26
TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking44.6NoMonocularTesseTrack (Monocular)2021-06-16
SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach✓ Link44.8NoMonocularSRNet (T=243)2020-07-18
Enhanced 3D Human Pose Estimation from Videos by using Attention-Based Neural Network with Dilated Convolutions44.8NoMonocularAttention (T=243 CPN)2021-03-04
Motion Projection Consistency Based 3D Human Pose Estimation with Virtual Bones from Monocular Videos44.8NoMonocularVirtual Bones (T=243 CPN)2021-06-28
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates✓ Link45NoMulti-ViewPose Consensus (multi-view, est. calib.)2019-11-21
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation45.1NoMonocularHEMlets Pose2019-10-26
Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction✓ Link45.1NoMulti-ViewAttention3DHumanPose (T=243 CPN)2020-06-01
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation✓ Link45.4NoMonocularStridedTransformer (T=81)2021-03-26
Double-chain Constraints for 3D Human Pose Estimation in Images and Videos✓ Link46.1NoMonocularDC-GCT(T=1)2023-08-10
Trajectory Space Factorization for Deep Video-Based 3D Human Pose Estimation✓ Link46.6NoMonocularTrajectory Space Factorization (50 frames)2019-08-22
3D human pose estimation in video with temporal convolutions and semi-supervised training✓ Link46.8NoMonocular36.5VideoPose3D (T=243)2018-11-28
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation✓ Link46.9NoMonocularStridedTransformer (T=27)2021-03-26
Motion Projection Consistency Based 3D Human Pose Estimation with Virtual Bones from Monocular Videos47.4NoMonocularVirtual Bones (T=9 CPN)2021-06-28
Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation47.9NoMonocularSkeletal GNN2021-08-16
GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation✓ Link48NoMonocularGraphMLP2022-06-13
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks✓ Link48.8NoMonocularSTRGCN (T=7)2019-10-01
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks✓ Link49.1NoMonocularSTRGCN (T=3)2019-10-01
Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation✓ Link49.2NoMonocularDiffPyramid (CPN)2025-06-03
Dual networks based 3D Multi-Person Pose Estimation from Monocular Video✓ Link49.31NoMonocularDual network2022-05-02
Modulated Graph Convolutional Network for 3D Human Pose Estimation✓ Link49.4NoMonocularModulated-GCN2021-01-01
Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation49.4NoMonocularMTF-Transformer (M=0.4, T=7, N=1)2021-10-11
Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation✓ Link49.5NoMonocularPGFormer (CPN)2025-06-03
Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network✓ Link49.6NoMulti-ViewMDN (Multi-View)2019-04-11
Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization✓ Link49.7NoMonocularRay3D (T=9 CPN)2022-03-22
SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach✓ Link49.9NoMonocularSRNet (T=1)2020-07-18
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation✓ Link50.2NoMonocularHR-Net+VPose+PoseAug2021-05-06
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation✓ Link50.5NoMonocularJointformer (CPN)2022-08-07
Learning Temporal 3D Human Pose Estimation with Pseudo-Labels✓ Link50.6NoMulti-ViewMulti-view Temporal self-supervised2021-10-14
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks✓ Link50.6NoMonocularSTRGCN (T=1)2019-10-01
Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation50.7NoMonocularMTF-Transformer (M=0.4, T=1, N=1)2021-10-11
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation✓ Link50.8NoMonocularHR-Net+ST-GCN+PoseAug2021-05-06
Cascaded deep monocular 3D human pose estimation with evolutionary training data✓ Link50.9NoMonocularTAG-Net2020-06-14
Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation✓ Link51.1NoMonocular43.4LoCO2020-04-01
3D human pose estimation in video with temporal convolutions and semi-supervised training✓ Link51.8NoMonocular40VideoPose3D (T=1)2018-11-28
Graph Stacked Hourglass Networks for 3D Human Pose Estimation✓ Link51.9NoMonocularGraph Stacked Hourglass Network (CPN)2021-03-30
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates✓ Link52NoMonocularPose Consensus (monocular)2019-11-21
3D Human Pose Estimation Using Möbius Graph Convolutional Networks52.1NoMonocularMöbiusGCN2022-03-20
PoseLifter: Absolute 3D human pose lifting network from a single noisy 2D human pose✓ Link52.5NoMonocular39.1PoseLifter2019-10-26
Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network✓ Link52.7NoMonocular42.6MDN2019-04-11
Optimizing Network Structure for 3D Human Pose Estimation52.7NoMonocularONS LCN2019-10-01
Semantic Graph Convolutional Networks for 3D Human Pose Regression✓ Link57.6NoMonocularSemGCN2019-04-06
Generalizing Monocular 3D Human Pose Estimation in the Wild✓ Link58NoMulti-ViewStereoscopic View Synthesis Subnetwork2019-04-11
Exploiting temporal information for 3D pose estimation✓ Link58.5NoMonocularSequence-to-sequence network2017-11-23
TAPE: Temporal Attention-based Probabilistic human pose and shape Estimation✓ Link60NoMonocular39.56.5TAPE (T=16)2023-04-29
Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows✓ Link61.8NoMonocularProbabilistic Monocular (T=1)2021-07-29
Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry✓ Link62.0NoMulti-View2D-3D Lifting self-supervised2021-08-17
A simple yet effective baseline for 3d human pose estimation✓ Link62.9NoMonocularSIM (SH detections FT) (MA)2017-05-08
VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation✓ Link64.3NoMulti-ViewVoxelKeypointFusion (transfer)2024-10-24
VIBE: Video Inference for Human Body Pose and Shape Estimation✓ Link65.6NoMonocular41.4VIBE2019-12-11
CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild✓ Link74.3NoMultiViewCanonPose2020-11-30