ExtPose: Robust and Coherent Pose Estimation by Extending ViTs | | 4.9 | 5.1 | 0.823 | 0.993 | ExtPose | 2025-06-18 |
HandOS: 3D Hand Reconstruction in One Stage | | 5.0 | 5.3 | 0.812 | 0.991 | HandOS | 2024-12-02 |
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild | ✓ Link | 5.5 | 5.1 | 0.825 | 0.993 | WiLoR | 2024-09-18 |
MMHMR: Generative Masked Modeling for Hand Mesh Recovery | | 5.5 | 5.4 | 0.801 | 0.991 | MaskHand | 2024-12-18 |
Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba | ✓ Link | 5.7 | 5.3 | 0.806 | 0.992 | Hamba | 2024-07-12 |
MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image | ✓ Link | 5.7 | 5.8 | 0.784 | 0.986 | MobRecon | 2021-12-06 |
A Simple Baseline for Efficient Hand Mesh Reconstruction | ✓ Link | 5.7 | 6.0 | 0.772 | 0.986 | Zhou et al. | 2024-03-04 |
HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models | | 5.8 | 5.8 | | | HHMR | 2024-06-03 |
Mesh Graphormer | ✓ Link | 5.9 | 6.0 | 0.764 | 0.986 | MeshGraphormer | 2021-04-01 |
Reconstructing Hands in 3D with Transformers | ✓ Link | 6.0 | 5.7 | 0.785 | 0.990 | HaMeR | 2023-12-08 |
Sampling is Matter: Point-guided 3D Human Mesh Reconstruction | ✓ Link | 6.1 | 6.6 | 0.720 | 0.984 | PointHMR | 2023-04-19 |
A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image | ✓ Link | 6.2 | 6.1 | 0.767 | 0.987 | AMVUR | 2023-04-27 |
Deformable Mesh Transformer for 3D Human Mesh Recovery | ✓ Link | 6.2 | 6.4 | 0.743 | 0.984 | Deformer | 2023-01-01 |
[]() | | 6.3 | 6.5 | 0.724 | 0.981 | SAR | |
End-to-End Human Pose and Mesh Reconstruction with Transformers | ✓ Link | 6.5 | 6.3 | 0.731 | 0.984 | METRO | 2020-12-17 |
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers | ✓ Link | 6.5 | 7.1 | 0.687 | 0.983 | FastMETRO | 2022-07-27 |
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | ✓ Link | 6.6 | 6.7 | 0.722 | 0.981 | FastViT-MA36 | 2023-03-24 |
PeCLR: Self-Supervised 3D Hand Pose Estimation from monocular RGB via Equivariant Contrastive Learning | ✓ Link | 6.6 | | | | PeCLR | 2021-06-10 |
Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction | | 6.7 | 6.7 | 0.724 | 0.981 | Tang et al. | 2021-09-03 |
I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-fidelity 3D Hand Mesh Modeling | | 6.7 | 6.9 | 0.707 | 0.977 | I2UV-HandNet | 2021-02-07 |
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration | ✓ Link | 6.9 | 7.0 | 0.715 | 0.977 | CMR | 2021-03-04 |
Hand Image Understanding via Deep Multi-Task Learning | ✓ Link | 7.1 | 7.3 | 0.699 | 0.974 | HIU-DMTL | 2021-07-24 |
I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image | ✓ Link | 7.4 | 7.6 | 0.681 | 0.973 | I2L-MeshNet | 2020-08-09 |
Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation | ✓ Link | 7.7 | 7.7 | 0.664 | 0.971 | Hand4Whole | 2020-11-23 |
Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose | ✓ Link | 7.7 | 7.8 | 0.674 | 0.969 | Pose2Mesh | 2020-08-20 |
Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild | ✓ Link | 8.4 | 8.6 | 0.614 | 0.966 | YoutubeHand | 2020-04-04 |
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images | | 11.0 | 10.9 | 0.516 | 0.934 | MANO CNN | 2019-09-10 |
Collaborative Regression of Expressive Bodies using Moderation | ✓ Link | 12 | 12.1 | 0.468 | 0.919 | PIXIE hand expert | 2021-05-11 |
Monocular Expressive Body Regression through Body-Driven Attention | ✓ Link | 12.2 | 11.8 | 0.484 | 0.918 | ExPose (hand sub-network h) | 2020-08-20 |
Monocular Real-time Full Body Capture with Inter-part Correlations | | 15.7 | | | | DetNet | 2020-12-11 |
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images | | | 10.7 | 0.529 | 0.935 | Zimmermann et al. | 2019-09-10 |
3D Hand Shape and Pose from Images in the Wild | ✓ Link | | 13.0 | 0.435 | 0.898 | Boukhayma et al. | 2019-02-09 |
Learning joint reconstruction of hands and manipulated objects | ✓ Link | | 13.2 | 0.436 | 0.908 | Hasson et al. | 2019-04-11 |