Learning Delicate Local Representations for Multi-Person Pose Estimation | ✓ Link | 78.6 | | | 4xRSN-50(384x288) | 2020-03-09 |
Poseur: Direct Human Pose Regression with Transformers | ✓ Link | 78.3 | 79.6 | | Poseur(384x288) | 2022-01-19 |
EvoPose2D: Pushing the Boundaries of 2D Human Pose Estimation using Accelerated Neuroevolution with Weight Transfer | ✓ Link | 76.8 | 77.5 | | EvoPose2D-L(512x384) | 2020-11-17 |
PoseFix: Model-agnostic General Human Pose Refinement Network | ✓ Link | 76.7 | 77.3 | | PoseFix(384x288) | 2018-12-10 |
Distribution-Aware Coordinate Representation for Human Pose Estimation | ✓ Link | 76.2 | | | DarkPose(384x288) | 2019-10-14 |
Rethinking on Multi-Stage Networks for Human Pose Estimation | ✓ Link | 76.1 | | | MSPN(384x288) | 2019-01-01 |
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation | ✓ Link | 75.7 | 76.4 | | AggPose(256x192) | 2022-05-11 |
Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation | ✓ Link | 75.7 | 76.3 | | MIPNet(384x288) | 2021-01-27 |
Deep High-Resolution Representation Learning for Human Pose Estimation | ✓ Link | 75.5 | 76.3 | | HRNet-48(384x288) | 2019-02-25 |
TransPose: Keypoint Localization via Transformer | ✓ Link | 75.0 | 75.8 | | TransPose(256x192) | 2020-12-28 |
RMPE: Regional Multi-person Pose Estimation | ✓ Link | 73.3 | | 23 | AlphaPose | 2016-12-01 |
Cascaded Pyramid Network for Multi-Person Pose Estimation | ✓ Link | 73.0 | | | CPN+ | 2017-11-20 |
Pose Neural Fabrics Search | ✓ Link | 70.9 | | | PNFS | 2019-09-16 |
PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model | ✓ Link | 66.5 | | | PersonLab | 2018-03-22 |
Associative Embedding: End-to-End Learning for Joint Detection and Grouping | ✓ Link | 62.8 | | | Pose-AE | 2016-11-16 |
Sapiens: Foundation for Human Vision Models | ✓ Link | | 82.2 | | Sapiens-2B | 2024-08-22 |
Sapiens: Foundation for Human Vision Models | ✓ Link | | 82.1 | | Sapiens-1B | 2024-08-22 |
Sapiens: Foundation for Human Vision Models | ✓ Link | | 81.2 | | Sapiens-0.6B | 2024-08-22 |
Sapiens: Foundation for Human Vision Models | ✓ Link | | 79.6 | | Sapiens-0.3B | 2024-08-22 |
Polarized Self-Attention: Towards High-quality Pixel-wise Regression | ✓ Link | | 79.5 | | UDP-Pose-PSA(384x288) | 2021-07-02 |
Deep High-Resolution Representation Learning for Human Pose Estimation | ✓ Link | | 75.8 | | HRNet-32 | 2019-02-25 |
Simple Baselines for Human Pose Estimation and Tracking | ✓ Link | | 72.2 | | ResNet-50 | 2018-04-17 |
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network | ✓ Link | | 69.6 | | Pose Residual Network | 2018-07-11 |
Non-local Neural Networks | ✓ Link | | 66.5 | | Mask R-CNN + NL blocks (4 in head, 1 in backbone) | 2017-11-21 |