OpenCodePapers

monocular-depth-estimation-on-nyu-depth-v2

Depth EstimationMonocular Depth Estimation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeabsolute relative errorRMSElog 10Delta < 1.25Delta < 1.25^2Delta < 1.25^3ModelNameReleaseDate
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors✓ Link0.0260.1280.9881.0001.000HybridDepth2024-07-26
Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator✓ Link0.0430.981Distill Any Depth2025-02-26
UniK3D: Universal Camera Monocular 3D Estimation✓ Link0.0440.1730.0190.9890.9981.000UniK3D (FT, metric)2025-03-20
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler✓ Link0.0460.1800.0200.9880.9981.000UniDepthV2 (FT, metric)2025-02-27
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage✓ Link0.0460.977PrimeDepth + Depth Anything2024-09-13
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation✓ Link0.0470.1830.0200.9890.9981.000Metric3Dv2(L, FT)2024-03-22
DepthMaster: Taming Diffusion Models for Monocular Depth Estimation✓ Link0.0500.972DepthMaster2025-01-05
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion0.0510.251GRIN2024-09-15
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think✓ Link0.0520.966Marigold + E2E FT(zero-shot)2024-09-17
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation✓ Link0.0550.2240.0240.9640.9910.998Marigold2023-12-04
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data✓ Link0.0560.2060.0240.9840.9981.000Depth Anything2024-01-19
UniDepth: Universal Monocular Metric Depth Estimation✓ Link0.0580.2010.0240.9840.9970.999UniDepth (Zero-shot)2024-03-27
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage✓ Link0.0580.966PrimeDepth2024-09-13
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation✓ Link0.0590.2180.0260.9780.9970.999ECoDepth2024-03-27
Harnessing Diffusion Models for Visual Perception with Meta Prompts✓ Link0.0610.2230.0270.9760.9970.999MetaPrompt-SD2023-12-22
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment✓ Link0.0610.2240.0270.9760.9970.999EVP2023-12-13
Text-image Alignment for Diffusion-based Perception✓ Link0.0620.2250.0270.9760.9970.999TADP2023-09-29
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation0.0630.2330.0270.9810.9960.999FutureDepth2024-03-19
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation0.0660.2380.0290.9640.9950.999MeSa2023-10-06
PolyMaX: General Dense Prediction with Mask Transformer✓ Link0.0670.250.0290.9690.99580.999PolyMaX(ConvNeXt-L)2023-11-09
Unleashing Text-to-Image Diffusion Models for Visual Perception✓ Link0.0690.2540.0300.9640.9950.999VPD2023-03-03
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation✓ Link0.0720.2820.0310.94930.9910.997NVDS(DPT-L)2023-07-17
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model0.0720.2960.0310.9530.9890.996DMD2023-12-20
ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation✓ Link0.0740.2670.0320.9570.9940.999ScaleDepth-N2024-07-11
Monocular Depth Estimation using Diffusion Models0.0740.3140.0320.9460.987 0.996DepthGen2023-02-28
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth✓ Link0.0750.2700.0320.9550.9950.999ZoeD-M12-N2023-02-23
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token✓ Link0.0760.2750.0330.9540.9940.999AiT-P(SwinV2-L)2023-01-05
Large-scale Monocular Depth Estimation in the Wild0.0800.3640.0330.9310.9860.996Gaming for Depth (GfD)2023-09-18
Revealing the Dark Secrets of Masked Image Modeling✓ Link0.0830.2870.0350.9490.9940.999SwinV2-L 1K-MIM2022-05-26
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image✓ Link0.0830.3100.0350.9440.9860.995Metric3D (ConvNeXt-Large, Zero-shot testing)2023-07-20
VA-DepthNet: A Variational Approach to Single Image Depth Prediction✓ Link0.0860.3040.0370.9370.9920.999VA-DepthNet(SwinV1-L)2023-02-13
iDisc: Internal Discretization for Monocular Depth Estimation✓ Link0.0860.9930.999iDisc2023-04-13
Analysis of NaN Divergence in Training Monocular Depth Estimation Model0.08640.30460.03650.93610.99160.9981MIM-Swin-V22023-11-07
NDDepth: Normal-Distance Assisted Monocular Depth Estimation✓ Link0.0870.3110.0380.9360.9910.998NDDepth2023-09-19
IEBins: Iterative Elastic Bins for Monocular Depth Estimation✓ Link0.0870.3140.0380.9360.9920.998IEBins2023-09-25
URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation✓ Link0.0880.3160.0380.9330.9920.998URCDC-Depth2023-02-16
Improving Deep Regression with Ordinal Entropy✓ Link0.0890.3210.0390.932OrdinalEntropy2023-01-21
Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention✓ Link0.0900.3220.0390.9290.9910.998PixelFormer2022-10-17
Learning to Recover 3D Scene Shape from a Single Image✓ Link0.090.916LeReS2020-12-17
DINOv2: Learning Robust Visual Features without Supervision✓ Link0.09070.2790.03710.94970.9960.9994DINOv2 (ViT-g/14 frozen, w/ DPT decoder)2023-04-14
DDP: Diffusion Model for Dense Visual Prediction✓ Link0.0940.3290.0400.9210.9900.998DDP (step3)2023-03-30
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation✓ Link0.0940.3300.0400.9250.9890.997BinsFormer2022-04-03
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation✓ Link0.0950.3340.0410.9220.9920.998NeWCRFs2022-03-03
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation✓ Link0.0950.3540.0410.9190.9880.997D-Net2021-09-29
DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation✓ Link0.0960.3390.0410.9210.9890.998DepthFormer2022-03-27
Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth✓ Link0.0980.3440.0420.9150.9880.997GLPDepth2022-01-19
LocalBins: Improving Depth Estimation by Learning Local Distributions✓ Link0.0980.3510.0420.910.9860.997LocalBins2022-03-28
Depth Map Decomposition for Monocular Depth Estimation✓ Link0.0980.3550.0420.9130.9870.998Depth-Map-Decomposition-HRWSI2022-08-23
Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion✓ Link0.1000.3450.0420.9130.9880.997Depthformer2022-07-10
Depth Map Decomposition for Monocular Depth Estimation✓ Link0.1000.3620.0430.9070.9860.997Depth-Map-Decomposition2022-08-23
IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty✓ Link0.1010.3520.0430.9100.9850.997IronDepth2022-10-07
AdaBins: Depth Estimation using Adaptive Bins✓ Link0.1030.3640.0440.9030.9840.997AdaBins2020-11-28
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior✓ Link0.1040.3560.0430.8980.9810.996P3Depth2022-04-05
CutDepth:Edge-aware Data Augmentation in Depth Estimation✓ Link0.1040.3750.0440.8990.9850.997CutDepth2021-07-16
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals✓ Link0.1050.3840.0450.8950.9830.996LapDepth2021-01-08
Vision Transformers for Dense Prediction✓ Link0.1100.3570.0450.9040.9880.994DPT-Hybrid2021-03-24
Enforcing geometric constraints of virtual normal for depth prediction✓ Link0.1110.4160.0480.8750.9760.989VNL2019-07-29
Focal-WNet: An Architecture Unifying Convolution and Attention for Depth Estimation✓ Link0.1160.3980.0480.8750.9800.995Focal-WNet2022-07-18
Auto-Rectify Network for Unsupervised Indoor Depth Estimation✓ Link0.1380.5320.0590.8200.956SC-DepthV22020-06-04
NVS-MonoDepth: Improving Monocular Depth Prediction with Novel View Synthesis0.331NVS-MonoDepth2021-12-22
From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation✓ Link0.3920.995BTS2019-07-24
On Deep Learning Techniques to Boost Monocular Depth Estimation for Autonomous Navigation0.429DSN2020-10-13
High Quality Monocular Depth Estimation via Transfer Learning✓ Link0.465DenseDepth2018-12-31
Attention-based Context Aggregation Network for Monocular Depth Estimation✓ Link0.496ACAN2019-01-29
SharpNet: Fast and Accurate Recovery of Occluding Contours in Monocular Depth Estimation✓ Link0.496SharpNet2019-05-21
Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation0.497PAP-Depth2019-06-08
SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation0.497SDC-Depth2020-06-01
Deep Ordinal Regression Network for Monocular Depth Estimation✓ Link0.509DORN2018-06-06
Structure-Aware Residual Pyramid Network for Monocular Depth Estimation✓ Link0.514SARPN2019-07-13
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding✓ Link0.5183InvPT2022-03-15
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells✓ Link0.523FastDenseNas-arch02018-10-25
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells✓ Link0.525FastDenseNas-arch22018-10-25
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells✓ Link0.526FastDenseNas-arch12018-10-25
Revisiting Single Image Depth Estimation: Toward Higher Resolution Maps with Accurate Object Boundaries✓ Link0.530SENet-1542018-03-23
Generating and Exploiting Probabilistic Monocular Depth Estimates✓ Link0.536ProbMonoDepth2019-06-13
Monocular Depth Estimation Using Relative Depth Maps0.538RelativeDepth2019-06-01
Prompt Guided Transformer for Multi-Task Dense Prediction✓ Link0.5468PGT (Swin-S)2023-07-28
Index Network✓ Link0.565Index Network2019-08-11
Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations✓ Link0.565Multi-Task Light-Weight-RefineNet2018-09-13
Single Image Depth Estimation Trained via Depth from Defocus Cues✓ Link0.575DeepLabV3+ (F10)2020-01-14
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation✓ Link0.586Xu et al.2017-04-07
Prompt Guided Transformer for Multi-Task Dense Prediction✓ Link0.59PGT (Swin-T)2023-07-28
Structure-Attentioned Memory Network for Monocular Depth Estimation0.604SOM2019-09-10
A Two-Streamed Network for Estimating Fine-Scaled Depth Maps from Single RGB Images0.635Li et al.2016-07-04
Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture✓ Link0.641Eigen et al.2014-11-18