OpenCodePapers

semantic-segmentation-on-cityscapes

Semantic Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeMean IoU (class)Category mIoUModelNameReleaseDate
Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning✓ Link86.4VLTSeg2023-12-04
Harnessing Diffusion Models for Visual Perception with Meta Prompts✓ Link86.2MetaPrompt-SD2023-12-22
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions✓ Link86.1%InternImage-H2022-11-10
HS3: Learning with Proper Task Complexity in Hierarchically Supervised Semantic Segmentation85.8%HS3-Fuse2021-11-03
InverseForm: A Loss Function for Structured Boundary-Aware Segmentation✓ Link85.6%InverseForm2021-04-06
Vision Transformer Adapter for Dense Predictions✓ Link85.2%ViT-Adapter-L (Mask2Former, BEiT pretrain)2022-05-17
SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks✓ Link84.83SERNet-Former2024-01-28
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data✓ Link84.8%Depth Anything2024-01-19
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation✓ Link84.5%HRNetV2 + OCR +2019-09-24
EfficientPS: Efficient Panoptic Segmentation✓ Link84.21%EfficientPS2020-04-05
Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation✓ Link84.2%Panoptic-DeepLab2019-11-22
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation✓ Link83.7%HRNetV2 + OCR (w/ ASP)2019-09-24
DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation83.6%DCNAS(coarse + Mapillary)2020-03-26
Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond✓ Link83.6%Euclidean Frank-Wolfe CRFs (backbone: DeepLabv3+)(coarse)2021-10-27
Global Aggregation then Local Distribution in Fully Convolutional Networks✓ Link83.3%GALDNet(+Mapillary)++2019-09-16
ResNeSt: Split-Attention Networks✓ Link83.3%ResNeSt200 (Mapillary)2020-04-19
Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks✓ Link83.2%HANet (Height-driven Attention Networks by LGE A&B)(coarse)2020-03-11
kMaX-DeepLab: k-means Mask Transformer✓ Link83.2%kMaX-DeepLab (ConvNeXt-L, fine only)2022-07-08
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers✓ Link83.1%SegFormer (MiT-B5, Mapillary)2021-05-31
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation✓ Link83.0%OCR (HRNetV2-W48, coarse)2019-09-24
Multi Receptive Field Network for Semantic Segmentation83.0%MRFM(coarse)2020-11-17
Disentangled Non-Local Neural Networks✓ Link83%DNL (coarse)2020-06-11
Scene Segmentation with Dual Relation-aware Attention Network✓ Link82.9%DRAN(ResNet-101) WITH ONLY FINE ANNOTATED DATA2020-08-05
Gated-SCNN: Gated Shape CNNs for Semantic Segmentation✓ Link82.8%Gated-SCNN2019-07-12
Searching for Efficient Multi-Scale Architectures for Dense Image Prediction✓ Link82.7%Dense Prediction Cell2018-09-11
Channelized Axial Attention for Semantic Segmentation -- Considering Channel Relation within Spatial Attention for Semantic Segmentation✓ Link82.6%CAA (ResNet-101)2021-01-19
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation✓ Link82.4%OCR (ResNet-101, coarse)2019-09-24
Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes✓ Link82.4%DDRNet-39 1.5x2021-01-15
Self-Supervised Model Adaptation for Multimodal Semantic Segmentation✓ Link82.3%SSMA2018-08-11
GFF: Gated Fully Fusion for Semantic Segmentation✓ Link82.3%Gated Fully Fusion2019-04-03
Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation✓ Link82.1%Auto-DeepLab-L2019-01-10
Dual Graph Convolutional Network for Semantic Segmentation✓ Link82%DGCNet (ResNet-101)2019-09-13
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing✓ Link82.0%SPNet (ResNet-101)2020-03-30
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation✓ Link81.8%OCR (ResNet-101)2019-09-24
Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts81.8RPCNet2020-04-16
OCNet: Object Context Network for Scene Parsing✓ Link81.7%OCNet2018-09-04
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers✓ Link81.64%SETR-PUP++2020-12-31
High-Resolution Representations for Labeling Pixels and Regions✓ Link81.6%HRNet (HRNetV2-W48)2019-04-09
Deep High-Resolution Representation Learning for Visual Recognition✓ Link81.6%HRNetV2 (train+val)2019-08-20
Dual Attention Network for Scene Segmentation✓ Link81.5%DANet (ResNet-101)2018-09-09
CCNet: Criss-Cross Attention for Semantic Segmentation✓ Link81.4%CCNet2018-11-28
Boundary-Aware Feature Propagation for Scene Segmentation✓ Link81.4%BFP2019-08-31
Rethinking Atrous Convolution for Semantic Image Segmentation✓ Link81.3%DeepLabv3 (ResNet-101, coarse)2017-06-17
Context Prior for Scene Segmentation✓ Link81.3%CPN(ResNet-101)2020-04-03
Asymmetric Non-local Neural Networks for Semantic Segmentation✓ Link81.3%Asymmetric ALNN2019-08-21
Self-Supervised Model Adaptation for Multimodal Semantic Segmentation✓ Link81.24%AdapNet++2018-08-11
Semantic Correlation Promoted Shape-Variant Context for Segmentation✓ Link81.0%SVCNet (ResNet-101)2019-09-05
Densely connected multidilated convolutional networks for dense prediction tasks✓ Link80.8%D3Net-L2020-11-21
DenseASPP for Semantic Segmentation in Street Scenes✓ Link80.6%DenseASPP (DenseNet-161)2018-06-01
Learning a Discriminative Feature Network for Semantic Segmentation✓ Link80.3%Smooth Network with Channel Attention Block2018-04-25
Pyramid Scene Parsing Network✓ Link80.2%PSPNet++2016-12-04
PSANet: Point-wise Spatial Attention Network for Scene Parsing✓ Link80.1%PSANet (ResNet-101)2018-09-01
Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis✓ Link80.09%ESANet-R34-NBt1D2020-11-13
Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks79.9%DeepLabV3 with R-1012023-07-26
Learning a Discriminative Feature Network for Semantic Segmentation✓ Link79.3%DFN (ResNet-101)2018-04-25
Adaptive Affinity Fields for Semantic Segmentation✓ Link79.1%AAF (ResNet-101)2018-03-27
ShelfNet for Fast Semantic Segmentation✓ Link79.0%ShelfNet-342018-11-27
BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation✓ Link78.9%BiSeNet (ResNet-101)2018-08-02
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition✓ Link78.4%ResNet-382016-11-30
Pyramid Scene Parsing Network✓ Link78.4%PSPNet2016-12-04
Recurrent Scene Parsing with Perspective Understanding in the Loop✓ Link78.2%DepthSeg (ResNet-101)2017-05-20
Dynamic-structured Semantic Propagation Network77.8%DSSPN (ResNet-101)2018-03-16
Understanding Convolution for Semantic Segmentation✓ Link77.6%DUC-HDC (ResNet-101)2017-02-27
Semantic-Aware Generation for Self-Supervised Visual Representation Learning✓ Link76.9SaGe2021-11-25
SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images✓ Link76.41%SwinMTL2024-03-15
In Defense of Pre-trained ImageNet Architectures for Real-time Semantic Segmentation of Road-driving Images✓ Link75.5%SwiftNetRN-182019-03-20
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation✓ Link73.6%RefineNet (ResNet-101)2016-11-20
Searching for MobileNetV3✓ Link72.6%MobileNet V3-Large 1.02019-05-06
SqueezeNAS: Fast neural architecture search for faster semantic segmentation✓ Link72.5%SqueezeNAS (LAT Large)2019-08-05
Semantic Segmentation With Multi Scale Spatial Attention For Self Driving Cars72.4%Multi Scale Spatial Attention2020-06-30
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes✓ Link71.8%FRRN2016-11-24
Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation✓ Link71.8%LRR-4x2016-05-08
Efficient piecewise training of deep structured models for semantic segmentation71.6%Context2015-04-04
FasterSeg: Searching for Faster Real-time Semantic Segmentation✓ Link71.5%FasterSeg2019-12-23
DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation✓ Link71.3%DFANet A2019-04-03
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation✓ Link71.3LDFNet2018-09-24
LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation✓ Link70.75%88.29LightSeg-DarkNet192019-12-13
ESNet: An Efficient Symmetric Network for Real-time Semantic Segmentation✓ Link70.7%ESNet2019-06-24
ICNet for Real-Time Semantic Segmentation on High-Resolution Images✓ Link70.6%ICNet2017-04-27
LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation✓ Link70.6%LEDNet2019-05-07
Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation✓ Link70.5%WASPnet (ours)2019-12-06
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs✓ Link70.4%DeepLab-CRF (ResNet-101)2016-06-02
ERFNet: Efficient Residual Factorized ConvNet for Real-time Semantic Segmentation✓ Link69.8%ERFNet (PyTorch)2017-10-09
Fast-SCNN: Fast Semantic Segmentation Network✓ Link68%Fast-SCNN2019-02-12
LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation✓ Link67.81%86.79LightSeg-MobileNet2019-12-13
LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation✓ Link67.81%LiteSeg-MobileNet2019-12-13
Template-Based Automatic Search of Compact Semantic Segmentation Architectures✓ Link67.8%Template-Based NAS-arch12019-04-04
Template-Based Automatic Search of Compact Semantic Segmentation Architectures✓ Link67.7%Template-Based NAS-arch02019-04-04
Efficient Dense Modules of Asymmetric Convolution for Real-Time Semantic Segmentation✓ Link67.3EDANet2018-09-17
Multi-Scale Context Aggregation by Dilated Convolutions✓ Link67.1%Dilation102015-11-23
Semantic Image Segmentation via Deep Parsing Network✓ Link66.8%DPN2015-09-09
SqueezeNAS: Fast neural architecture search for faster semantic segmentation✓ Link66.8%SqueezeNAS (LAT Small)2019-08-05
SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder✓ Link66.5%SINet2019-11-20
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network✓ Link66.2%ESPNetv22018-11-28
Fully Convolutional Networks for Semantic Segmentation✓ Link65.3%FCN2016-05-20
LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation✓ Link65.17%85.39LightSeg-ShuffleNet2019-12-13
LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation✓ Link65.17%LiteSeg-ShuffleNet2019-12-13
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs✓ Link63.1%DeepLab2014-12-22
The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks✓ Link63.06%ENet + Lovász-Softmax2017-05-24
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation✓ Link60.3%ESPNet2018-03-19
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation✓ Link58.3%ENet2016-06-07
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation✓ Link57.0%SegNet2015-11-02
The Ikshana Hypothesis of Human Scene Understanding✓ Link54.82%82.22%IkshanaNet-12021-01-21
The Ikshana Hypothesis of Human Scene Understanding✓ Link45.02%76.73%IkshanaNet-22021-01-21
The Ikshana Hypothesis of Human Scene Understanding✓ Link42.07%75.61%IkshanaNet-32021-01-21