OpenCodePapers

instance-segmentation-on-ade20k-val

Instance Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAPAPLAPMAPSModelNameReleaseDate
OneFormer: One Transformer to Rule Universal Image Segmentation✓ Link44.264.349.923.7OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained)2022-11-10
A Simple Framework for Open-Vocabulary Segmentation and Detection✓ Link42.6OpenSeeD2023-03-14
The Missing Point in Vision Transformers for Universal Image Segmentation✓ Link40.7ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280, COCO_pretrain)2025-05-26
OneFormer: One Transformer to Rule Universal Image Segmentation✓ Link40.259.744.419.2OneFormer (DiNAT-L, single-scale, 1280x1280, COCO-pretrain)2022-11-10
Generalized Decoding for Pixel, Image, and Language✓ Link38.759.643.318.9X-Decoder (Davit-d5, Deform, single-scale, 1280x1280)2022-12-21
The Missing Point in Vision Transformers for Universal Image Segmentation✓ Link37.8ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280)2025-05-26
OneFormer: One Transformer to Rule Universal Image Segmentation✓ Link36.0OneFormer (DiNAT-L, single-scale)2022-11-10
OneFormer: One Transformer to Rule Universal Image Segmentation✓ Link35.9OneFormer (Swin-L, single-scale)2022-11-10
Generalized Decoding for Pixel, Image, and Language✓ Link35.8X-Decoder (L)2022-12-21
Dilated Neighborhood Attention Transformer✓ Link35.455.539.016.3DiNAT-L (Mask2Former, single-scale)2022-09-29
Masked-attention Mask Transformer for Universal Image Segmentation✓ Link34.954.74016.3Mask2Former (Swin-L, single-scale)2021-12-02
Masked-attention Mask Transformer for Universal Image Segmentation✓ Link33.454.637.614.6Mask2Former (Swin-L + FAPN)2021-12-02
Masked-attention Mask Transformer for Universal Image Segmentation✓ Link26.410.4Mask2Former (ResNet50)2021-12-02
Masked-attention Mask Transformer for Universal Image Segmentation✓ Link43.128.9Mask2Former (ResNet-50)2021-12-02