OpenCodePapers

referring-expression-segmentation-on-refcoco-3

Referring Expression Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeOverall IoUMean IoUModelNameReleaseDate
Multi-label Cluster Discrimination for Visual Representation Learning✓ Link79.4MLCD-Seg-7B2024-07-24
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy✓ Link79.0181.28DeRIS-L2025-07-02
HyperSeg: Towards Universal Visual Segmentation with Large Language Model✓ Link79.0HyperSeg2024-11-26
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model✓ Link76.5EVF-SAM2024-06-28
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation✓ Link75.2DETRIS2025-01-15
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints✓ Link74.68C3VG2025-01-12
Hierarchical Open-vocabulary Universal Image Segmentation✓ Link73.9HIPIE2023-07-03
Universal Segmentation at Arbitrary Granularity with Language Instruction✓ Link73.18UniLSeg-1002023-12-04
Universal Segmentation at Arbitrary Granularity with Language Instruction✓ Link72.70UniLSeg-202023-12-04
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories✓ Link72.49SegAgent2025-03-11
Universal Instance Perception as Object Discovery and Retrieval✓ Link72.47UNINEXT-H2023-03-12
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation70.78SafaRi-B2024-07-02
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation70.5GROUNDHOG2024-02-26
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation✓ Link70.26MaskRIS (Swin-B, combined DB)2024-11-28
General Object Foundation Model for Images and Videos at Scale✓ Link69.6GLEE-Pro2023-12-14
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation✓ Link69.3372.15PolyFormer-L2023-02-14
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation✓ Link67.6470.65PolyFormer-B2023-02-14
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation✓ Link67.5471.68MaskRIS (Swin-B)2024-11-28
Mask Grounding for Referring Image Segmentation✓ Link66.16MagNet2023-12-19
GRES: Generalized Referring Expression Segmentation✓ Link66.04ReLA2023-06-01
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation✓ Link63.53VLT2022-10-28
CRIS: CLIP-Driven Referring Image Segmentation✓ Link62.27CRIS2021-11-30
MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation62.23MaIL2021-11-21
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation✓ Link62.14LAVT2021-12-04
Vision-Language Transformer and Query Generation for Referring Segmentation✓ Link55.50VLT2021-08-12
Comprehensive Multi-Modal Interactions for Referring Image Segmentation✓ Link52.75SHNet2021-04-21
Referring Image Segmentation via Cross-Modal Progressive Comprehension✓ Link49.56CPMC2020-10-01
Bi-Directional Relationship Inferring Network for Referring Image Segmentation48.57BRINet2020-06-01
See-Through-Text Grouping for Referring Image Segmentation48.18STEP (5-fold)2019-10-01
MAttNet: Modular Attention Network for Referring Expression Comprehension✓ Link46.67MattNet2018-01-24
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation✓ Link44.71RefVOS with BERT + MLM loss2020-10-01
Cross-Modal Self-Attention Network for Referring Image Segmentation✓ Link43.76CMSA2019-04-09
Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding✓ Link70.02VATEX2024-04-12