OpenCodePapers

referring-expression-segmentation-on-refcocog-1

Referring Expression Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeOverall IoUMean IoUmIoUModelNameReleaseDate
Universal Segmentation at Arbitrary Granularity with Language Instruction✓ Link80.54UniLSeg-1002023-12-04
Multi-label Cluster Discrimination for Visual Representation Learning✓ Link80.5MLCD-Seg-7B2024-07-24
Universal Segmentation at Arbitrary Granularity with Language Instruction✓ Link79.47UniLSeg-202023-12-04
HyperSeg: Towards Universal Visual Segmentation with Large Language Model✓ Link78.9HyperSeg2024-11-26
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model✓ Link78.3EVF-SAM2024-06-28
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints✓ Link76.39C3VG2025-01-12
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation✓ Link75.3DETRIS2025-01-15
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation74.6GROUNDHOG2024-02-26
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation✓ Link71.09MaskRIS (Swin-B, combined DB)2024-11-28
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation71.06SafaRi-B2024-07-02
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation✓ Link70.1971.17PolyFormer-L2023-02-14
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation✓ Link69.0569.88PolyFormer-B2023-02-14
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation✓ Link66.569.42MaskRIS (Swin-B)2024-11-28
Mask Grounding for Referring Image Segmentation✓ Link66.03MagNet2023-12-19
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation✓ Link62.09LAVT (Swin-B)2021-12-04
Vision-Language Transformer and Query Generation for Referring Segmentation✓ Link56.65VLT (Darknet53)2021-08-12
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy✓ Link81.32DeRIS-L2025-07-02
Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding✓ Link70.58VATEX2024-04-12