OpenCodePapers

referring-video-object-segmentation-on-long

Video Object SegmentationReferring Video Object Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeJ&FtIoUvIoUModelNameReleaseDate
Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation51.371.242.6ReferMo2025-05-19
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations48.771.741.2ReferDINO2025-01-24
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation✓ Link42.270.436.2MUTR2023-05-25
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation✓ Link36.668.434.6GLUS2025-04-10
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation✓ Link35.668.428.6SAMWISE2024-11-26
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation✓ Link34.968.128.6SOC2023-05-26
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos✓ Link33.169.628.2VideoLISA2024-09-29