Paper | Code | J&F | tIoU | vIoU | ModelName | ReleaseDate |
---|---|---|---|---|---|---|
Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation | 51.3 | 71.2 | 42.6 | ReferMo | 2025-05-19 | |
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | 48.7 | 71.7 | 41.2 | ReferDINO | 2025-01-24 | |
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | ✓ Link | 42.2 | 70.4 | 36.2 | MUTR | 2023-05-25 |
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | ✓ Link | 36.6 | 68.4 | 34.6 | GLUS | 2025-04-10 |
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation | ✓ Link | 35.6 | 68.4 | 28.6 | SAMWISE | 2024-11-26 |
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | ✓ Link | 34.9 | 68.1 | 28.6 | SOC | 2023-05-26 |
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | ✓ Link | 33.1 | 69.6 | 28.2 | VideoLISA | 2024-09-29 |