OpenCodePapers

referring-video-object-segmentation-on-revos

Video Object SegmentationReferring Video Object Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeJ&FJFRModelNameReleaseDate
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation✓ Link6057.662.518.9VRS-HQ (Chat-UniVi-13B)2025-01-15
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation✓ Link59.156.661.619.7VRS-HQ (Chat-UniVi-7B)2025-01-15
VISA: Reasoning Video Object Segmentation via Large Language Models✓ Link50.948.852.914.5VISA (Chat-UniVi-13B)2024-07-16
VISA: Reasoning Video Object Segmentation via Large Language Models✓ Link46.944.949.015.5VISA (Chat-UniVi-7B)2024-07-16
Tracking with Human-Intent Reasoning✓ Link45.043.246.812.8TrackGPT (LLaVA-13B)2023-12-29
LISA: Reasoning Segmentation via Large Language Model✓ Link41.639.843.58.6LISA (LLaVA-13B)2023-08-01
Language as Queries for Referring Video Object Segmentation✓ Link28.126.229.98.8ReferFormer (Video-Swin-B)2022-01-03
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions✓ Link26.421.231.73.2LMPM (Swin-T)2023-08-16
End-to-End Referring Video Object Segmentation with Multimodal Transformers✓ Link25.525.125.95.6MTTR (Video-Swin-T)2021-11-29