OpenCodePapers
referring-video-object-segmentation-on-revos
Video Object Segmentation
Referring Video Object Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
J&F
↕
J
↕
F
↕
R
↕
ModelName
ReleaseDate
↕
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
✓ Link
60
57.6
62.5
18.9
VRS-HQ (Chat-UniVi-13B)
2025-01-15
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
✓ Link
59.1
56.6
61.6
19.7
VRS-HQ (Chat-UniVi-7B)
2025-01-15
VISA: Reasoning Video Object Segmentation via Large Language Models
✓ Link
50.9
48.8
52.9
14.5
VISA (Chat-UniVi-13B)
2024-07-16
VISA: Reasoning Video Object Segmentation via Large Language Models
✓ Link
46.9
44.9
49.0
15.5
VISA (Chat-UniVi-7B)
2024-07-16
Tracking with Human-Intent Reasoning
✓ Link
45.0
43.2
46.8
12.8
TrackGPT (LLaVA-13B)
2023-12-29
LISA: Reasoning Segmentation via Large Language Model
✓ Link
41.6
39.8
43.5
8.6
LISA (LLaVA-13B)
2023-08-01
Language as Queries for Referring Video Object Segmentation
✓ Link
28.1
26.2
29.9
8.8
ReferFormer (Video-Swin-B)
2022-01-03
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
✓ Link
26.4
21.2
31.7
3.2
LMPM (Swin-T)
2023-08-16
End-to-End Referring Video Object Segmentation with Multimodal Transformers
✓ Link
25.5
25.1
25.9
5.6
MTTR (Video-Swin-T)
2021-11-29