OpenCodePapers

Referring Expression Segmentation on J-HMDB

Referring Expression Segmentation
Leaderboard
| Paper | Code | AP | IoU overall | IoU mean | Precision@0.5 | Precision@0.6 | Precision@0.7 | Precision@0.8 | Precision@0.9 | Model Name | Release Date |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Spectrum-guided Multi-granularity Referring Video Object Segmentation | ✓ | 0.450 | 0.737 | 0.725 | 0.972 | 0.917 | 0.714 | 0.225 | 0.003 | SgMg (Video-Swin-B) | 2023-07-25 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | ✓ | 0.446 | 0.736 | 0.723 | 0.969 | 0.914 | 0.711 | 0.213 | 0.001 | SOC (Video-Swin-B) | 2023-05-26 |
| Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation | | 0.441 | 0.68 | 0.666 | 0.874 | 0.791 | 0.586 | 0.182 | 0.30 | VLIDE | 2022-03-30 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | ✓ | 0.397 | 0.707 | 0.701 | 0.947 | 0.864 | 0.627 | 0.179 | 0.001 | SOC (Video-Swin-T) | 2023-05-26 |
| End-to-End Referring Video Object Segmentation with Multimodal Transformers | ✓ | 0.392 | 0.701 | 0.698 | 0.939 | 0.852 | 0.616 | 0.166 | 0.001 | MTTR (w=10) | 2021-11-29 |
| End-to-End Referring Video Object Segmentation with Multimodal Transformers | ✓ | 0.366 | 0.674 | 0.679 | 0.91 | 0.815 | 0.57 | 0.144 | 0.001 | MTTR (w=8) | 2021-11-29 |
| Cross-Modal Progressive Comprehension for Referring Segmentation | ✓ | 0.342 | 0.616 | 0.617 | 0.813 | 0.657 | 0.371 | 0.07 | 0.000 | CMPC-V | 2021-05-15 |
| Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation | | 0.335 | 0.598 | 0.604 | 0.783 | 0.639 | 0.378 | 0.076 | 0.000 | Hui et al. | 2021-05-14 |
| Actor and Action Modular Network for Text-based Video Segmentation | | 0.321 | 0.583 | 0.576 | 0.773 | 0.627 | 0.360 | 0.044 | 0.000 | AAMN | 2020-11-02 |
| Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries | | 0.301 | 0.554 | 0.576 | 0.742 | 0.587 | 0.316 | 0.047 | 0.000 | CMDy | 2020-04-03 |
| Polar Relative Positional Encoding for Video-Language Segmentation | | 0.294 | | | | | 0.319 | 0.06 | 0.001 | PRPE | 2020-07-20 |
| Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query | ✓ | 0.289 | 0.576 | 0.584 | 0.756 | 0.564 | 0.287 | 0.034 | 0.000 | ACGA | 2019-10-01 |
| Actor and Action Video Segmentation from a Sentence | ✓ | 0.267 | 0.555 | 0.570 | 0.712 | 0.518 | 0.264 | 0.030 | 0.000 | Gavrilyuk et al. (Optical flow) | 2018-03-20 |
| Visual-Textual Capsule Routing for Text-Based Video Segmentation | | 0.261 | 0.535 | 0.550 | 0.677 | 0.513 | 0.283 | 0.051 | 0.000 | VT-Capsule | 2020-06-01 |
| Actor and Action Video Segmentation from a Sentence | ✓ | 0.233 | 0.541 | 0.542 | 0.699 | 0.460 | 0.173 | 0.014 | 0.000 | Gavrilyuk et al. | 2018-03-20 |
| Segmentation from Natural Language Expressions | ✓ | 0.178 | 0.546 | 0.528 | 0.633 | 0.350 | 0.085 | 0.002 | 0.000 | Hu et al. | 2016-03-20 |
| Tracking by Natural Language Specification | | 0.173 | 0.529 | 0.491 | 0.578 | 0.335 | 0.103 | 0.060 | 0.000 | Li et al. | 2017-07-01 |
| Hierarchical interaction network for video object segmentation from referring expressions | | | 0.652 | 0.627 | 0.819 | 0.736 | 0.542 | 0.168 | 0.4 | HINet | 2021-11-22 |
| ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation | | | 0.644 | 0.655 | 0.880 | 0.796 | 0.566 | 0.147 | 0.002 | ClawCraneNet | 2021-03-19 |
| Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network | | | 0.628 | 0.581 | 0.764 | 0.625 | 0.389 | 0.09 | 0.001 | CMSA+CFSA | 2021-02-09 |
| Hierarchical interaction network for video object segmentation from referring expressions | | | 0.606 | 0.568 | 0.731 | 0.62 | 0.392 | 0.088 | 0.0 | RefVOS | 2021-11-22 |

PRPE: the source row also lists the values 0.572 and 0.690, in that order, without indicating which two of IoU overall, IoU mean, Precision@0.5, and Precision@0.6 they belong to.
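The leaderboard does not define its metrics. The sketch below shows one common convention in referring video segmentation, and may not match how each listed paper computed its numbers: overall IoU divides the total intersection by the total union across all samples, mean IoU averages the per-sample IoU, and Precision@K is the fraction of samples whose IoU exceeds K. The flattened-list mask representation, the function names, the strict `>` comparison, and the handling of empty masks are all assumptions made for illustration. AP is omitted.

```python
from typing import List, Sequence, Tuple

Mask = Sequence[int]  # flattened binary mask: 1 = foreground, 0 = background


def intersection_union(pred: Mask, gt: Mask) -> Tuple[int, int]:
    """Return (intersection, union) pixel counts for two same-sized binary masks."""
    if len(pred) != len(gt):
        raise ValueError("masks must have the same number of pixels")
    inter = sum(1 for p, g in zip(pred, gt) if p and g)
    union = sum(1 for p, g in zip(pred, gt) if p or g)
    return inter, union


def summarize(pairs: List[Tuple[Mask, Mask]],
              thresholds=(0.5, 0.6, 0.7, 0.8, 0.9)) -> dict:
    """Compute overall IoU, mean IoU, and Precision@K over (prediction, ground truth) pairs."""
    if not pairs:
        raise ValueError("need at least one sample")
    total_inter = 0
    total_union = 0
    ious = []
    for pred, gt in pairs:
        inter, union = intersection_union(pred, gt)
        total_inter += inter
        total_union += union
        # Assumption: two empty masks count as a perfect match.
        ious.append(inter / union if union else 1.0)
    result = {
        # Pooled over all pixels, so large objects weigh more.
        "iou_overall": total_inter / total_union if total_union else 1.0,
        # Every sample weighs the same regardless of object size.
        "iou_mean": sum(ious) / len(ious),
    }
    for k in thresholds:
        # Assumption: a sample counts only if its IoU is strictly above K.
        result[f"precision@{k}"] = sum(iou > k for iou in ious) / len(ious)
    return result


samples = [
    ([1, 1, 1, 0], [1, 1, 1, 1]),  # intersection 3, union 4
    ([1, 0, 0, 0], [1, 1, 0, 0]),  # intersection 1, union 2
]
metrics = summarize(samples)
```

Because overall IoU pools pixels while mean IoU averages per sample, the two can rank models differently, which is consistent with rows in the table where one is higher and the other lower than in a neighboring row.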