Hierarchical Memory Matching Network for Video Object Segmentation | ✓ Link | 80.4 | 77.7 | 83.1 | | | | 89.4 | 88.2 | 90.6 | 10.0 | HMMN | 2021-09-23 |
Tackling Background Distraction in Video Object Segmentation | ✓ Link | 80.0 | 77.6 | 82.3 | 69.4 | 66.6 | 72.2 | 86.8 | 87.5 | 86.2 | 50.1 | TBD | 2022-07-14 |
Associating Objects with Transformers for Video Object Segmentation | ✓ Link | 79.2 | 76.4 | 82.0 | | | | | | | 40.0 | AOT-S | 2021-06-04 |
Joint Inductive and Transductive Learning for Video Object Segmentation | ✓ Link | 78.6 | 76.0 | 81.2 | | | | | | | 4.00 | JOINT | 2021-08-08 |
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation | ✓ Link | 78.4 | 75.4 | 81.4 | | | | | | | | SSTVOS | 2021-01-21 |
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization | ✓ Link | 77.2 | 74.5 | 79.8 | | | | 88.1 | 87.3 | 89.0 | 36.0 | SWEM | 2022-08-22 |
Kernelized Memory Network for Video Object Segmentation | ✓ Link | 76.0 | 74.2 | 77.8 | | | | 87.6 | 87.1 | 88.1 | 8.33 | KMN | 2020-07-16 |
Learning Position and Target Consistency for Memory-based Video Object Segmentation | | 75.2 | 73.1 | 77.2 | | | | | | | 8.47 | LCM | 2021-04-09 |
Efficient Regional Memory Network for Video Object Segmentation | ✓ Link | 75.0 | 72.8 | 77.2 | | | | 81.5 | 80.6 | 82.3 | 11.9 | RMNet | 2021-03-24 |
Collaborative Video Object Segmentation by Foreground-Background Integration | ✓ Link | 74.9 | 72.1 | 77.7 | | | | 86.1 | 85.3 | 86.9 | 5.56 | CFBI | 2020-03-18 |
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation | | 74.7 | 71.5 | 77.9 | 63.1 | 59.7 | 66.5 | 85.7 | 85.4 | 86.0 | | STG-Net | 2020-12-10 |
Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement | ✓ Link | 74.6 | 73.0 | 76.1 | | | | | | | 4.00 | AFB-URR | 2020-10-15 |
Learning What to Learn for Video Object Segmentation | ✓ Link | 74.3 | 72.2 | 76.3 | | | | | | | 14.0 | LWL | 2020-03-25 |
Pixel-Level Bijective Matching for Video Object Segmentation | ✓ Link | 72.7 | 70.7 | 74.7 | 62.7 | 60.7 | 64.7 | 82.2 | 82.9 | 81.4 | 45.9 | BMVOS | 2021-10-04 |
A Transductive Approach for Video Object Segmentation | ✓ Link | 72.3 | 69.9 | 74.7 | 63.1 | 58.8 | 67.4 | | | | 37.0 | TVOS | 2020-04-15 |
Video Object Segmentation using Space-Time Memory Networks | ✓ Link | 71.6 | 69.2 | 74.0 | | | | 86.5 | 84.8 | 88.1 | 6.25 | STM | 2019-04-01 |
Fast Video Object Segmentation using the Global Context Module | ✓ Link | 71.4 | 69.3 | 73.5 | | | | 86.6 | 87.6 | 85.7 | 25.0 | GC | 2020-01-30 |
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation | ✓ Link | 70.7 | 68.1 | 73.3 | | | | | | | | DMM-Net | 2019-09-27 |
FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation | ✓ Link | 69.1 | 65.9 | 72.3 | 54.4 | 51.2 | 57.5 | 81.7 | 80.3 | 83.1 | 2.22 | FEELVOS | 2019-02-25 |
Learning Fast and Robust Target Models for Video Object Segmentation | ✓ Link | 68.8 | 66.4 | 71.2 | | | | 81.7 | | | 21.9 | FRTM | 2020-02-27 |
[]() | | 68.5 | 65.3 | 71.6 | 55.2 | | | 86.1 | 85.8 | 86.4 | 0.92 | DIPNet | |
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation | ✓ Link | 67.4 | 64.9 | 69.9 | 57.2 | 54.8 | 59.7 | | | | 10.0 | AGSS-VOS | 2019-10-01 |
Fast Video Object Segmentation via Dynamic Targeting Network | | 67.4 | 64.2 | 70.6 | | | | 83.6 | 83.7 | 83.5 | 14.3 | DTN | 2019-10-01 |
RANet: Ranking Attention Network for Fast Video Object Segmentation | ✓ Link | 65.7 | 63.2 | 68.2 | 55.3 | 53.4 | 57.2 | 85.5 | 85.5 | 85.4 | 30.3 | RANet | 2019-08-19 |
Spatiotemporal CNN for Video Object Segmentation | ✓ Link | 61.7 | 58.7 | 64.6 | | | | 83.8 | 83.8 | 83.8 | 0.26 | STCNN | 2019-04-04 |
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model | ✓ Link | | | | | | | | | | 29.6 | XMem | 2022-07-14 |