Paper | Code | Val m_vIoU | Val vIoU@0.3 | Val vIoU@0.5 | ModelName | ReleaseDate |
---|---|---|---|---|---|---|
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding | ✓ Link | 40.2 | 65.8 | 36.7 | TA-STVG | 2025-02-16 |
Context-Guided Spatio-Temporal Video Grounding | ✓ Link | 39.5 | 64.5 | 36.3 | CG-STVG | 2024-01-03 |
STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding | 38.7 | 65.5 | 33.8 | STVGFormer | 2022-07-06 | |
TubeDETR: Spatio-Temporal Video Grounding with Transformers | ✓ Link | 36.4 | 58.8 | 30.6 | TubeDETR | 2022-03-30 |