OpenCodePapers

natural-language-moment-retrieval-on

VideoNatural Language Moment Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeR@1,IoU=0.5R@1,IoU=0.7R@5,IoU=0.5R@5,IoU=0.7ModelNameReleaseDate
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos✓ Link60.6738.55GVL (paragraph-level)2023-03-11
LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval✓ Link55.1635.68LLaVA-MR2024-11-21
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos✓ Link49.1829.69GVL2023-03-11
UnLoc: A Unified Framework for Video Localization Tasks✓ Link48.330.279.261.3UnLoc-L2023-08-21
UnLoc: A Unified Framework for Video Localization Tasks✓ Link48.029.781.561.4UnLoc-B2023-08-21
VLG-Net: Video-Language Graph Matching Network for Video Grounding✓ Link46.3229.8277.1563.33VLG-Net2020-11-19
Dense Regression Network for Video Grounding✓ Link45.4524.3677.9750.30DRN2020-04-07
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection✓ Link80.5457.04UniMD+Sync.2024-04-07