OpenCodePapers

video-retrieval-on-condensed-movies

Video Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodetext-to-video R@1text-to-video R@5text-to-video R@10ModelNameReleaseDate
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding✓ Link24.946.555.1TESTA (ViT-B/16)2023-10-29
VindLU: A Recipe for Effective Video-and-Language Pretraining✓ Link18.436.4 44.3VINDLU2022-12-09
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning✓ Link13.632.541.8LF-VILA 2022-10-12