OpenCodePapers

video-retrieval-on-queryd

Video Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodetext-to-video R@1text-to-video R@5text-to-video R@10ModelNameReleaseDate
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding✓ Link83.493.895.3TESTA (ViT-B/16)2023-10-29
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning✓ Link69.785.790.3LF-VILA 2022-10-12
VindLU: A Recipe for Effective Video-and-Language Pretraining✓ Link67.886.381.8VINDLU2022-12-09
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval✓ Link53.875.782.7Frozen 2021-04-01
Cross Modal Retrieval with Querybank Normalisation✓ Link15.1QB-Norm+TT-CE+2021-12-23