OpenCodePapers

zero-shot-video-retrieval-on-vatex

Zero-Shot Video Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodetext-to-video R@1text-to-video R@5text-to-video R@10video-to-text R@1video-to-text R@5video-to-text R@10ModelNameReleaseDate
Gramian Multimodal Representation Learning and Alignment✓ Link83.999.582.799GRAM2024-12-16
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding✓ Link71.594.097.185.397.999.3InternVideo2-6B2024-03-22
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding✓ Link70.493.496.985.497.699.1InternVideo2-1B2024-03-22
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners53.283.390.173.693.297.2VideoCoCa2022-12-09
InternVideo: General Video Foundation Models via Generative and Discriminative Learning✓ Link49.569.5InternVideo2022-12-06