OpenCodePapers

video-text-retrieval-on-test-of-time

Video RetrievalVideo-Text Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCode2-Class AccuracyModelNameReleaseDate
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding✓ Link88.33Video-LLAMA2023-06-05
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding✓ Link76.67Time-Chat2023-12-04
Test of Time: Instilling Video-Language Models with a Sense of Time✓ Link64.4TACT2023-01-05
Videoprompter: an ensemble of foundational models for zero-shot video understanding60.0VideoPrompter2023-10-23