OpenCodePapers

visual-entailment-on-snli-ve-test

Natural Language Inference, Visual Entailment
[Chart: Results over time]
Leaderboard
| Paper | Code | Accuracy (%) | Model Name | Release Date |
|---|---|---|---|---|
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | ✓ | 91.2 | OFA | 2022-02-07 |
| Prompt Tuning for Generative Multimodal Pretrained Models | ✓ | 90.12 | Prompt Tuning | 2022-08-04 |
| CoCa: Contrastive Captioners are Image-Text Foundation Models | ✓ | 87.1 | CoCa | 2022-05-04 |
| SimVLM: Simple Visual Language Model Pretraining with Weak Supervision | ✓ | 86.32 | SimVLM | 2021-08-24 |
| Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning | ✓ | 84.95 | SOHO | 2021-04-07 |
| Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks | | 80.32 | MAD (Single Model, Formerly CLIP-TD) | 2022-04-22 |
| UNITER: UNiversal Image-TExt Representation Learning | ✓ | 78.98 | UNITER (Large) | 2019-09-25 |
| Visual Entailment: A Novel Task for Fine-Grained Image Understanding | ✓ | 70.47 | EVE-ROI* | 2019-01-20 |