OpenCodePapers
image-sentence-alignment-on-valse-coreference-1
Multimodal Text and Image Classification
image-sentence alignment
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
pairwise accuracy
↕
Accuracy (%)
↕
ModelName
ReleaseDate
↕
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
69.2
54.3
ViLBERT 12-in-1
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
50.0
GPT2
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
49.7
CLIP
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
48.1
50.0
ViLBERT
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
47.6
50.0
VisualBERT
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
45.2
GPT1
2021-12-14
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
✓ Link
44.2
49.0
LXMERT
2021-12-14