Paper | Code | Pairwise accuracy | Accuracy (%) | Model name | Release date |
---|---|---|---|---|---|
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 95.6 | 89.0 | ViLBERT 12-in-1 | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 78.6 | 55.8 | LXMERT | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 66.9 | | CLIP | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 66.5 | 2.4 | ViLBERT | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 61.8 | | GPT1 | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 58.0 | | GPT2 | 2021-12-14 |
VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena | ✓ Link | 39.7 | 49.3 | VisualBERT | 2021-12-14 |