OpenCodePapers

visual-question-answering-on-textvqa-test-1

Visual Question Answering (VQA)
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeoverallModelNameReleaseDate
PaLI: A Jointly-Scaled Multilingual Language-Image Model✓ Link73.1PaLI2022-09-14
[]()53.97TAP
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation✓ Link53.69TAG2022-08-03
[]()45.66ssbaseline
[]()45.51SMA single model
[]()44.8SAM (Single Model)
[]()44.73colab_buaa
[]()40.96CRN (Single Model)
[]()40.77CIG
[]()40.46M4C
[]()39.95Shuai
[]()32.46mmgnn