OpenCodePapers

visual-question-answering-on-clevr-humans

Visual Question Answering (VQA)
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding✓ Link81.7MDETR2021-04-26
Compositional Attention Networks for Machine Reasoning✓ Link81.5MAC2018-03-08
FiLM: Visual Reasoning with a General Conditioning Layer✓ Link75.9CNN+GRU+FiLM2017-09-22
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding✓ Link67.8NS-VQA (1K programs)2018-10-04
Inferring and Executing Programs for Visual Reasoning✓ Link66.6IEP-18K2017-05-10