Paper | Code | Percentage correct | ModelName | ReleaseDate |
---|---|---|---|---|
Modeling Relationships in Referential Expressions with Compositional Modular Networks | ✓ Link | 72.53 | CMN | 2016-11-30 |
Compact Trilinear Interaction for Visual Question Answering | ✓ Link | 72.3 | CTI (with Boxes) | 2019-09-26 |
Coarse-to-Fine Reasoning for Visual Question Answering | ✓ Link | 71.9 | CFR | 2021-10-06 |
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding | ✓ Link | 62.2 | MCB+Att. | 2016-06-06 |