OpenCodePapers

science-question-answering-on-scienceqa

Question AnsweringScience Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAvg. AccuracyNatural ScienceSocial ScienceLanguage ScienceText ContextImage ContextNo ContextGrades 1-6Grades 7-12ModelNameReleaseDate
Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training✓ Link94.8897.4790.4493.1896.9793.7594.4995.394.13MC-CoT F-Large2023-11-23
Honeybee: Locality-enhanced Projector for Multimodal LLM✓ Link94.3995.2096.2991.1894.4893.7593.1795.0493.21Honeybee2023-12-11
[]()92.53LLaVA (+ GPT-4)
Multimodal Chain-of-Thought Reasoning in Language Models✓ Link91.6895.9182.0090.8295.2688.8092.8992.4490.31Multimodal CoT2023-02-02
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding✓ Link90.9990.4195.0588.9189.6488.0590.9491.1990.64Chat-UniVi-13B2023-11-14
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering✓ Link75.1775.4470.8778.0974.6867.4379.9378.23 69.68GPT-3 - CoT (QCM→ALE , 2-shot)2022-09-20
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering✓ Link74.6176.6065.9277.5575.5166.0979.5878.4967.63GPT-3 - CoT(QCM→AE, 2-shot)2022-09-20
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering✓ Link74.1171.0076.0478.9166.4266.5381.8177.0668.82UnifiedQA-BASE - CoT (QCM→ALE)2022-09-20
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering✓ Link73.9774.6469.7476.0074.44 67.2877.4276.80 68.89GPT-3 (QCM→A, 2-shot)2022-09-20
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization✓ Link70.0Video-LaVIT2024-02-05