OpenCodePapers

multiple-choice-question-answering-mcqa-on-8

Question Answering > Multiple Choice Question Answering (MCQA)
Leaderboard
Paper | Code | Accuracy | Model name | Release date
Towards Expert-Level Medical Question Answering with Large Language Models | ✓ | 92 | Med-PaLM 2 (ER) | 2023-05-16
Towards Expert-Level Medical Question Answering with Large Language Models | ✓ | 90 | Med-PaLM 2 (5-shot) | 2023-05-16
Towards Expert-Level Medical Question Answering with Large Language Models | ✓ | 89 | Med-PaLM 2 (CoT + SC) | 2023-05-16
Galactica: A Large Language Model for Science | ✓ | 70 | GAL 30B (zero-shot) | 2022-11-16
Galactica: A Large Language Model for Science | ✓ | 69 | Chinchilla (few-shot, k=5) | 2022-11-16
Galactica: A Large Language Model for Science | ✓ | 68 | GAL 120B (zero-shot) | 2022-11-16
Galactica: A Large Language Model for Science | ✓ | 36 | BLOOM (few-shot, k=5) | 2022-11-16
Galactica: A Large Language Model for Science | ✓ | 35 | OPT (few-shot, k=5) | 2022-11-16