OpenCodePapers

multiple-choice-question-answering-mcqa-on-11

Question AnsweringMultiple Choice Question Answering (MCQA)
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
Towards Expert-Level Medical Question Answering with Large Language Models✓ Link95.8Med-PaLM 2 (ER)2023-05-16
Towards Expert-Level Medical Question Answering with Large Language Models✓ Link95.1Med-PaLM 2 (CoT + SC)2023-05-16
Towards Expert-Level Medical Question Answering with Large Language Models✓ Link94.4Med-PaLM 2 (5-shot)2023-05-16
Galactica: A Large Language Model for Science✓ Link79.9Chinchilla (few-shot, k=5)2022-11-16
Galactica: A Large Language Model for Science✓ Link70.8Gopher (few-shot, k=5)2022-11-16
Galactica: A Large Language Model for Science✓ Link68.8GAL 120B (zero-shot)2022-11-16
Galactica: A Large Language Model for Science✓ Link30.6OPT (few-shot, k=5)2022-11-16
Galactica: A Large Language Model for Science✓ Link28.5BLOOM (few-shot, k=5)2022-11-16