OpenCodePapers

Question Answering on DROP

Question Answering
Results over time
Leaderboard
| Paper | Code | Accuracy | Model Name | Release Date |
| --- | --- | --- | --- | --- |
| Large Language Models Can Self-Improve | | 83 | PaLM 540B (Self Improvement, Self Consistency) | 2022-10-20 |
| Large Language Models Can Self-Improve | | 78.2 | PaLM 540B (Self Consistency) | 2022-10-20 |
| Large Language Models Can Self-Improve | | 76.2 | PaLM 540B (Self Improvement, CoT Prompting) | 2022-10-20 |
| Large Language Models Can Self-Improve | | 71.7 | PaLM 540B (Self Improvement, Standard-Prompting) | 2022-10-20 |
| Large Language Models Can Self-Improve | | 70.6 | PaLM 540B (CoT Prompting) | 2022-10-20 |
| Large Language Models Can Self-Improve | | 60 | PaLM 540B (Standard-Prompting) | 2022-10-20 |