OpenCodePapers

Question Answering on QuALITY

Question Answering
Leaderboard
Paper | Code | Accuracy (%) | Model Name | Release Date
Model Card and Evaluations for Claude Models | - | 84.1 | Claude 1.3 (5-shot) | 2023-07-11
Model Card and Evaluations for Claude Models | - | 83.2 | Claude 2 (5-shot) | 2023-07-11
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval | ✓ | 82.6 | RAPTOR + GPT-4 (June 2023) | 2024-01-31
Model Card and Evaluations for Claude Models | - | 80.5 | Claude Instant 1.1 (5-shot) | 2023-07-11