OpenCodePapers

reading-comprehension-on-race

Reading Comprehension
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyAccuracy (Middle)Accuracy (High)ModelNameReleaseDate
Improving Machine Reading Comprehension with Single-choice Decision and Transfer Learning91.4ALBERT (Ensemble)2020-11-06
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism✓ Link90.993.190.0Megatron-BERT (ensemble)2019-09-17
DUMA: Reading Comprehension with Transposition Thinking✓ Link89.888.792.6ALBERTxxlarge+DUMA(ensemble)2020-01-26
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism✓ Link89.591.888.6Megatron-BERT2019-09-17
DeBERTa: Decoding-enhanced BERT with Disentangled Attention✓ Link86.8DeBERTalarge2020-06-05
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing✓ Link85.788.884.4B10-10-102020-06-05
RoBERTa: A Robustly Optimized BERT Pretraining Approach✓ Link83.286.581.3RoBERTa2019-07-26
Orca 2: Teaching Small Language Models How to Reason82.87Orca 2-13B2023-11-18
Orca 2: Teaching Small Language Models How to Reason80.79Orca 2-7B2023-11-18
Hierarchical Learning for Generation with Long Source Sequences67.3HAT (Encoder)2021-04-15
XLNet: Generalized Autoregressive Pretraining for Language Understanding✓ Link88.684.0XLNet2019-06-19
PaLM: Scaling Language Modeling with Pathways✓ Link68.149.1PaLM 540B (zero-shot)2022-04-05
LLaMA: Open and Efficient Foundation Language Models✓ Link67.951.6LLaMA 65B (zero-shot)2023-02-27
PaLM: Scaling Language Modeling with Pathways✓ Link64.347.5PaLM 62B (zero-shot)2022-04-05
LLaMA: Open and Efficient Foundation Language Models✓ Link64.148.3LLaMA 33B (zero-shot)2023-02-27
LLaMA: Open and Efficient Foundation Language Models✓ Link61.647.2LLaMA 13B (zero-shot)2023-02-27
LLaMA: Open and Efficient Foundation Language Models✓ Link61.146.9LLaMA 7B (zero-shot)2023-02-27
Language Models are Few-Shot Learners✓ Link58.4GPT-3 175B (0-shot)2020-05-28
PaLM: Scaling Language Modeling with Pathways✓ Link57.942.3PaLM 8B (zero-shot)2022-04-05
BloombergGPT: A Large Language Model for Finance✓ Link54.3241.74Bloomberg GPT (one-shot)2023-03-30
BloombergGPT: A Large Language Model for Finance✓ Link52.339.14BLOOM 176B (one-shot)2023-03-30
BloombergGPT: A Large Language Model for Finance✓ Link47.4237.02OPT 66B (one-shot)2023-03-30
BloombergGPT: A Large Language Model for Finance✓ Link41.2334.33GPT-NeoX (one-shot)2023-03-30
Language Models are Few-Shot Learners✓ Link45.5GPT-3 175B (zero-shot)2020-05-28