OpenCodePapers

Question Answering on MultiRC

Question Answering
[Figure: Results over time]
Leaderboard
| Paper | Code | F1 | EM | Model Name | Release Date |
|---|---|---|---|---|---|
| PaLM: Scaling Language Modeling with Pathways | ✓ | 90.1 | 69.2 | PaLM 540B (finetuned) | 2022-04-05 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | ✓ | 89.6 | | ST-MoE-32B 269B (fine-tuned) | 2022-02-17 |
| Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | | 88.4 | 63 | Turing NLR v5 XXL 5.4B (fine-tuned) | 2022-12-04 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | ✓ | 88.2 | 63.7 | DeBERTa-1.5B | 2020-06-05 |
| Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | | 88.2 | 62.4 | Vega v2 6B (fine-tuned) | 2022-12-04 |
| PaLM 2 Technical Report | ✓ | 88.2 | | PaLM 2-L (one-shot) | 2023-05-17 |
| Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | ✓ | 88.1 | | T5-XXL 11B (fine-tuned) | 2019-10-23 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | ✓ | 86 | | ST-MoE-L 4.1B (fine-tuned) | 2022-02-17 |
| PaLM 2 Technical Report | ✓ | 84.1 | | PaLM 2-M (one-shot) | 2023-05-17 |
| PaLM 2 Technical Report | ✓ | 84.0 | | PaLM 2-S (one-shot) | 2023-05-17 |
| Finetuned Language Models Are Zero-Shot Learners | ✓ | 83.4 | | FLAN 137B (prompt-tuned) | 2021-09-03 |
| Finetuned Language Models Are Zero-Shot Learners | ✓ | 77.5 | | FLAN 137B (zero-shot) | 2021-09-03 |
| Language Models are Few-Shot Learners | ✓ | 75.4 | | GPT-3 175B (Few-Shot) | 2020-05-28 |
| Finetuned Language Models Are Zero-Shot Learners | ✓ | 72.1 | | FLAN 137B (1-shot) | 2021-09-03 |
| KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs | ✓ | 70.8 | 27.2 | KELM (finetuning BERT-large based single model) | 2021-09-09 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | ✓ | 70.0 | 24.1 | BERT-large (single model) | 2018-10-11 |
| Ask Me Anything: A simple strategy for prompting language models | ✓ | 63.8 | | Neo-6B (QA + WS) | 2022-10-05 |
| BloombergGPT: A Large Language Model for Finance | ✓ | 62.3 | | Bloomberg GPT 50B (1-shot) | 2023-03-30 |
| N-Grammer: Augmenting Transformers with latent n-grams | ✓ | 62 | 11.3 | N-Grammer 343M | 2022-07-13 |
| Ask Me Anything: A simple strategy for prompting language models | ✓ | 60.8 | | Neo-6B (few-shot) | 2022-10-05 |
| AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | ✓ | 59.6 | | AlexaTM 20B | 2022-08-02 |
| Ask Me Anything: A simple strategy for prompting language models | ✓ | 58.8 | | Neo-6B (QA) | 2022-10-05 |
| BloombergGPT: A Large Language Model for Finance | ✓ | 26.7 | | BLOOM 176B (1-shot) | 2023-03-30 |
| BloombergGPT: A Large Language Model for Finance | ✓ | 22.9 | | GPT-NeoX 20B (1-shot) | 2023-03-30 |
| BloombergGPT: A Large Language Model for Finance | ✓ | 18.8 | | OPT 66B (1-shot) | 2023-03-30 |
| Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | ✓ | | 63.3 | T5-11B | 2019-10-23 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | ✓ | | 59.7 | Hybrid H3 355M (3-shot, logit scoring) | 2022-12-28 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | ✓ | | 59.5 | Hybrid H3 355M (0-shot, logit scoring) | 2022-12-28 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | ✓ | | 51.4 | Hybrid H3 125M (0-shot, logit scoring) | 2022-12-28 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | ✓ | | 48.9 | Hybrid H3 125M (3-shot, logit scoring) | 2022-12-28 |
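
The leaderboard reports F1 and EM without defining them. As a rough sketch, assuming the common MultiRC convention (F1 computed over all individual answer options, and EM counting a question as correct only when every one of its answer options is labeled correctly), the two scores could be computed as below. The function name `multirc_scores` and the input layout are hypothetical, chosen for illustration only; individual papers may use a different F1 variant.

```python
from collections import defaultdict

def multirc_scores(examples):
    """Compute (F1, EM) as percentages.

    `examples` is an iterable of (question_id, gold_label, predicted_label),
    one entry per answer option, with labels 1 (correct answer) or 0.
    This layout is an assumption made for this sketch.
    """
    tp = fp = fn = 0
    per_question = defaultdict(list)
    for qid, gold, pred in examples:
        # Binary F1 counts are pooled over all answer options.
        tp += int(gold == 1 and pred == 1)
        fp += int(gold == 0 and pred == 1)
        fn += int(gold == 1 and pred == 0)
        per_question[qid].append(gold == pred)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)

    # A question counts toward EM only if all of its options match.
    em = (sum(all(flags) for flags in per_question.values()) / len(per_question)
          if per_question else 0.0)
    return 100 * f1, 100 * em
```

Under this definition EM is much stricter than F1, which is consistent with the large gap between the two columns in rows that report both.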