OpenCodePapers

Natural Language Inference on CommitmentBank

Natural Language Inference
Leaderboard
Paper | Code | Accuracy | F1 | Model Name | Release Date
PaLM: Scaling Language Modeling with Pathways | ✓ | 100 | 100 | PaLM 540B (finetuned) | 2022-04-05
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | - | 99.2 | 98.6 | Vega v2 6B (KD-based prompt transfer) | 2022-12-04
ST-MoE: Designing Stable and Transferable Sparse Expert Models | ✓ | 98.2 | - | ST-MoE-L 4.1B (fine-tuned) | 2022-02-17
ST-MoE: Designing Stable and Transferable Sparse Expert Models | ✓ | 98 | - | ST-MoE-32B 269B (fine-tuned) | 2022-02-17
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | - | 97.6 | 95.9 | Turing NLR v5 XXL 5.4B (fine-tuned) | 2022-12-04
DeBERTa: Decoding-enhanced BERT with Disentangled Attention | ✓ | 97.2 | 94.9 | DeBERTa-1.5B | 2020-06-05
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | ✓ | 96.8 | 93.9 | T5-XXL 11B (fine-tuned) | 2019-10-23
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | ✓ | 94.4 | 90.3 | T5-Large 770M (fine-tuned) | 2019-10-23
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | ✓ | 94 | 86.2 | T5-Base 220M (fine-tuned) | 2019-10-23
PaLM 2 Technical Report | ✓ | 87.5 | - | PaLM 2-L (one-shot) | 2023-05-17
PaLM 2 Technical Report | ✓ | 82.1 | - | PaLM 2-S (one-shot) | 2023-05-17
PaLM 2 Technical Report | ✓ | 80.4 | - | PaLM 2-M (one-shot) | 2023-05-17
Language Models are Few-Shot Learners | ✓ | 75.6 | - | GPT-3 175B (Few-Shot) | 2020-05-28
N-Grammer: Augmenting Transformers with latent n-grams | ✓ | 67.9 | 59.7 | N-Grammer 343M | 2022-07-13
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | ✓ | 67.9 | - | AlexaTM 20B | 2022-08-02
BloombergGPT: A Large Language Model for Finance | ✓ | 53.57 | - | Bloomberg GPT (one-shot) | 2023-03-30
BloombergGPT: A Large Language Model for Finance | ✓ | 48.21 | - | GPT-NeoX (one-shot) | 2023-03-30
BloombergGPT: A Large Language Model for Finance | ✓ | 48.21 | - | BLOOM 176B (one-shot) | 2023-03-30
BloombergGPT: A Large Language Model for Finance | ✓ | 44.64 | - | OPT 66B (one-shot) | 2023-03-30
Language Models are Few-Shot Learners | ✓ | - | 52 | GPT-3 175B (few-shot, k=32) | 2020-05-28