OpenCodePapers

word-sense-disambiguation-on-words-in-context

Word Sense Disambiguation
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach✓ Link85.3COSINE + Transductive Learning2020-10-15
PaLM: Scaling Language Modeling with Pathways✓ Link78.8PaLM 540B (finetuned) 2022-04-05
ST-MoE: Designing Stable and Transferable Sparse Expert Models✓ Link77.7ST-MoE-32B 269B (fine-tuned)2022-02-17
DeBERTa: Decoding-enhanced BERT with Disentangled Attention✓ Link77.5DeBERTa-Ensemble2020-06-05
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE77.4Vega v2 6B (fine-tuned)2022-12-04
UL2: Unifying Language Learning Paradigms✓ Link77.3UL2 20B (fine-tuned)2022-05-10
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE77.1Turing NLR v5 XXL 5.4B (fine-tuned)2022-12-04
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer✓ Link76.9T5-XXL 11B2019-10-23
DeBERTa: Decoding-enhanced BERT with Disentangled Attention✓ Link76.4DeBERTa-1.5B2020-06-05
ST-MoE: Designing Stable and Transferable Sparse Expert Models✓ Link74ST-MoE-L 4.1B (fine-tuned)2022-02-17
SenseBERT: Driving Some Sense into BERT72.1SenseBERT-large 340M2019-08-15
SenseBERT: Driving Some Sense into BERT70.3SenseBERT-base 110M2019-08-15
PaLM 2 Technical Report✓ Link66.8PaLM 2-L (one-shot)2023-05-17
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations65.5BERT-large 340M2018-08-28
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions✓ Link64.7FLAN-T5-Large 783M2023-04-27
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions✓ Link63.8LaMini-F-T5 783M2023-04-27
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations59.3Context2vec2018-08-28
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations58.7DeConf2018-08-28
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations58.1SW2V2018-08-28
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations57.7ElMo2018-08-28
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning✓ Link56.7T0-3B (CoT fine-tuned)2023-05-23
N-Grammer: Augmenting Transformers with latent n-grams✓ Link56.1N-Grammer 343M2022-07-13
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model✓ Link53.3AlexaTM 20B2022-08-02
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations53.1Sentence LSTM2018-08-28
Exploring the Benefits of Training Expert Language Models over Instruction Tuning✓ Link52.97RoE-3B2023-02-07
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions✓ Link52.4LaMini-GPT 1.5B2023-04-27
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models52.40KiC-770M2022-10-28
PaLM 2 Technical Report✓ Link52.0PaLM 2-M (one-shot)2023-05-17
Hungry Hungry Hippos: Towards Language Modeling with State Space Models✓ Link51.4Hybrid H3 125M (0-shot, logit scoring)2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models✓ Link51.4Hybrid H3 125M (0-shot, rank classification)2022-12-28
PaLM 2 Technical Report✓ Link50.6PaLM 2-S (one-shot)2023-05-17
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions✓ Link50.5LaMini-T5 738M2023-04-27
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners✓ Link50.42Flipped-3B2022-10-06
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions✓ Link49.8GPT-2-XL 1.5B2023-04-27
UL2: Unifying Language Learning Paradigms✓ Link49.8UL2 20B (0-shot)2022-05-10
Language Models are Few-Shot Learners✓ Link49.4GPT-3 175B (few-shot, k=32)2020-05-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models✓ Link49.1Hybrid H3 125M (3-shot, logit scoring)2022-12-28