OpenCodePapers

common-sense-reasoning-on-rwsd

Common Sense Reasoning
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
[]()0.545Golden Transformer
[]()0.571ruRoberta-large finetune
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks0.597Random weighted2021-05-03
[]()0.636RuGPT3Large
[]()0.649RuGPT3XL few-shot
[]()0.662SBERT_Large
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark✓ Link0.662Baseline TF-IDF1.12020-10-29
[]()0.669ruT5-large-finetune
[]()0.669ruT5-base-finetune
[]()0.669ruBert-large finetune
[]()0.669ruBert-base finetune
[]()0.669YaLM 1.0B few-shot
mT5: A massively multilingual pre-trained text-to-text transformer✓ Link0.669MT5 Large2020-10-22
[]()0.669RuBERT plain
[]()0.669RuBERT conversational
[]()0.669Multilingual Bert
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks0.669heuristic majority2021-05-03
[]()0.669RuGPT3Medium
[]()0.669RuGPT3Small
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks0.669majority_class2021-05-03
[]()0.675SBERT_Large_mt_ru_finetuning
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark✓ Link0.84Human Benchmark2020-10-29