Common Sense Reasoning on RWSD

Leaderboard

Paper	Code	Accuracy	Model name	Release date
-	-	0.545	Golden Transformer	-
-	-	0.571	ruRoberta-large finetune	-
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks	-	0.597	Random weighted	2021-05-03
-	-	0.636	RuGPT3Large	-
-	-	0.649	RuGPT3XL few-shot	-
-	-	0.662	SBERT_Large	-
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark	✓	0.662	Baseline TF-IDF1.1	2020-10-29
-	-	0.669	ruT5-large-finetune	-
-	-	0.669	ruT5-base-finetune	-
-	-	0.669	ruBert-large finetune	-
-	-	0.669	ruBert-base finetune	-
-	-	0.669	YaLM 1.0B few-shot	-
mT5: A massively multilingual pre-trained text-to-text transformer	✓	0.669	MT5 Large	2020-10-22
-	-	0.669	RuBERT plain	-
-	-	0.669	RuBERT conversational	-
-	-	0.669	Multilingual Bert	-
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks	-	0.669	heuristic majority	2021-05-03
-	-	0.669	RuGPT3Medium	-
-	-	0.669	RuGPT3Small	-
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks	-	0.669	majority_class	2021-05-03
-	-	0.675	SBERT_Large_mt_ru_finetuning	-
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark	✓	0.84	Human Benchmark	2020-10-29
