OpenCodePapers

Dialogue Evaluation on USR-TopicalChat

Dialogue Evaluation

Leaderboard

Paper	Code	Spearman Correlation	Pearson Correlation	Model Name	Release Date
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation	✓ Link	0.5109	0.4575	MDD-Eval	2021-12-14
Proxy Indicators for the Quality of Open-domain Dialogues	✓ Link	0.4877	0.4974	Lin-Reg (all)
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation	✓ Link	0.4192	0.4220	USR	2020-05-01
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation	✓ Link	0.3245	0.4068	USR - DR (x = c)	2020-05-01
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation	✓ Link	0.3086	0.3345	USR - MLM	2020-05-01
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation	✓ Link	0.1419	0.3221	USR - DR (x = f)	2020-05-01
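
The Spearman and Pearson columns above report how closely each automatic metric's scores track human quality ratings. As a minimal sketch of how such correlations can be computed, the following uses only the Python standard library. The function names and the sample scores are hypothetical and chosen for illustration; they are not taken from any of the listed papers, and the actual evaluation protocols behind the leaderboard may differ (for example in how ratings are aggregated across annotators).

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Assign 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical data: one automatic metric score and one human rating per response.
metric_scores = [0.10, 0.40, 0.35, 0.80, 0.70]
human_ratings = [1, 2, 3, 5, 4]

print("Spearman:", round(spearman(metric_scores, human_ratings), 4))
print("Pearson: ", round(pearson(metric_scores, human_ratings), 4))
```

Spearman depends only on the ordering of the scores, while Pearson is sensitive to their linear scale, which is one reason the two columns can rank systems differently.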