Paper | Code | Spearman Correlation | Pearson Correlation | Model Name | Release Date |
---|---|---|---|---|---|
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation | ✓ Link | 0.5109 | 0.4575 | MDD-Eval | 2021-12-14 |
Proxy Indicators for the Quality of Open-domain Dialogues | ✓ Link | 0.4877 | 0.4974 | Lin-Reg (all) | |
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | ✓ Link | 0.4192 | 0.4220 | USR | 2020-05-01 |
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | ✓ Link | 0.3245 | 0.4068 | USR - DR (x = c) | 2020-05-01 |
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | ✓ Link | 0.3086 | 0.3345 | USR - MLM | 2020-05-01 |
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | ✓ Link | 0.1419 | 0.3221 | USR - DR (x = f) | 2020-05-01 |