| Paper | Code | MRPC | SICK-R | SICK-E | STS | Model Name | Release Date |
|---|---|---|---|---|---|---|---|
| Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning | ✓ Link | 78.6/84.4 | 0.888 | 87.8 | 78.9/78.6 | GenSen | 2018-03-30 |
| Supervised Learning of Universal Sentence Representations from Natural Language Inference Data | ✓ Link | 76.2/83.1 | 0.884 | 86.3 | 75.8/75.5 | InferSent | 2017-05-05 |
| Discriminative Improvements to Distributional Sentence Similarity | - | 80.4/85.9 | - | - | - | TF-KLD | 2013-10-01 |
| Training Complex Models with Multi-Task Weak Supervision | ✓ Link | 91.5/88.5 | - | - | 90.1/89.7* | Snorkel MeTaL (ensemble) | 2018-10-05 |
| Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding | ✓ Link | 92.7/90.3 | - | - | 91.1/90.7* | MT-DNN-ensemble | 2019-04-20 |
| XLNet: Generalized Autoregressive Pretraining for Language Understanding | ✓ Link | 93.0/90.7 | - | - | 91.6/91.1* | XLNet-Large | 2019-06-19 |