| A Hybrid Neural Network Model for Commonsense Reasoning | ✓ Link | 90.0 | HNN | 2019-07-27 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | ✓ Link | 78.3 | BERT-large 340M | 2018-10-11 |
| Unsupervised Deep Structured Semantic Models for Commonsense Reasoning | | 78.3 | UDSSM-II (ensemble) | 2019-04-03 |
| Unsupervised Deep Structured Semantic Models for Commonsense Reasoning | | 76.7 | UDSSM-I (ensemble) | 2019-04-03 |
| Unsupervised Deep Structured Semantic Models for Commonsense Reasoning | | 75.0 | DSSM | 2019-04-03 |
| Unsupervised Deep Structured Semantic Models for Commonsense Reasoning | | 75.0 | UDSSM-II | 2019-04-03 |
| Attention Is (not) All You Need for Commonsense Reasoning | ✓ Link | 68.3 | BERT-base 110M + MAS | 2019-05-31 |
| Attention Is (not) All You Need for Commonsense Reasoning | ✓ Link | 66.7 | USSM + Supervised Deepnet + 3 Knowledge Bases | 2019-05-31 |
| A Simple Method for Commonsense Reasoning | ✓ Link | 60.0 | Word-level CNN+LSTM (full scoring) | 2018-06-07 |
| Attention Is All You Need | ✓ Link | 58.3 | Subword-level Transformer LM | 2017-06-12 |
| Probabilistic Reasoning via Deep Learning: Neural Association Models | | 55.0 | USSM + Cause-Effect Knowledge Base | 2016-03-24 |
| A Simple Method for Commonsense Reasoning | ✓ Link | 53.3 | Word-level CNN+LSTM (partial scoring) | 2018-06-07 |
| Attention Is (not) All You Need for Commonsense Reasoning | ✓ Link | 53.3 | USSM + Supervised Deepnet | 2019-05-31 |