OpenCodePapers

language-modelling-on-penn-treebank-character

Language Modelling
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeBit per Character (BPC)Number of paramsModelNameReleaseDate
Mogrifier LSTM✓ Link1.08324MMogrifier LSTM + dynamic eval 2019-09-04
Mogrifier LSTM✓ Link1.12024MMogrifier LSTM2019-09-04
Recurrent Highway Networks with Grouped Auxiliary Memory✓ Link1.14716.0MGAM-RHN-52019-12-13
Trellis Networks for Sequence Modeling✓ Link1.15813.4MTrellis Network2018-10-15
Addressing Some Limitations of Transformers with Feedback Memory✓ Link1.16010.7MFeedback Transformer2020-02-21
Improved Language Modeling by Decoding the Past1.16913.8MPast Decode Reg. + AWD-LSTM-MoS + dyn. eval.2018-08-14
An Analysis of Neural Language Modeling at Multiple Scales✓ Link1.17513.8M3-layer AWD-LSTM2018-03-22
Deep Independently Recurrent Neural Network (IndRNN)✓ Link1.18Dense IndRNN2019-10-11
An Analysis of Neural Language Modeling at Multiple Scales✓ Link1.18713.8M6-layer QRNN2018-03-22
Fast-Slow Recurrent Neural Networks✓ Link1.19027MFS-LSTM-42017-05-24
Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN✓ Link1.19IndRNN2018-03-13
Fast-Slow Recurrent Neural Networks✓ Link1.19327MFS-LSTM-22017-05-24
Neural Architecture Search with Reinforcement Learning✓ Link1.21416.3MNAS-RL2016-11-05
HyperNetworks✓ Link1.21914.4M2-layer Norm HyperLSTM2016-09-27
R-Transformer: Recurrent Neural Network Enhanced Transformer✓ Link1.24R-Transformer2019-07-12
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling✓ Link1.35.9MSeq-U-Net2019-11-14
Gating Revisited: Deep Multi-layer RNNs That Can Be Trained✓ Link1.30STAR2019-11-25
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling✓ Link1.315.9MTCN2019-11-14
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling✓ Link1.31Temporal Convolutional Network2018-03-04
Discrete Flows: Invertible Generative Models of Discrete Data✓ Link1.38Bipartite Flow2019-05-24