OpenCodePapers

Long-range modeling on SCROLLS

[Figure: Results over time]
Leaderboard
| Paper | Code | Avg. | GovRep | SumScr | QMSum | Qspr | Nrtv | QALT EM-T/H | CNLI | ModelName | ReleaseDate |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CoLT5: Faster Long-Range Transformers with Conditional Computation | | 43.51 | 61.3 / 32.2 / 33.8 | 36.4 / 10.2 / 21.7 | 36.2 / 12.9 / 24.3 | 53.9 | 31.1 | 48.1 / 43.8 | 88.4 | CoLT5 XL | 2023-03-17 |
| LongT5: Efficient Text-To-Text Transformer for Long Sequences | ✓ | 42.53 | 61.1 / 32.3 / 33.7 | 35.8 / 9.6 / 21.1 | 34.9 / 11.8 / 23.5 | 53.1 | 29.3 | 46.0 / 42.1 | 88.2 | LongT5 XL | 2021-12-15 |
| LongT5: Efficient Text-To-Text Transformer for Long Sequences | ✓ | 41.03 | 61.3 / 32.2 / 33.8 | 60.3 / 31.1 / 32.8 | 35.1 / 12.0 / 23.3 | 52.3 | 27.2 | 40.6 / 38.6 | 87.3 | LongT5 Large | 2021-12-15 |
| Adapting Pretrained Text-to-Text Models for Long Text Sequences | ✓ | 39.76 | 59.4 / 29.8 / 30.8 | 37.7 / 10.2 / 21.5 | 35.1 / 11.0 / 22.0 | 48.7 | 26.2 | 37.8 / 34.0 | 87.1 | BART-LS | 2022-09-21 |
| LongT5: Efficient Text-To-Text Transformer for Long Sequences | ✓ | 38.6 | 57.7 / 30.0 / 31.4 | 34.8 / 9.6 / 21.1 | 33.9 / 11.0 / 22.8 | 46.6 | 23.0 | 37.9 / 36.6 | 85.6 | LongT5 Base | 2021-12-15 |
| Efficient Long-Text Understanding with Short-Text Models | ✓ | 37.99 | 57.5 / 26.3 / 27.4 | 35.2 / 8.7 / 19.4 | 34.2 / 11.0 / 22.0 | 46.9 | 24.1 | 34.8 / 34.8 | 87.3 | BART-large SLED | 2022-08-01 |
| UL2: Unifying Language Learning Paradigms | ✓ | 37.87 | 53.6 / 26.1 / 28.8 | 32.9 / 7.8 / 19.4 | 31.1 / 8.5 / 20.4 | 37.6 | 24.2 | 45.8 / 40.7 | | UL2 | 2022-05-10 |
| SCROLLS: Standardized CompaRison Over Long Language Sequences | ✓ | 29.16 | 56.2 / 26.6 / 28.8 | 24.2 / 4.5 / 15.4 | 25.1 / 6.7 / 18.8 | 26.6 | 18.5 | 25.8 / 25.4 | 71.5 | LED Base | 2022-01-10 |
| SCROLLS: Standardized CompaRison Over Long Language Sequences | ✓ | 29.01 | 47.9 / 18.6 / 22.7 | 27.2 / 4.9 / 16.7 | 30.2 / 8.7 / 20.7 | 26.3 | 15.4 | 26.0 / 25.9 | 77.4 | BART Base | 2022-01-10 |
| SCROLLS: Standardized CompaRison Over Long Language Sequences | ✓ | 19.35 | 45.3 / 17.9 / 20.8 | 19.6 / 1.8 / 11.0 | 14.2 / 2.0 / 9.3 | 3.4 | 1.5 | 25.2 / 26.1 | 66 | Naive | 2022-01-10 |
| Investigating Efficiently Extending Transformers for Long Input Summarization | ✓ | | 60.3 / 30.0 / 31.5 | 35.7 / 9.1 / 20.6 | 33.2 / 9.6 / 21.6 | | | | | PEGASUS-X | 2022-08-08 |
| Investigating Efficiently Extending Transformers for Long Input Summarization | ✓ | | 59.3 / 29.3 / 30.9 | 35.0 / 8.9 / 20.4 | 32.9 / 9.8 / 21.4 | | | | | PEGASUS-X-Base | 2022-08-08 |
| UL2: Unifying Language Learning Paradigms | ✓ | | | | | | | | 88.7 | UL2 20B | 2022-05-10 |
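
The Avg. column can be reproduced from the per-task scores. The sketch below assumes the usual SCROLLS aggregation: each summarization task (GovRep, SumScr, QMSum) contributes the geometric mean of its three ROUGE scores, QALT contributes only EM-T, and the seven per-task scores are then averaged with equal weight. The function names `geometric_mean` and `scrolls_average` are illustrative and do not come from any official scoring script.

```python
from math import prod


def geometric_mean(values):
    """Geometric mean of a sequence of positive scores."""
    return prod(values) ** (1.0 / len(values))


def scrolls_average(rouge_tasks, scalar_scores):
    """Equal-weight mean over per-task scores.

    rouge_tasks: (ROUGE-1, ROUGE-2, ROUGE-L) triples, one per summarization task.
    scalar_scores: single-number scores (Qspr, Nrtv, QALT EM-T, CNLI).
    Assumption: ROUGE triples are collapsed with a geometric mean.
    """
    per_task = [geometric_mean(t) for t in rouge_tasks] + list(scalar_scores)
    return sum(per_task) / len(per_task)


# CoLT5 XL row from the leaderboard above.
colt5_xl_rouge = [
    (61.3, 32.2, 33.8),  # GovRep
    (36.4, 10.2, 21.7),  # SumScr
    (36.2, 12.9, 24.3),  # QMSum
]
colt5_xl_scalars = [53.9, 31.1, 48.1, 88.4]  # Qspr, Nrtv, QALT EM-T, CNLI

avg = scrolls_average(colt5_xl_rouge, colt5_xl_scalars)
print(round(avg, 2))  # approximately 43.51, in line with the Avg. column
```

Rows with missing task scores (PEGASUS-X, PEGASUS-X-Base, UL2 20B) have no Avg. entry, which is consistent with an average that requires all seven tasks.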