An Empirical Study of Scaling Law for OCR | ✓ Link | 99.42 | CLIP4STR-L* | 2023-12-29 |
DTrOCR: Decoder-only Transformer for Optical Character Recognition | ✓ Link | 99.4 | DTrOCR 105M | 2023-08-30 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 99.0 | CLIP4STR-L (DataComp-1B) | 2023-05-23 |
Multi-Granularity Prediction for Scene Text Recognition | ✓ Link | 98.5 | MGP-STR | 2022-09-08 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 98.5 | CLIP4STR-L | 2023-05-23 |
Scene Text Recognition with Permuted Autoregressive Sequence Models | ✓ Link | 98.4±0.2 | PARSeq | 2022-07-14 |
Self-supervised Character-to-Character Distillation for Text Recognition | ✓ Link | 98.3 | CCD-ViT-Base(ARD_2.8M) | 2022-11-01 |
Self-supervised Character-to-Character Distillation for Text Recognition | ✓ Link | 98.3 | CCD-ViT-Small(ARD_2.8M) | 2022-11-01 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 98.3 | CLIP4STR-B | 2023-05-23 |
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | ✓ Link | 97.9 | MATRN | 2021-11-30 |
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition | ✓ Link | 97.8 | S-GTR | 2021-12-24 |
Self-supervised Implicit Glyph Attention for Text Recognition | ✓ Link | 97.8 | SIGA_T | 2022-03-07 |
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition | ✓ Link | 97.7 | DPAN | 2021-08-01 |
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition | ✓ Link | 97.67 | CDistNet (Ours) | 2021-11-22 |
Self-supervised Character-to-Character Distillation for Text Recognition | ✓ Link | 97.5 | CCD-ViT-Tiny(ARD_2.8M) | 2022-11-01 |
SVTR: Scene Text Recognition with a Single Visual Model | ✓ Link | 97.2 | SVTR-L (Large) | 2022-04-30 |
SVTR: Scene Text Recognition with a Single Visual Model | ✓ Link | 97.1 | SVTR-B (Base) | 2022-04-30 |
DiffusionSTR: Diffusion Model for Scene Text Recognition | | 97.1 | DiffusionSTR | 2023-06-29 |
Why You Should Try the Real Data for the Scene Text Recognition | ✓ Link | 96.8 | Yet Another Text Recognizer | 2021-07-29 |
SVTR: Scene Text Recognition with a Single Visual Model | ✓ Link | 96.3 | SVTR-T (Tiny) | 2022-04-30 |
SVTR: Scene Text Recognition with a Single Visual Model | ✓ Link | 95.7 | SVTR-S (Small) | 2022-04-30 |
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks | ✓ Link | 95.5 | SRN | 2020-03-27 |
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition | ✓ Link | 94.7 | RCEED | 2021-06-13 |
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention | ✓ Link | 94.1 | SATRN | 2019-10-10 |
Decoupled Attention Network for Text Recognition | ✓ Link | 93.9 | DAN | 2019-12-21 |
Revisiting Classification Perspective on Scene Text Recognition | ✓ Link | 93.2 | CSTR | 2021-02-22 |
TextScanner: Reading Characters in Order for Robust Scene Text Recognition | | 92.9 | TextScanner | 2019-12-28 |
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition | ✓ Link | 92.8 | SEED | 2020-05-22 |
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss | ✓ Link | 92.8 | SAFL | 2022-01-01 |
Vision Transformer for Fast and Efficient Scene Text Recognition | ✓ Link | 92.4 | ViTSTR | 2021-05-18 |
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis | ✓ Link | 92.3 | Baek et al. | 2019-04-03 |
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | ✓ Link | 91.8 | ASTER | 2018-06-25 |
Scene Text Recognition from Two-Dimensional Perspective | | 91.5 | CA-FCN | 2018-09-18 |
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition | ✓ Link | 91.0 | SAR | 2018-11-02 |
Star-net: A spatial attention residue network for scene text recognition. | ✓ Link | 89.1 | STAR-Net | 2016-09-20 |
Robust Scene Text Recognition with Automatic Rectification | ✓ Link | 88.6 | RARE | 2016-03-12 |
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition | ✓ Link | 86.7 | CRNN | 2015-07-21 |
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition | ✓ Link | 79.5 | CHAR | 2014-06-09 |