OpenCodePapers

scene-text-recognition-on-svt

Scene Text Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link99.1CLIP4STR-H (DFN-5B)2023-05-23
DTrOCR: Decoder-only Transformer for Optical Character Recognition✓ Link98.9DTrOCR 105M2023-08-30
An Empirical Study of Scaling Law for OCR✓ Link98.76CLIP4STR-B*2023-12-29
Multi-Granularity Prediction for Scene Text Recognition✓ Link98.6MGP-STR2022-09-08
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link98.6CLIP4STR-L (DataComp-1B)2023-05-23
Context Perception Parallel Decoder for Scene Text Recognition✓ Link98.5CPPD2023-07-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link98.5CLIP4STR-L2023-05-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link98.3CLIP4STR-B2023-05-23
Scene Text Recognition with Permuted Autoregressive Sequence Models✓ Link97.9±0.2PARSeq2022-07-14
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link97.8CCD-ViT-Base(ARD_2.8M)2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link96.4CCD-ViT-Small(ARD_2.8M)2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link96.0CCD-ViT-Tiny(ARD_2.8M)2022-11-01
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition✓ Link95.8S-GTR2021-12-24
Self-supervised Implicit Glyph Attention for Text Recognition✓ Link95.1SIGA_T2022-03-07
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features✓ Link95MATRN2021-11-30
Why You Should Try the Real Data for the Scene Text Recognition✓ Link94.7Yet Another Text Recognizer2021-07-29
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition✓ Link94.6NRTR+TPS++2023-05-09
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition✓ Link93.9DPAN2021-08-01
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition✓ Link93.82CDistNet (Ours)2021-11-22
DiffusionSTR: Diffusion Model for Scene Text Recognition93.6DiffusionSTR2023-06-29
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition✓ Link91.8RCEED2021-06-13
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks✓ Link91.5SRN2020-03-27
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention✓ Link91.3SATRN2019-10-10
Revisiting Classification Perspective on Scene Text Recognition✓ Link90.6CSTR2021-02-22
TextScanner: Reading Characters in Order for Robust Scene Text Recognition90.1TextScanner2019-12-28
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition✓ Link89.6SEED2020-05-22
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification✓ Link89.5ASTER2018-06-25
Decoupled Attention Network for Text Recognition✓ Link89.2DAN2019-12-21
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss✓ Link88.6SAFL2022-01-01
Vision Transformer for Fast and Efficient Scene Text Recognition✓ Link87.7ViTSTR2021-05-18
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis✓ Link87.5Baek et al.2019-04-03
Scene Text Recognition from Two-Dimensional Perspective86.4CA-FCN2018-09-18
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition✓ Link84.5SAR2018-11-02
Star-net: A spatial attention residue network for scene text recognition.✓ Link83.6STAR-Net2016-09-20
Robust Scene Text Recognition with Automatic Rectification✓ Link81.9RARE2016-03-12
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition✓ Link80.8CRNN2015-07-21
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition✓ Link68.0CHAR2014-06-09