scene-text-recognition-on-svt

Scene Text Recognition

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Accuracy	ModelName	ReleaseDate
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	✓ Link	99.1	CLIP4STR-H (DFN-5B)	2023-05-23
DTrOCR: Decoder-only Transformer for Optical Character Recognition	✓ Link	98.9	DTrOCR 105M	2023-08-30
An Empirical Study of Scaling Law for OCR	✓ Link	98.76	CLIP4STR-B*	2023-12-29
Multi-Granularity Prediction for Scene Text Recognition	✓ Link	98.6	MGP-STR	2022-09-08
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	✓ Link	98.6	CLIP4STR-L (DataComp-1B)	2023-05-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	✓ Link	98.5	CLIP4STR-L	2023-05-23
Context Perception Parallel Decoder for Scene Text Recognition	✓ Link	98.5	CPPD	2023-07-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	✓ Link	98.3	CLIP4STR-B	2023-05-23
Scene Text Recognition with Permuted Autoregressive Sequence Models	✓ Link	97.9±0.2	PARSeq	2022-07-14
Self-supervised Character-to-Character Distillation for Text Recognition	✓ Link	97.8	CCD-ViT-Base(ARD_2.8M)	2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition	✓ Link	96.4	CCD-ViT-Small(ARD_2.8M)	2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition	✓ Link	96.0	CCD-ViT-Tiny(ARD_2.8M)	2022-11-01
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition	✓ Link	95.8	S-GTR	2021-12-24
Self-supervised Implicit Glyph Attention for Text Recognition	✓ Link	95.1	SIGA_T	2022-03-07
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features	✓ Link	95	MATRN	2021-11-30
Why You Should Try the Real Data for the Scene Text Recognition	✓ Link	94.7	Yet Another Text Recognizer	2021-07-29
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition	✓ Link	94.6	NRTR+TPS++	2023-05-09
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition	✓ Link	93.9	DPAN	2021-08-01
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition	✓ Link	93.82	CDistNet (Ours)	2021-11-22
DiffusionSTR: Diffusion Model for Scene Text Recognition		93.6	DiffusionSTR	2023-06-29
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition	✓ Link	91.8	RCEED	2021-06-13
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks	✓ Link	91.5	SRN	2020-03-27
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention	✓ Link	91.3	SATRN	2019-10-10
Revisiting Classification Perspective on Scene Text Recognition	✓ Link	90.6	CSTR	2021-02-22
TextScanner: Reading Characters in Order for Robust Scene Text Recognition		90.1	TextScanner	2019-12-28
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition	✓ Link	89.6	SEED	2020-05-22
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification	✓ Link	89.5	ASTER	2018-06-25
Decoupled Attention Network for Text Recognition	✓ Link	89.2	DAN	2019-12-21
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss	✓ Link	88.6	SAFL	2022-01-01
Vision Transformer for Fast and Efficient Scene Text Recognition	✓ Link	87.7	ViTSTR	2021-05-18
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis	✓ Link	87.5	Baek et al.	2019-04-03
Scene Text Recognition from Two-Dimensional Perspective		86.4	CA-FCN	2018-09-18
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition	✓ Link	84.5	SAR	2018-11-02
Star-net: A spatial attention residue network for scene text recognition.	✓ Link	83.6	STAR-Net	2016-09-20
Robust Scene Text Recognition with Automatic Rectification	✓ Link	81.9	RARE	2016-03-12
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition	✓ Link	80.8	CRNN	2015-07-21
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition	✓ Link	68.0	CHAR	2014-06-09

OpenCodePapers

scene-text-recognition-on-svt