OpenCodePapers

scene-text-recognition-on-svtp

Scene Text Recognition
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
DTrOCR: Decoder-only Transformer for Optical Character Recognition✓ Link98.6DTrOCR 105M2023-08-30
Multi-Granularity Prediction for Scene Text Recognition✓ Link98.3MGP-STR2022-09-08
An Empirical Study of Scaling Law for OCR✓ Link98.13CLIP4STR-L*2023-12-29
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link98.1CLIP4STR-L (DataComp-1B)2023-05-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link97.4CLIP4STR-L2023-05-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link97.2CLIP4STR-B2023-05-23
Context Perception Parallel Decoder for Scene Text Recognition✓ Link96.7CPPD2023-07-23
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link96.1CCD-ViT-Base2022-11-01
Scene Text Recognition with Permuted Autoregressive Sequence Models✓ Link95.7±0.9PARSeq2022-07-14
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link92.7CCD-ViT-Small2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link91.6CCD-ViT-Tiny2022-11-01
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition✓ Link90.6S-GTR2021-12-24
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features✓ Link90.6MATRN2021-11-30
Self-supervised Implicit Glyph Attention for Text Recognition✓ Link90.5SIGA_T2022-03-07
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition✓ Link89.77CDistNet (Ours)2021-11-22
DiffusionSTR: Diffusion Model for Scene Text Recognition89.2DiffusionSTR2023-06-29
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition✓ Link89.0DPAN2021-08-01