OpenCodePapers

scene-text-recognition-on-iiit5k

Scene Text Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link99.6CLIP4STR-L (DataComp-1B)2023-05-23
DTrOCR: Decoder-only Transformer for Optical Character Recognition✓ Link99.6DTrOCR 105M2023-08-30
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link99.5CLIP4STR-L2023-05-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link99.5CLIP4STR-B (DataComp-1B)2023-05-23
Context Perception Parallel Decoder for Scene Text Recognition✓ Link99.3CPPD2023-07-23
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model✓ Link99.2CLIP4STR-B2023-05-23
Scene Text Recognition with Permuted Autoregressive Sequence Models✓ Link99.1±0.1PARSeq2022-07-14
Multi-Granularity Prediction for Scene Text Recognition✓ Link98.8MGP-STR2022-09-08
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link98.0CCD-ViT-Small(ARD_2.8M)2022-11-01
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link98.0CCD-ViT-Base(ARD_2.8M)2022-11-01
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition✓ Link97.5S-GTR2021-12-24
DiffusionSTR: Diffusion Model for Scene Text Recognition97.3DiffusionSTR2023-06-29
Self-supervised Character-to-Character Distillation for Text Recognition✓ Link97.1CCD-ViT-Tiny(ARD_2.8M)2022-11-01
Self-supervised Implicit Glyph Attention for Text Recognition✓ Link96.9SIGA_S2022-03-07
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features✓ Link96.6MATRN2021-11-30
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition✓ Link96.57CDistNet (Ours)2021-11-22
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition✓ Link96.2DPAN2021-08-01