Paper | Code | 1:1 Accuracy | Model Name | Release Date |
---|---|---|---|---|
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 90.9 | CLIP4STR-H (DFN-5B) | 2023-05-23 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 90.6 | CLIP4STR-L (DataComp-1B) | 2023-05-23 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 88.8 | CLIP4STR-L | 2023-05-23 |
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | ✓ Link | 87.0 | CLIP4STR-B | 2023-05-23 |
Self-supervised Character-to-Character Distillation for Text Recognition | ✓ Link | 86.0 | CCD-ViT-Base | 2022-11-01 |