Paper | Code | F1 | Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|---|
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | ✓ Link | 97.81 | LayoutLMv2LARGE (Excluding OCR mismatch) | 2020-12-29 | |
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | ✓ Link | 96.97 | RORE (GeoLayoutLM) | 2024-09-29 | |
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | ✓ Link | 96.61 | LayoutLMv2LARGE | 2020-12-29 | |
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | ✓ Link | 96.25 | LayoutLMv2BASE | 2020-12-29 | |
LAPDoc: Layout-Aware Prompting for Documents | 77.0 | ChatGPT 3.5 SpatialFormat | 2024-02-15 |