Paper | Code | Top-1 Accuracy (%) | Model Name | Release Date |
---|---|---|---|---|
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization | ✓ Link | 80.3 | SyncVSR (Word Boundary) | 2024-06-18 |
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization | ✓ Link | 75.1 | SyncVSR | 2024-06-18 |
Another Point of View on Visual Speech Recognition | | 62.7 | Another Point of View | 2023-08-20 |
Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading | | 60.7 | Adaptive GCN | 2021-08-16 |
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion | | 49.3 | Lip Graph Assisted | 2020-10-25 |