Paper | Code | Top-1 Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|
Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | ✓ Link | 98.81 | AVCRFormer | 2024-05-09 |
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices | 98.76 | 2DCNN + BiLSTM + ResNet + MLF | 2023-02-17 | |
Part-based Lipreading for Audio-Visual Speech Recognition | 98.3 | PBL | 2020-12-14 |