Paper | Code | Word Error Rate (WER) | ModelName | ReleaseDate |
---|---|---|---|---|
Visual Speech Recognition for Multiple Languages in the Wild | ✓ Link | 1.2 | CTC/Attention | 2022-02-26 |
LCANet: End-to-End Lipreading with Cascaded Attention-CTC | 2.9 | LCANet | 2018-03-13 | |
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | ✓ Link | 2.9 | LipNet (with Face Cutout) | 2020-03-06 |
Lip Reading Sentences in the Wild | 3 | WAS | 2016-11-16 | |
LipNet: End-to-End Sentence-level Lipreading | ✓ Link | 4.6 | LipNet | 2016-11-05 |