OpenCodePapers

speech-recognition-on-common-voice-german

Speech Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeTest WERTest CERModelNameReleaseDate
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction✓ Link3.64%1.54%wav2vec 2.0 XLS-R 1B + TEVR (5-gram)2022-06-25
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction✓ Link3.70%wav2vec 2.0 XLS-R 1B + TEVR (4-gram)2022-06-25
Scribosermo: Fast Speech-to-Text models for German and other Languages✓ Link4.05%1.37%ConformerCTC-L (5-gram)2021-10-15
NeMo: a toolkit for building AI applications using Neural Modules✓ Link4.09%canary-1b-flash2025-03-07
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction✓ Link4.38%1.62%wav2vec 2.0 XLS-R 1B (5-gram)2022-06-25
NeMo: a toolkit for building AI applications using Neural Modules✓ Link6.03%ConformerCTC-L (4-gram)2019-09-14
Automatic Speech Recognition in German: A Detailed Error Analysis6.28%Conformer Transducer (no LM)2022-08-03
Robust Speech Recognition via Large-Scale Weak Supervision✓ Link6.4%Whisper (Large v2)2022-12-06
Scribosermo: Fast Speech-to-Text models for German and other Languages✓ Link6.6%2.7%QuartzNet15x5DE (D37, 5-gram)2021-10-15
NeMo: a toolkit for building AI applications using Neural Modules✓ Link6.68%ConformerCTC-L (no LM)2019-09-14
Scribosermo: Fast Speech-to-Text models for German and other Languages✓ Link7.33%2.05%ConformerCTC-L (no LM)2021-10-15
Scribosermo: Fast Speech-to-Text models for German and other Languages✓ Link7.7%3.2%QuartzNet15x5DE (CV-only, 5-gram)2021-10-15
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation✓ Link7.8%VoxPopuli (n-gram)2021-01-02
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction✓ Link10.10%wav2vec 2.0 XLS-R 1B + TEVR (no LM)2022-06-25
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction✓ Link12.06%wav2vec 2.0 XLS-R (no LM)2022-06-25