OpenCodePapers

speech-recognition-on-swb_hub_500-wer

Speech Recognition
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodePercentage errorModelNameReleaseDate
On the limit of English conversational speech recognition6.8IBM (LSTM+Conformer encoder-decoder)2021-05-03
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard7.8IBM (LSTM encoder-decoder)2020-01-20
English Conversational Telephone Speech Recognition by Humans and Machines10.3ResNet + BiLSTMs acoustic model2017-03-06
The Microsoft 2016 Conversational Speech Recognition System11.9VGG/Resnet/LACE/BiLSTM acoustic model trained on SWB+Fisher+CH, N-gram + RNNLM language model trained on Switchboard+Fisher+Gigaword+Broadcast2016-09-12
The IBM 2016 English Conversational Telephone Speech Recognition System12.2RNN + VGG + LSTM acoustic model trained on SWB+Fisher+CH, N-gram + "model M" + NNLM language model2016-04-27
[]()13HMM-BLSTM trained with MMI + data augmentation (speed) + iVectors + 3 regularizations + Fisher
[]()13.3HMM-TDNN trained with MMI + data augmentation (speed) + iVectors + 3 regularizations + Fisher (10% / 15.1% respectively trained on SWBD only)
Deep Speech: Scaling up end-to-end speech recognition✓ Link16CNN + Bi-RNN + CTC (speech to letters), 25.9% WER if trainedonlyon SWB2014-12-17
[]()17.1HMM-TDNN + iVectors
[]()18.4HMM-DNN +sMBR
Building DNN Acoustic Models for Large Vocabulary Speech Recognition✓ Link19.1DNN + Dropout2014-06-30
[]()19.3HMM-TDNN + pNorm + speed up/down speech