OpenCodePapers

keyword-spotting-on-google-speech-commands

Keyword Spotting
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeGoogle Speech Commands V1 12Google Speech Commands V2 12Google Speech Commands V2 2Google Speech Commands V2 20Google Speech Commands V2 35Google Speech Commands V1 2Google Speech Commands V1 20Google Speech Commands V1 35Google Speech Commands V1 610-keyword Speech Commands datasetGoogle Speech Command-Musan% Test AccuracyGoogle Speech CommandsModelNameReleaseDate
Learning Efficient Representations for Keyword Spotting with Triplet Loss✓ Link98.5698.3797.0TripletLoss-res152021-01-12
Broadcasted Residual Learning for Efficient Keyword Spotting✓ Link98.098.7BC-ResNet-82021-06-08
Wav2KWS: Transfer Learning from Speech Representations for Keyword Spotting✓ Link97.998.597.8Wav2KWS2021-05-10
Howl: A Deployed, Open-Source Wake Word Detection System✓ Link97.8res 82020-08-21
Keyword Transformer: A Self-Attention Model for Keyword Spotting✓ Link97.49 ±0.1598.56 ±0.0797.69 ±0.09KWT-32021-04-01
MatchboxNet: 1D Time-Channel Separable Convolutional Neural Network Architecture for Speech Commands Recognition✓ Link97.4897.63MatchboxNet-3x2x642020-04-21
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting✓ Link97.398.2ConvMixer2022-01-15
Keyword Transformer: A Self-Attention Model for Keyword Spotting✓ Link97.27 ±0.0898.43±0.0897.74 ±0.03KWT-22021-04-01
Keyword Transformer: A Self-Attention Model for Keyword Spotting✓ Link97.26±0.1898.08±0.1096.95±0.14KWT-12021-04-01
Streaming keyword spotting on mobile devices✓ Link97.298MHAtt-RNN2020-05-14
Neural Architecture Search For Keyword Spotting97.06NAS12020-09-01
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model✓ Link96.997.4SSAMBA2024-05-20
A neural attention model for speech command recognition✓ Link95.696.999.494.593.999.294.194.3Attention RNN2018-08-27
Hello Edge: Keyword Spotting on Microcontrollers✓ Link94.4DS-CNN2017-11-20
Hello Edge: Keyword Spotting on Microcontrollers✓ Link93.5GRU2017-11-20
Hello Edge: Keyword Spotting on Microcontrollers✓ Link92.9LSTM2017-11-20
Hello Edge: Keyword Spotting on Microcontrollers✓ Link92.0Basic LSTM2017-11-20
Hello Edge: Keyword Spotting on Microcontrollers✓ Link91.6DNN2017-11-20
Hello Edge: Keyword Spotting on Microcontrollers✓ Link84.6CNN2017-11-20
Work in Progress: Linear Transformers for TinyML98.899.1WaveFormer2024-03-25
EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting98.05EdgeCRNN 2.0×2021-03-14
Training Keyword Spotters with Limited and Synthesized Speech Data✓ Link97.7Embedding + Head2020-01-31
Training Keyword Spotters with Limited and Synthesized Speech Data✓ Link97.4Head without Embedding2020-01-31
Temporal Convolution for Real-time Keyword Spotting on Mobile Devices✓ Link96.6TC-ResNet14-1.52019-04-08
End-to-end Keyword Spotting using Neural Architecture Search and Quantization95.55End-to-end KWS model2021-04-14
MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers✓ Link95.3MicroNet-KWS-L2020-10-21
Effective Combination of DenseNet andBiLSTM for Keyword Spotting96.6DenseNet-BiLTSM2019-01-19
Multi-layer Attention Mechanism for Speech Keyword Recognition93.72LSTM2019-07-10
Towards on-Device Keyword Spotting using Low-Footprint Quaternion Neural Models✓ Link98.6098.53QNN2023-09-15
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input✓ Link98.5M2D2022-10-26
End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network✓ Link98.15EAT-S2022-04-25
AST: Audio Spectrogram Transformer✓ Link98.11Audio Spectrogram Transformer2021-04-05
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection✓ Link98.0HTS-AT2022-02-02
Attention-Free Keyword Spotting✓ Link97.56KW-MLP2021-10-14
ImportantAug: a data augmentation agent for speech✓ Link9586.7ImportantAug2021-12-14
Neural Architecture Search For Keyword Spotting97.22NAS22020-09-01
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition✓ Link95.12Quantum CNN2020-10-26
Efficient keyword spotting using time delay neural networks94.3TDNN2018-08-28
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification92.37PATE-AAE (Differentially-Private)2021-04-02
SubSpectral Normalization for Neural Audio Data Processing95.4% ±0.22res8 w/ SSN(S=4, A=Sub)2021-03-25
SubSpectral Normalization for Neural Audio Data Processing96.8% ±0.13res15 w/ SSN(S=4, A=Sub)2021-03-25
SubSpectral Normalization for Neural Audio Data Processing97.5% ±0.15res15 w/ SSN(S=4, A=Sub) (2019)2021-03-25