| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 99.8 | Finstreder (Conformer + AMT, character-based) | 2022-06-29 |
| UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions | | 99.8 | UniverSLU | 2023-10-04 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 99.7 | Finstreder (Quartznet + AMT) | 2022-06-29 |
| Slungt: Even Faster Spoken Language Understanding with N-Grams and Tries | ✓ Link | 99.7 | Slungt (Conformer + AMT, character-based) | 2024-02-11 |
| Two-stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding | ✓ Link | 99.7 | textual-kd-slu | 2020-10-25 |
| Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding | | 99.7 | Wav2Vec2.0-Classifier | 2021-04-15 |
| Speech-language Pre-training for End-to-end Spoken Language Understanding | | 99.7 | E2E SLP two-step | 2021-02-11 |
| Do We Still Need Automatic Speech Recognition for Spoken Language Understanding? | | 99.6 | Wav2vec 2.0 SSL | 2021-11-29 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 99.5 | Finstreder (Conformer) | 2022-06-29 |
| Exploring Transfer Learning For End-to-End Spoken Language Understanding | | 99.5 | AT-AT | 2020-12-15 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 99.4 | Finstreder (Conformer, character-based) | 2022-06-29 |
| End-to-End Spoken Language Understanding for Generalized Voice Assistants | | 99.4 | BERT, AC Pretraining | 2021-06-16 |
| Sequential End-to-End Intent and Slot Label Classification and Localization | | 99.3 | 3D-CNN+LSTM+CE | 2021-06-08 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 99.2 | Finstreder (Quartznet) | 2022-06-29 |
| Slungt: Even Faster Spoken Language Understanding with N-Grams and Tries | ✓ Link | 99.2 | Slungt (Conformer, character-based) | 2024-02-11 |
| Cross-Modal Alignment for End-to-End Spoken Language Understanding Based on Momentum Contrastive Learning | | 99.2 | CMMC | 2024-04-14 |
| Improving End-to-End Speech-to-Intent Classification with Reptile | | 99.2 | Reptile | 2020-08-05 |
| FANS: Fusing ASR and NLU for on-device SLU | | 99.0 | FANS | 2021-10-31 |
| Sequential End-to-End Intent and Slot Label Classification and Localization | | 99.0 | CTC + Pretrained ASR | 2021-06-08 |
| Speech Understanding on Tiny Devices with A Learning Cache | | 99.0 | Base | 2024-06-04 |
| Speech Model Pre-training for End-to-End Spoken Language Understanding | ✓ Link | 98.8 | Pooling classifier pre-trained using force-aligned phoneme and word labels on LibriSpeech | 2019-04-07 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 98.7 | Amazon Alexa | 2022-06-29 |
| Slungt: Even Faster Spoken Language Understanding with N-Grams and Tries | ✓ Link | 98.3 | Slungt (Conformer) | 2024-02-11 |
| SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks | | 98.2 | pGSLM+ | 2023-03-01 |