Paper | Code | Accuracy (%) | ModelName | ReleaseDate |
---|---|---|---|---|
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations | ✓ Link | 90.07 | TDT 0-8 | 2023-04-13 |
A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding | 87.51 | Partially Fine-tuned HuBERT | 2021-11-04 | |
SLURP: A Spoken Language Understanding Resource Package | ✓ Link | 78.33 | Multi-SLURP | 2020-11-26 |
Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 53.11 | Finstreder (Conformer) | 2022-06-29 |
Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | ✓ Link | 43.15 | Finstreder (Quartznet) | 2022-06-29 |