OpenCodePapers
speech-recognition-on-aishell-1
Speech Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Word Error Rate (WER)
↕
Params(M)
↕
ModelName
ReleaseDate
↕
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
✓ Link
0.55
1,100
FireRedASR-AED
2025-01-24
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
0.68
Seed-ASR
2024-07-05
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
✓ Link
1.29
Qwen-Audio
2023-11-14
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
✓ Link
1.9
MMSpeech With LM
2022-11-29
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
✓ Link
1.95
220
Paraformer-large
2023-05-18
CR-CTC: Consistency regularization on CTC for improved speech recognition
✓ Link
4.02
66.2
Zipformer+CR-CTC (no external language model)
2024-10-07
Lightweight Transducer Based on Frame-Level Criterion
✓ Link
4.03
45.3
Lightweight Transducer With LM
2024-09-05
Improving Mandarin Speech Recogntion with Block-augmented Transformer
✓ Link
4.1
46
SE-WSBO With LM
2022-07-24
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
✓ Link
4.1
47
CIF-HKD With LM
2023-01-30
Lightweight Transducer Based on Frame-Level Criterion
✓ Link
4.31
45.3
Lightweight Transducer
2024-09-05
Unimodal Aggregation for CTC-based Speech Recognition
✓ Link
4.7
44.7
UMA
2023-09-15
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition
✓ Link
4.72
47
U2
2020-12-10
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
✓ Link
4.95
46.3
Paraformer
2023-05-18
BAT: Boundary aware transducer for memory-efficient and low-latency ASR
✓ Link
4.97
90
BAT
2023-05-19
CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency
✓ Link
6.34
CTC-CRF 4gram-LM
2020-05-27
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition
6.63
8.5
BRA-E
2023-03-23
A Comparative Study on Transformer vs RNN in Speech Applications
✓ Link
6.7
CTC/Att
2019-09-13
End-to-end Speech Recognition with Adaptive Computation Steps
18.7
Att
2018-08-30