OpenCodePapers
speech-separation-on-lrs2
Speech Separation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
SI-SNRi
↕
SDRi
↕
PESQ
↕
STOI
↕
ModelName
ReleaseDate
↕
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation
✓ Link
16.4
16.6
IIANet
2023-08-16
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
✓ Link
15.8
15.9
3.21
0.949
TDFNet-large
2024-01-25
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
✓ Link
15.0
15.2
3.16
0.938
TDFNet (MHSA + Shared)
2024-01-25
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
✓ Link
14.9
15.1
RTFS-Net-12
2023-09-29
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
✓ Link
14.6
14.8
RTFS-Net-6
2023-09-29
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
✓ Link
14.3
CTCNet
2022-12-21
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
✓ Link
14.1
14.3
RTFS-Net-4
2023-09-29
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
✓ Link
13.6
13.7
3.10
0.931
TDFNet-small
2024-01-25