OpenCodePapers

speech-separation-on-lrs2

Speech Separation

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	SI-SNRi	SDRi	PESQ	STOI	ModelName	ReleaseDate
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation	✓ Link	16.4	16.6			IIANet	2023-08-16
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion	✓ Link	15.8	15.9	3.21	0.949	TDFNet-large	2024-01-25
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion	✓ Link	15.0	15.2	3.16	0.938	TDFNet (MHSA + Shared)	2024-01-25
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation	✓ Link	14.9	15.1			RTFS-Net-12	2023-09-29
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation	✓ Link	14.6	14.8			RTFS-Net-6	2023-09-29
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits	✓ Link	14.3				CTCNet	2022-12-21
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation	✓ Link	14.1	14.3			RTFS-Net-4	2023-09-29
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion	✓ Link	13.6	13.7	3.10	0.931	TDFNet-small	2024-01-25