OpenCodePapers

speech-separation-on-lrs2

Speech Separation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeSI-SNRiSDRiPESQSTOIModelNameReleaseDate
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation✓ Link16.416.6IIANet2023-08-16
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion✓ Link15.815.93.210.949TDFNet-large2024-01-25
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion✓ Link15.015.23.160.938TDFNet (MHSA + Shared)2024-01-25
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation✓ Link14.915.1RTFS-Net-122023-09-29
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation✓ Link14.614.8RTFS-Net-62023-09-29
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits✓ Link14.3CTCNet2022-12-21
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation✓ Link14.114.3RTFS-Net-42023-09-29
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion✓ Link13.613.73.100.931TDFNet-small2024-01-25