OpenCodePapers

speaker-identification-on-voxceleb1

Speaker Identification
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeTop-1 (%)Top-5 (%)Number of ParamsAccuracyModelNameReleaseDate
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework✓ Link96.696.6MSM-MAE2024-04-09
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework✓ Link96.596.5M2D/0.62024-04-09
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework✓ Link96.396.3M2D/0.72024-04-09
Masked Autoencoders that Listen✓ Link94.894.8AudioMAE (local)2022-07-13
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input✓ Link94.894.8M2D ratio=0.62022-10-26
ATST: Audio Representation Learning with Teacher-Student Transformer✓ Link94.394.3ATST Base (ours)2022-04-26
Masked Autoencoders that Listen✓ Link94.194.1AudioMAE (global)2022-07-13
AutoSpeech: Neural Architecture Search for Speaker Recognition✓ Link87.6696.0118M87.66AutoSpeech (N=8,C=128)2020-05-07
SSAST: Self-Supervised Audio Spectrogram Transformer✓ Link80.880.8SSAST-FRAME2021-10-19
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model✓ Link70.199M70.1SSAMBA2024-05-20
SSAST: Self-Supervised Audio Spectrogram Transformer✓ Link64.264.2SSAST-PATCH2021-10-19
Contrastive Learning of General-Purpose Audio Representations✓ Link37.737.7COLA2020-10-21