OpenCodePapers

Speaker Identification on VoxCeleb1

Speaker Identification

Leaderboard

Paper	Code	Top-1 (%)	Top-5 (%)	Number of Params	Accuracy (%)	Model Name	Release Date
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework	✓ Link	96.6			96.6	MSM-MAE	2024-04-09
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework	✓ Link	96.5			96.5	M2D/0.6	2024-04-09
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework	✓ Link	96.3			96.3	M2D/0.7	2024-04-09
Masked Autoencoders that Listen	✓ Link	94.8			94.8	AudioMAE (local)	2022-07-13
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input	✓ Link	94.8			94.8	M2D ratio=0.6	2022-10-26
ATST: Audio Representation Learning with Teacher-Student Transformer	✓ Link	94.3			94.3	ATST Base (ours)	2022-04-26
Masked Autoencoders that Listen	✓ Link	94.1			94.1	AudioMAE (global)	2022-07-13
AutoSpeech: Neural Architecture Search for Speaker Recognition	✓ Link	87.66	96.01	18M	87.66	AutoSpeech (N=8,C=128)	2020-05-07
SSAST: Self-Supervised Audio Spectrogram Transformer	✓ Link	80.8			80.8	SSAST-FRAME	2021-10-19
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model	✓ Link	70.1		99M	70.1	SSAMBA	2024-05-20
SSAST: Self-Supervised Audio Spectrogram Transformer	✓ Link	64.2			64.2	SSAST-PATCH	2021-10-19
Contrastive Learning of General-Purpose Audio Representations	✓ Link	37.7			37.7	COLA	2020-10-21