OpenCodePapers

automatic-lyrics-transcription-on-jam-alt

Speech RecognitionAutomatic Lyrics Transcription
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeWord Error Rate (WER)Case-Sensitive Word Error RateCase Error RatePunctuation F1Parenthesis F-1Line break F1Section break F1ModelNameReleaseDate
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link16.1 20.157.0 29.4 84.473.9AudioShake v32024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link26.0 3.450.529.482.372.1AudioShake v12023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link27.932.645.070.43.7Whisper v2 +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link32.637.243.773.90.6Whisper v3 +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link33.539.339.460.6Whisper v2 +demucs +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link35.539.743.073.51.0Whisper v32024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link35.54.341.673.51.0Whisper v32023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link35.74.541.769.33.3Whisper v22023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link37.842.144.269.33.3Whisper v22024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link44.05.328.061.2Whisper v2 +demucs2023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link44.549.841.661.2Whisper v2 +demucs2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link46.650.433.765.8Whisper v3 +demucs +lang2024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link47.93.829.065.7Whisper v3 +demucs2023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link48.051.633.065.7Whisper v3 +demucs2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link66.572.620.00.041.1OWSM v3.1 +demucs +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link69.375.022.50.637.8OWSM v3.1 +lang2024-07-30