OpenCodePapers

automatic-lyrics-transcription-on-jam-alt-2

Speech RecognitionAutomatic Lyrics Transcription
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeWord Error Rate (WER)Case-Sensitive Word Error RateCase Error RatePunctuation F-1Parenthesis F-1Line break F-1Section break F-1ModelNameReleaseDate
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link12.617.7 56.7 4.281.566.4AudioShake v32024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link21.927.752.571.53.1Whisper v2 +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link22.428.044.574.50.0Whisper v3 +lang2024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link22.54.147.838.082.7 69.6AudioShake v12023-11-23
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link25.76.550.071.73.1Whisper v22023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link25.831.552.871.73.1Whisper v22024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link28.633.642.573.7Whisper v32024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link28.6 5.041.973.7Whisper v32023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link34.942.234.352.6Whisper v2 +demucs +lang2024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link38.87.117.256.4Whisper v2 +demucs2023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link39.646.540.456.6Whisper v2 +demucs2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link58.662.134.454.7Whisper v3 +demucs +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link61.564.932.452.3Whisper v3 +demucs2024-07-30
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark✓ Link61.53.628.752.4Whisper v3 +demucs2023-11-23
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link70.876.09.033.5OWSM v3.1 +demucs +lang2024-07-30
Lyrics Transcription for Humans: A Readability-Aware Benchmark✓ Link73.378.58.80.030.2OWSM v3.1 +lang2024-07-30