Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 17.3 | 20.9 | | 65.3 | 37.9 | 84.3 | 84.8 | AudioShake v3 | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 22.1 | | 3.4 | 59.0 | 32.4 | 80.7 | 77.4 | AudioShake v1 | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 24.6 | 28.0 | | 34.0 | | 74.0 | 1.4 | LyricWhiz | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 24.6 | | 3.5 | 34.0 | | 74.0 | 1.4 | LyricWhiz | 2023-11-23 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 32.3 | | 5.3 | 39.2 | | 53.8 | | Whisper v2 +demucs | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 33.3 | 39.1 | | 42.2 | | 53.9 | | Whisper v2 +demucs | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 35.6 | 41.3 | | 41.8 | | 53.4 | | Whisper v2 +demucs +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 36.4 | 41.4 | | 41.8 | | 72.5 | 2.6 | Whisper v3 +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 37.7 | 42.5 | | 41.4 | | 71.5 | 2.6 | Whisper v3 | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 37.7 | | 4.8 | 40.9 | | 71.5 | 2.6 | Whisper v3 | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 39.7 | 43.7 | | 34.9 | | 65.5 | 11.6 | Whisper v2 +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 43.0 | 47.2 | | 25.8 | | 66.9 | | Whisper v3 +demucs | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 43.0 | 47.2 | | 25.8 | | 66.9 | | Whisper v3 +demucs +lang | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 43.0 | | 4.1 | 23.3 | | 66.8 | | Whisper v3 +demucs | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 43.8 | 47.5 | | 31.5 | | 63.0 | 11.2 | Whisper v2 | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 43.8 | | 3.5 | 31.3 | | 63.0 | 11.2 | Whisper v2 | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 63.4 | 69.4 | | 21.5 | 0.0 | 47.3 | | OWSM v3.1 +demucs +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 68.6 | 74.0 | | 22.3 | | 42.7 | | OWSM v3.1 +lang | 2024-07-30 |