Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 20.8 | 23.5 | | 46.1 | 3.2 | 88.6 | 69.0 | AudioShake v3 | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 27.1 | 30.5 | | 45.3 | | 73.7 | | Whisper v2 +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 27.7 | 31.1 | | 45.9 | | 73.4 | 1.4 | Whisper v2 | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 27.7 | | 3.2 | 45.8 | | 73.4 | 1.4 | Whisper v2 | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 34.7 | 38.0 | | 42.5 | | 77.9 | | Whisper v3 | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 34.7 | 38.0 | | 42.3 | | 77.9 | | Whisper v3 +lang | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 34.7 | | 3.3 | 42.4 | | 77.8 | | Whisper v3 | 2023-11-23 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 34.9 | | 2.0 | 45.8 | 41.3 | 84.9 | 72.5 | AudioShake v1 | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 38.2 | 42.1 | | 36.1 | | 65.6 | | Whisper v2 +demucs +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 43.3 | 46.9 | | 38.0 | | 66.0 | | Whisper v2 +demucs | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 43.3 | | 3.2 | 34.9 | | 66.1 | | Whisper v2 +demucs | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 44.9 | 48.2 | | 32.0 | | 69.3 | | Whisper v3 +demucs | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 44.9 | 48.3 | | 32.0 | | 69.3 | | Whisper v3 +demucs +lang | 2024-07-30 |
Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | ✓ Link | 44.9 | | 3.2 | 30.9 | | 69.4 | | Whisper v3 +demucs | 2023-11-23 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 71.6 | 75.7 | | 30.6 | 1.9 | 36.0 | | OWSM v3.1 +lang | 2024-07-30 |
Lyrics Transcription for Humans: A Readability-Aware Benchmark | ✓ Link | 78.5 | 82.1 | | 22.3 | 0.0 | 40.9 | | OWSM v3.1 +demucs +lang | 2024-07-30 |