OpenCodePapers

text-to-music-generation-on-musiccaps

Text-to-Music Generation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeFADFD_openl3FDKL_passtISCLAP_LAIONCLAP_MSModelNameReleaseDate
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models✓ Link1.1222.650.89MeLFusion (image-conditioned)2024-06-07
FLUX that Plays Music✓ Link1.431.252.98FLUXMusic2024-09-01
Quality-aware Masked Diffusion Transformer for Enhanced Music Generation✓ Link1.651.312.80OpenMusic (QA-MDT)2024-05-24
ETTA: Elucidating the Design Space of Text-to-Audio Models✓ Link1.9192.1810.060.843.320.510.53ETTA2024-12-26
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models✓ Link2.001.29JEN-12023-08-09
Noise2Music: Text-conditioned Music Generation with Diffusion Models2.134Noise2Music waveform2023-02-08
Improving Text-To-Audio Models with Synthetic Captions✓ Link2.21270.3222.690.942.790.510.43TANGO-AF2024-06-18
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining✓ Link2.93190.1616.341.002.590.480.47AudioLDM2-large2023-08-10
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining✓ Link3.131.20AudioLDM 2-Full2023-08-10
Simple and Controllable Music Generation✓ Link3.41.23MusicGen w/o melody (1.5B)2023-06-08
Stable Audio Open✓ Link3.51127.2036.421.322.930.480.49Stable Audio Open2024-07-19
UniAudio: An Audio Foundation Model Toward Universal Audio Generation✓ Link3.651.87UniAudio2023-10-01
Simple and Controllable Music Generation✓ Link3.8197.121.31MusicGen w/o melody (3.3B)2023-06-08
Noise2Music: Text-conditioned Music Generation with Diffusion Models3.840Noise2Music spectrogram2023-02-08
MusicLM: Generating Music From Text✓ Link4.0MusicLM2023-01-26
Simple and Controllable Music Generation✓ Link5.01.31MusicGen w/ random melody (1.5B)2023-06-08
Efficient Neural Music Generation5.41MeLoDy2023-05-25
MusicLM: Generating Music From Text✓ Link9.6Mubert2023-01-26
MusicLM: Generating Music From Text✓ Link13.4Riffusion2023-01-26
Fast Timing-Conditioned Latent Audio Diffusion✓ Link108.690.80Stable Audio2024-02-07
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining✓ Link354.051.53AudioLDM2-music2023-08-10