Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple
End-to-End Full-Length Song Generation with Latent Diffusion

Ziqian Ning, Huakang Chen, Yuepeng Jiang, Chunbo Hao, Guobin Ma, Shuai Wang, Jixun Yao, Lei Xie†

Huggingface Space Demo
📑 Paper | 📑 Demo | 💬 WeChat (微信)

DiffRhythm (Chinese: 谛韵, Dì Yùn) is the first open-sourced diffusion-based song generation model that is capable of creating full-length songs. The name combines "Diff" (referencing its diffusion architecture) with "Rhythm" (highlighting its focus on music and song creation). The Chinese name 谛韵 (Dì Yùn) phonetically mirrors "DiffRhythm", where "谛" (attentive listening) symbolizes auditory perception, and "韵" (melodic charm) represents musicality.

News and Updates

2025.3.4 🔥 We released the DiffRhythm paper and Huggingface Space demo.

TODOs

Model Versions

Model	HuggingFace
DiffRhythm-base (1m35s)	https://huggingface.co/ASLP-lab/DiffRhythm-base
DiffRhythm-full (4m45s)	Coming soon...
DiffRhythm-vae	https://huggingface.co/ASLP-lab/DiffRhythm-vae

License & Disclaimer

As the VAE is fine-tuned from Stable Audio Open, DiffRhythm is subject to the Stability AI Community License Agreement

DiffRhythm enables the creation of original music across diverse genres, supporting applications in artistic creation, education, and entertainment. While designed for positive use cases, potential risks include unintentional copyright infringement through stylistic similarities, inappropriate blending of cultural musical elements, and misuse for generating harmful content. To ensure responsible deployment, users must implement verification mechanisms to confirm musical originality, disclose AI involvement in generated works, and obtain permissions when adapting protected styles.

Citation

@article{ning2025diffrhythm,
  title={{DiffRhythm}: Blazingly Fast and Embarrassingly Simple</br>End-to-End Full-Length Song Generation with Latent Diffusion<},
  author={Ziqian, Ning and Huakang, Chen and Yuepeng, Jiang and Chunbo, Hao and Guobin, Ma and Shuai, Wang and Jixun, Yao and Lei, Xie},
  journal={arXiv preprint arXiv:2503.01183},
  year={2025}
}

Contact Us

If you are interested in leaving a message to our research team, feel free to email nzqiann@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
config		config
dataset		dataset
g2p		g2p
infer		infer
model		model
scripts		scripts
src		src
train		train
.gitignore		.gitignore
LICENSE.md		LICENSE.md
Readme.md		Readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple
End-to-End Full-Length Song Generation with Latent Diffusion

News and Updates

2025.3.4 🔥 We released the DiffRhythm paper and Huggingface Space demo.

TODOs

Model Versions

License & Disclaimer

Citation

Contact Us

About

Uh oh!

Releases

Packages

Languages

License

lizhaohai369/fork-DiffRhythm-py

Folders and files

Latest commit

History

Repository files navigation

Di♪♪Rhythm: Blazingly Fast and Embarrassingly SimpleEnd-to-End Full-Length Song Generation with Latent Diffusion

News and Updates

2025.3.4 🔥 We released the DiffRhythm paper and Huggingface Space demo.

TODOs

Model Versions

License & Disclaimer

Citation

Contact Us

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple
End-to-End Full-Length Song Generation with Latent Diffusion

Packages