This is the official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
This repository is structured as follows:

```
Diancai-Backend
├─MozartsTouch/: source code for the implementation of Mozart's Touch
│ ├─model/: pre-trained MusicGen model
│ ├─static/: static resources for testing
│ ├─utils/: source code for the modules
│ ├─download_model.py: downloads the pre-trained MusicGen model to model/
│ ├─api_key.ini: OpenAI API key
│ └─main.py: main program of Mozart's Touch
├─outputs/: directory for generated music files
├─backend_app.py: backend web application of Mozart's Touch
└─start_server.py: starts the backend server of Mozart's Touch
```
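Before running anything, it can help to verify that your local clone matches the layout above. The following is a minimal sketch (the entry list mirrors the tree shown here; point `root` at your own clone):

```python
from pathlib import Path

# Entries the layout above says should exist under the repository root.
EXPECTED = [
    "MozartsTouch/download_model.py",
    "MozartsTouch/api_key.ini",
    "MozartsTouch/main.py",
    "outputs",
    "backend_app.py",
    "start_server.py",
]

def missing_entries(root: Path, expected=EXPECTED) -> list[str]:
    """Return the expected paths that are absent under `root`."""
    return [p for p in expected if not (root / p).exists()]

if __name__ == "__main__":
    # Point this at your local clone of the repository.
    print(missing_entries(Path(".")))
```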
- Before running, configure `api_key.ini` in `MozartsTouch/` as follows:

```ini
[OpenAI]
API_KEY=sk-xxxxxxx
```
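A file in this shape can be read with the standard-library `configparser`. This is a minimal sketch of the loading step, not necessarily the exact code Mozart's Touch uses internally:

```python
import configparser

def load_api_key(ini_path: str) -> str:
    """Read the OpenAI API key from an api_key.ini file shaped
    like the example above ([OpenAI] section, API_KEY option)."""
    config = configparser.ConfigParser()
    config.read(ini_path)
    return config["OpenAI"]["API_KEY"]

# Example: key = load_api_key("MozartsTouch/api_key.ini")
```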
- Install dependencies: `pip install -r requirements.txt`
- Run `download_model.py` to download the MusicGen model parameters.
- Use `MozartsTouch.img_to_music_generate()` to generate music.
  - To test the code without importing large models, set `test_mode` to `True` in `main.py`.
  - Switch between `"musicgen_medium"` and `"musicgen_small"` by modifying `music_gen_model_name` (derived from the index in `import_music_generator(mode: int)`) in `main.py`.
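The index-to-model-name selection can be pictured as a small dispatch table. This is a hypothetical sketch; the actual indices accepted by `import_music_generator` in `main.py` may differ, so check that function before relying on any particular mapping:

```python
# Hypothetical index-to-model mapping; verify against
# import_music_generator(mode: int) in main.py.
MUSIC_GEN_MODELS = {
    0: "musicgen_small",
    1: "musicgen_medium",
}

def resolve_model_name(mode: int) -> str:
    """Map an integer mode to a MusicGen model name."""
    try:
        return MUSIC_GEN_MODELS[mode]
    except KeyError:
        raise ValueError(
            f"Unknown mode {mode}; expected one of {sorted(MUSIC_GEN_MODELS)}"
        )
```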
With the setup complete, you can now run the following command to generate music:

```
python main.py
```

or debug without importing the model:

```
python main.py --test_mode
```
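A `--test_mode` flag like the one above is typically wired up with `argparse`. The following is a sketch of how `main.py` might parse it, not the project's exact code:

```python
import argparse

def parse_args(argv=None):
    """Parse command-line options; --test_mode skips loading large models."""
    parser = argparse.ArgumentParser(description="Mozart's Touch music generation")
    parser.add_argument(
        "--test_mode",
        action="store_true",
        help="run without importing the large MusicGen model (for debugging)",
    )
    return parser.parse_args(argv)

if __name__ == "__main__":
    args = parse_args()
    print("test mode:", args.test_mode)
```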
- Install dependencies: `pip install -r requirements_for_server.txt`
- Configure the port number and other parameters in `start_server.py`.
- Run `python start_server.py`.
- Access http://localhost:3000/docs#/ to view the backend documentation and test the APIs.
- Add a feature for user-input prompts.
- Remove `mode` from the API.
- Optimize the MusicGen model code in the music generation module (main goal: improve generation efficiency).
- Deploy the video soundtrack feature; try integrating Video-Llama or Video-BLIP2 into our project.
- Use `argparse` to call the model.
- Add support for other models as alternatives, e.g. LLaMa.