⚠️ The main branch is bleeding edge: Expect frequent updates and many breaking changes after every commit
This project allows you to create character cards (like NAI/character.ai/tavern) and model cards to easily chat with a LLM voiced by your favourite RVC model. Use the app to download the required files before using or manually download them here: https://huggingface.co/datasets/SayanoAI/RVC-Studio/tree/main
- Character card builder: model agnostic JSON character cards compatible with any LLM model
- Model card builder: customize prompt format for each LLM to minimize parroting (e.g. LLM speaking for the user)
- LLM E/RP: uncensored chat with your RVC model in real time using popular GGUF LLMs.
- multimodal integration: send and receive pictures from your LLM model
- Python 3.8 or higher (developed and tested on v3.8.17)
- Git
- Clone this repository or download the zip file and extract it.
- Double-click "conda-installer.bat" to install the latest version of conda package manager
- Double-click "conda-start.bat" (if you skipped step 2.)
- Clone this repository or download the zip file.
- Navigate to the project directory and create a virtual environment with the command
virtualenv venv
. - Activate the virtual environment with the command
source venv/bin/activate
on Linux/Mac orvenv\Scripts\activate
on Windows. Or useconda create -n RVC-Chat & conda activate RVC-Chat
if you're using conda package manager. - Install the required packages with the command
pip install -r requirements.txt
. - Run the streamlit app with the command
streamlit run Home.py
.
- Download one of the following recommended LLMs and place them in your ./models/LLM folder (or use the homepage downloader):
- stablelm-zephyr-3b (< 8GB VRAM)
- neuralhermes-7b (8 GB VRAM)
- llama2-13b-tiefighter (> 8 GB VRAM)
- Navigate to LLM Server page and start the server after selecting the downloaded LLM
- Navigate to Chat page and write your name (this is what the LLM will call you)
- Select Your Character (or create one using Character Builder)
- Select a SD model for image generation
- Click "Start Chatting" to chat with your model
Feel free to explore other LLMs. (you will have to build your own model config)
Run docker compose up --build
in the main project folder.
- nothing yet :D
This project is for educational and research purposes only. The generated voice overs are not intended to infringe on any copyrights or trademarks of the original songs or text. The project does not endorse or promote any illegal or unethical use of the generative AI technology. The project is not responsible for any damages or liabilities arising from the use or misuse of the generated voice overs.
This project uses code and AI models from the following repositories:
- RVC-Studio by SayanoAI.
- SpeechT5: A Self-Supervised Pre-training Model for Speech Recognition and Generation by Microsoft.
We thank all the authors and contributors of these repositories for their amazing work and for making their code and models publicly available.