Xiaodou: Voice-to-Voice Chatbot

Xiaodou is a simple voice-to-voice chatbot designed for seamless interaction with users.

Installation

Install the required packages using one of the following methods:

With pip:
```
pip install -r requirements.txt
```
With poetry (recommended):
```
poetry install --without dev
```

Configuration

Audio Devices

To use a specific audio device, specify the input and output device names in xiaodou/main.py:

# Input and output device names, if None, use default device
INPUT_DEVICE_NAME = None # Azure format
OUTPUT_DEVICE_NAME = None # pygame format

There are some scripts in scripts/ to list the available audio devices for Azure SDK and pygame:

Example 1: on macOS, list the available audio devices:

cd scripts/macos_list_audio_devices
make run

Example 2: list the available audio devices using pygame:

cd scripts/pygame_list_audio_devices
make run

API Keys

Set the following environment variables in a .env file, refer to .env.example for an example:

OPENAI_API_TYPE="azure"
OPENAI_API_BASE="https://example.openai.azure.com/"
OPENAI_API_KEY="..."
OPENAI_API_VERSION="2023-03-15-preview"
AZURE_OPENAI_DEPLOYMENT_NAME="gpt-4"
SPEECH_API_KEY="..."
SPEECH_SERVICE_REGION="..."

Keyword Model

The chatbot is activated upon hearing the keyword "小豆". The example keyword model is located in xiaodou/models/. For additional information, refer to xiaodou/models/README.md.

Usage

Start the chatbot with the following command:

python xiaodou/main.py

Once activated, you can begin conversing with the chatbot. The interaction flow is as follows:

sequenceDiagram
    participant User
    participant Bot
    User->>Bot: Say the keyword
    Bot->>User: Play notification sound
    User->>Bot: Voice input (e.g. "Can you tell me a joke")
    Bot->>Bot: Stop recording after user pause
    Bot->>User: Play another notification sound
    Bot->>Bot: Recognize voice with Azure Speech Service
    Bot->>Bot: Send prompt to OpenAI API
    Bot->>Bot: Receive response
    Bot->>Bot: Synthesize response using Azure Speech Service
    Bot->>User: Play synthesized voice (e.g. "Sure, here's a joke, ...")
    User->>Bot: Repeat (starts with keyword)

Development

To contribute to the development of Xiaodou, follow these steps:

Install pre-commit hooks and development dependencies:
```
poetry install
pre-commit install
```

License

For more information on the license, please refer to the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
scripts		scripts
xiaodou		xiaodou
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scripts

scripts

xiaodou

xiaodou

.env.example

.env.example

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

LICENSE

LICENSE

README.md

README.md

poetry.lock

poetry.lock

pyproject.toml

pyproject.toml

requirements.txt

requirements.txt

Repository files navigation

Xiaodou: Voice-to-Voice Chatbot

Installation

Configuration

Audio Devices

API Keys

Keyword Model

Usage

Development

License

About

Releases

Packages

Languages

License

volltin/xiaodou-bot

Folders and files

Latest commit

History

Repository files navigation

Xiaodou: Voice-to-Voice Chatbot

Installation

Configuration

Audio Devices

API Keys

Keyword Model

Usage

Development

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages