TTS CLI

A command-line interface for text-to-speech with voice cloning, powered by Qwen3-TTS.

Supports PyTorch (CUDA / CPU) and MLX (Apple Silicon) backends with automatic platform detection.

Features

Voice cloning — clone any voice from a short audio sample
Streaming playback — hear audio as it generates, no waiting
Apple Silicon native — MLX backend for fast local inference
Two model sizes — 1.7B (quality) and 0.6B (speed)
JSON output — machine-readable output for scripting and pipelines
Configurable — TOML config files or CLI flags

Installation

Requires Python 3.11+

Quick Install

curl -fsSL https://raw.githubusercontent.com/jiweiyuan/ttscli/main/install.sh | bash

With options:

# Specify backend
curl -fsSL https://raw.githubusercontent.com/jiweiyuan/ttscli/main/install.sh | bash -s -- --backend mlx
curl -fsSL https://raw.githubusercontent.com/jiweiyuan/ttscli/main/install.sh | bash -s -- --backend pytorch

# Force using uv
curl -fsSL https://raw.githubusercontent.com/jiweiyuan/ttscli/main/install.sh | bash -s -- --uv

pip Install

# Basic install
pip install ttscli

# With PyTorch backend
pip install ttscli[pytorch]

# With MLX backend (Apple Silicon)
pip install ttscli[mlx]

# Development
pip install ttscli[dev]

Or install from source:

git clone https://github.com/your-org/ttscli.git
cd ttscli
pip install -e ".[pytorch]"

Verify:

tts --version

Test basic commands:

tts voice list       # List voices
tts config show      # Show config
tts --help           # View help

Troubleshooting

Command not found — ensure your Python scripts directory is in PATH:

export PATH="$HOME/.local/bin:$PATH"

Or use the module directly:

python -m ttscli --version

Import errors — reinstall dependencies:

pip install -e ".[pytorch]" --force-reinstall

Permission errors — install in user mode:

pip install --user -e .

Uninstallation

pip uninstall ttscli

Quick Start

1. Add a voice sample

tts voice add recording.wav --text "The transcript of the recording" --voice myvoice

2. Speak aloud (streaming)

tts say "Hello, how are you today?" --voice myvoice

3. Save to file

tts say "Hello world" --voice myvoice -o hello.wav --no-play

Commands

`tts say`

Generate speech from text. Plays aloud with streaming by default.

tts say "Text to speak" [OPTIONS]

Options:
  -v, --voice TEXT     Voice name (default: configured default)
  -l, --language TEXT  Language code (default: en)
  -m, --model TEXT     Model size: 1.7B or 0.6B (default: 1.7B)
  -o, --output PATH   Save to WAV file
  -i, --instruct TEXT  Speaking style instruction
  --no-play            Don't play audio, only save to file
  --no-stream          Disable streaming (generate all, then play)
  --seed INT           Random seed for reproducibility

Examples:

tts say "Hello, how are you?"                      # play aloud
tts say "Good morning" --voice myvoice             # use specific voice
tts say "Hello world" -o hello.wav                 # play and save
tts say "Hello world" -o hello.wav --no-play       # save only
tts say "Breaking news!" -i "Speak urgently"       # with style instruction
tts say "Slow and steady" --no-stream              # generate all, then play

`tts voice`

Manage voices and audio samples.

tts voice add <audio_file> [OPTIONS]   # Add sample (creates voice if needed)
tts voice list                          # List all voices
tts voice info [VOICE]                  # Show voice details
tts voice delete <VOICE> [-y]           # Delete a voice
tts voice default [VOICE]               # Set/show default voice
tts voice default --unset               # Unset default voice

`tts config`

View and update configuration.

tts config show                # Show current config
tts config set <key> <value>   # Set a config value

Available config keys: data_dir, default_voice, default_language, default_model, output_format, auto_play

JSON Output

Use --json or --output json for machine-readable output:

tts --json voice list
tts --output json say "Hello" --voice myvoice

Configuration

Configuration is loaded from (in order of priority):

CLI flags (--data-dir, --output)
Config files:
- ./tts.toml (project-local)
- ~/.config/tts/config.toml
- ~/.tts/config.toml

Example config.toml:

default_voice = "myvoice"
default_language = "en"
default_model = "1.7B"
output_format = "rich"
data_dir = "~/tts"

Data Storage

All data is stored in ~/tts/ by default:

~/tts/
├── voices.json       # Voice definitions and metadata
├── samples/          # Audio samples for voice cloning
└── generations/      # Generated audio files

Requirements

Python 3.11+
PyTorch backend: torch, transformers, qwen-tts
MLX backend (Apple Silicon): mlx, mlx-audio
Audio: soundfile, sounddevice

System dependency: SoX (required by qwen-tts)

# macOS
brew install sox
# Ubuntu/Debian
sudo apt install sox

License

MIT — see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.claude/skills		.claude/skills
.github/workflows		.github/workflows
demo		demo
docs		docs
tests		tests
ttscli		ttscli
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
demo.mp4		demo.mp4
install.sh		install.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TTS CLI

Features

Installation

Quick Install

pip Install

Troubleshooting

Uninstallation

Quick Start

1. Add a voice sample

2. Speak aloud (streaming)

3. Save to file

Commands

`tts say`

`tts voice`

`tts config`

JSON Output

Configuration

Data Storage

Requirements

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TTS CLI

Features

Installation

Quick Install

pip Install

Troubleshooting

Uninstallation

Quick Start

1. Add a voice sample

2. Speak aloud (streaming)

3. Save to file

Commands

tts say

tts voice

tts config

JSON Output

Configuration

Data Storage

Requirements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`tts say`

`tts voice`

`tts config`

Packages