tts.nvim

A comprehensive text-to-speech plugin for Neovim with support for macOS native speech synthesis and OpenAI-compatible TTS endpoints. This plugin allows you to have your text read aloud directly within Neovim. I created it for use with obsidian.nvim. I did not find anything else like it.

TL;DR

-- Quick setup with auto-detection
require('tts').setup()

-- Or specific backend
require('tts').setup({
  backend = 'macos',  -- or 'openai'
  macos = { voice = 'Samantha', rate = 200 }
})

Features

Multi-backend Support: Native macOS say command and OpenAI API
Smart Text Selection: Visual mode, line/paragraph, and motion-based selection
Intelligent Content Filtering: Removes paths, URLs, and technical content for better TTS
Playback Control: Play and stop with basic state management
Queue Management: Add multiple text segments to a queue for sequential playback
Audio Caching: Reduces API calls and improves response time
Text Preprocessing: Clean code comments, expand abbreviations, language-specific replacements
Customizable: Extensive configuration options, hooks, and keymaps

Requirements

Neovim 0.7+
macOS (for native TTS) or OpenAI API key (for OpenAI TTS)

Installation

Using lazy.nvim:

{
  'yourusername/tts.nvim',
  config = function()
    require('tts').setup({
      -- Backend selection: 'auto', 'macos', 'openai'
      backend = 'auto',
      
      -- macOS configuration
      macos = {
        voice = 'Alex',
        rate = 200,              -- 75-720 words per minute
        volume = 0.5,            -- 0.0-1.0
        audio_device = nil,      -- Custom audio device
        pitch = nil,             -- Voice pitch
        modulation = nil,        -- Voice modulation
      },
      
      -- OpenAI configuration
      openai = {
        api_key = vim.env.OPENAI_API_KEY,
        api_url = 'https://api.openai.com/v1/audio/speech',
        model = 'tts-1',         -- or 'tts-1-hd'
        voice = 'alloy',         -- alloy, echo, fable, onyx, nova, shimmer
        speed = 1.0,             -- 0.25-4.0
        format = 'mp3',          -- mp3, opus, aac, flac
        headers = {},            -- Custom headers
        timeout = 30,            -- Request timeout in seconds
      },
      
      -- Playback behavior
      playback = {
        auto_clear_queue = false,
        show_progress = true,
        chunk_size = 500,                    -- Text chunk size for processing
        pause_between_chunks = 0.5,          -- Pause between chunks (seconds)
        player = 'auto',                     -- Audio player: 'auto', 'mpv', 'ffplay', etc.
        player_args = {},                    -- Custom player arguments
        default_selection = 'section',       -- Default text selection: 'line', 'paragraph', 'section'
      },
      
      -- Cache settings
      cache = {
        enabled = true,
        directory = vim.fn.stdpath('cache') .. '/tts',
        max_size = 100,          -- MB
        max_age = 7,             -- days
        cleanup_on_start = true,
      },
      
      -- Custom keymaps
      keymaps = {
        play = '<leader>tp',
        stop = '<leader>ts',
        queue = '<leader>tq',
        clear = '<leader>tc',
        next = '<leader>tn',
        prev = '<leader>tN',
        visual_play = '<leader>tp',
      },
      
      -- Text preprocessing
      preprocessing = {
        clean_markdown = true,           -- Remove markdown syntax
        clean_code = false,              -- Remove code comments
        filtering_level = 'moderate',    -- 'none', 'minimal', 'moderate', 'aggressive'
        expand_abbreviations = true,
        skip_code_blocks = true,         -- Skip code blocks entirely
        replacements = {
          -- Task markers
          ['TODO:?'] = 'todo item',
          ['FIXME:?'] = 'fix me item',
          ['NOTE:?'] = 'note',
          ['WARNING:?'] = 'warning',
          ['TIP:?'] = 'tip',
          
          -- Technical acronyms
          ['API'] = 'A P I',
          ['URL'] = 'U R L',
          ['HTTP'] = 'H T T P',
          ['JSON'] = 'J son',
          ['SQL'] = 'S Q L',
          ['CSS'] = 'C S S',
          ['HTML'] = 'H T M L',
          ['JS'] = 'javascript',
          ['TS'] = 'TypeScript',
          ['AI'] = 'artificial intelligence',
          ['TL;DR'] = 'too long did not read',
          
          -- Common abbreviations
          ['etc.'] = 'etcetera',
          ['i.e.'] = 'that is',
          ['e.g.'] = 'for example',
          
          -- Custom identifiers
          -- ['MyApp'] = 'my app',
          -- ['BigCorp'] = 'big corp',
        },
        languages = {
          lua = {
            ['~='] = 'not equal',
            ['%.%.'] = 'concatenate',
          },
          python = {
            ['!='] = 'not equal',
            ['//'] = 'integer divide',
          },
          javascript = {
            ['=>'] = 'arrow function',
            ['==='] = 'strict equals',
            ['!=='] = 'strict not equals',
          },
          rust = {
            ['fn '] = 'function ',
            ['mut '] = 'mutable ',
            ['impl '] = 'implementation ',
          },
        },
      },
      
      -- Hooks
      hooks = {
        before_play = nil,              -- function(text) return modified_text end
        after_play = nil,               -- function(text) end
        on_state_change = nil,          -- function(new_state, old_state) end
        on_error = function(err)
          vim.notify('TTS Error: ' .. err, vim.log.levels.ERROR)
        end,
        on_queue_item = nil,            -- function(item, index, total) end
      },
      
      -- Notifications
      notifications = {
        level = vim.log.levels.INFO,
        use_notify = false,             -- Use vim.notify for notifications
      },
    })
  end
}

Using packer.nvim:

use {
  'yourusername/tts.nvim',
  config = function()
    require('tts').setup({
      -- Same configuration as above
    })
  end
}

Quick Start

Basic Setup

require('tts').setup()  -- Uses defaults with auto-detection

macOS Setup

require('tts').setup({
  backend = 'macos',
  macos = {
    voice = 'Samantha',  -- or 'Alex', 'Victoria', etc.
    rate = 200           -- words per minute
  }
})

Content Filtering Levels

none: No additional filtering
minimal: Remove paths and URLs only
moderate: Remove paths, URLs, UUIDs, versions, emails, IPs (default)
aggressive: All of the above plus code-specific content

Usage

Commands

:TTS [text] - Speak the provided text or current selection
:TTSPlay - Play current selection/line/paragraph
:TTSStop - Stop current playback
:TTSQueue [text] - Add text to queue or show queue
:TTSClear - Clear the queue
:TTSNext - Skip to next in queue
:TTSPrev - Go to previous in queue
:TTSBackend <name> - Switch backend (macos/openai)
:TTSVoices - List available voices
:TTSSetVoice <voice> - Set voice for current backend
:TTSClearCache - Clear audio cache
:TTSMotion <motion> - Speak text based on vim motion

Default Keymaps

<leader>tp - Play selection/current text
<leader>ts - Stop playback
<leader>tq - Add to queue / show queue
<leader>tc - Clear queue
<leader>tn - Next in queue
<leader>tN - Previous in queue

Usage Examples

Visual Mode

Select text in visual mode
Press <leader>tp or run :TTSPlay

Current Line

Place cursor on a line
Press <leader>tp or run :TTSPlay

Current Paragraph

-- Configure to use paragraph by default
require('tts').setup({
  playback = {
    default_selection = 'paragraph'
  }
})

With Motions

:TTSMotion w    " Speak word
:TTSMotion }    " Speak to next paragraph
:TTSMotion G    " Speak to end of file

Queue Management

" Add multiple items to queue
:TTSQueue First paragraph
:TTSQueue Second paragraph
:TTSQueue Third paragraph

" Show queue
:TTSQueue

" Process queue
:TTSPlay

Advanced Features

Content Processing

The plugin intelligently processes text before speaking:

Pattern Replacement System

The replacements configuration can replace any identifiers or patterns with spoken alternatives:

replacements = {
  -- Technical acronyms
  ["API"] = "A P I",
  ["URL"] = "U R L",
  ["HTTP"] = "H T T P",
  ["JSON"] = "J son",

  -- Project-specific identifiers
  ["MyApp"] = "my app",
  ["BigCorp"] = "big corp",

  -- Custom terminology
  ["foobar"] = "foo bar",
  ["widget"] = "wid get",
}

Markdown Processing

Code blocks: ```code...``` → " code block "
Inline code: `code` → " code "
Links: [text](url) → "text"
Headers: # Header → "Header"
Emphasis: **bold**, *italic* → plain text

Smart Content Filtering

Paths: /path/to/file, C:\Windows → " file path "
URLs: https://example.com → " url "
Git references: HEAD~1, commit hashes → " git reference "
UUIDs: 550e8400-e29b-41d4-a716-446655440000 → " uuid "
Versions: v1.2.3 → " version "
Emails: user@example.com → " email address "
IPs: 192.168.1.1 → " ip address "

Code-Specific Filtering (aggressive level)

JavaScript: npm packages, imports
Python: pip packages, imports
Lua: require statements, variables
Rust: cargo commands, use statements
Go: go commands, imports
Java: Maven commands, imports

Backend Details

macOS Backend

Uses the built-in say command:

Requires macOS
No API key needed
Multiple system voices available
Fast, local processing

OpenAI-Compatible Backend

Uses OpenAI-compatible TTS API (official OpenAI, Kokoro FastAPI, etc.):

API key required for official OpenAI, optional for self-hosted services
High-quality voices (varies by service)
Multiple voice options (alloy, echo, fable, onyx, nova, shimmer for OpenAI)
Audio caching to reduce API calls
Supports custom endpoints and self-hosted services

API Reference

local tts = require("tts")

-- Basic usage
tts.play("Hello world")
tts.play_selection()
tts.stop()


-- Queue management
tts.queue_add("Text to queue")
tts.queue_clear()
tts.queue_list()

-- Backend control
tts.set_backend("openai")
tts.get_backend()
tts.list_voices()
tts.set_voice("alloy")

-- Cache management
tts.clear_cache()
tts.get_cache_stats()

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
lua/tts		lua/tts
plugin		plugin
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

tts.nvim

TL;DR

Features

Requirements

Installation

Quick Start

Basic Setup

macOS Setup

Content Filtering Levels

Usage

Commands

Default Keymaps

Usage Examples

Visual Mode

Current Line

Current Paragraph

With Motions

Queue Management

Advanced Features

Content Processing

Pattern Replacement System

Markdown Processing

Smart Content Filtering

Code-Specific Filtering (aggressive level)

Backend Details

macOS Backend

OpenAI-Compatible Backend

API Reference

About

Uh oh!

Releases

Packages

Languages

chriswritescode-dev/tts.nvim

Folders and files

Latest commit

History

Repository files navigation

tts.nvim

TL;DR

Features

Requirements

Installation

Quick Start

Basic Setup

macOS Setup

Content Filtering Levels

Usage

Commands

Default Keymaps

Usage Examples

Visual Mode

Current Line

Current Paragraph

With Motions

Queue Management

Advanced Features

Content Processing

Pattern Replacement System

Markdown Processing

Smart Content Filtering

Code-Specific Filtering (aggressive level)

Backend Details

macOS Backend

OpenAI-Compatible Backend

API Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages