NovaVoice

NovaVoice is an AI voice assistant. It is at a very early stage of development, and contributions are welcome!

Demo

demo.mp4

Features

🎤 Wake Word Detection (Porcupine or Azure Speech)
🗣️ Speech-to-Text (built-in pipeline using WebRtcVad and Whisper Turbo, or Cheetah)
🔊 Text-to-Speech (Windows Speech or Mimic3)
🤖 AI Interactions (via Groq API)
🔍 Extensible Tool System (Google Search API, YouTube Data API, AccuWeather API)
📝 Conversation History
🎵 Media Playback Support
🔄 Configurable Audio Processing Pipeline

Prerequisites

.NET 8.0 SDK
Required external dependencies:
- yt-dlp (for YouTube playback)
- mpv (for audio playback)
- mimic3 (if using Mimic TTS)
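
A quick way to confirm the external tools are reachable before the first run is to check that each one is on your PATH. The snippet below is a minimal sketch, assuming a POSIX shell (for example Git Bash or WSL on Windows); the helper name `missing_deps` is hypothetical and not part of NovaVoice.

```shell
#!/bin/sh
# Hypothetical helper (not part of NovaVoice): print every command
# from the argument list that cannot be found on PATH.
missing_deps() {
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            printf '%s\n' "$cmd"
        fi
    done
}

# mimic3 is only needed if you select the Mimic TTS provider.
missing=$(missing_deps dotnet yt-dlp mpv mimic3)

if [ -n "$missing" ]; then
    echo "Missing from PATH:"
    echo "$missing"
else
    echo "All external dependencies found."
fi
```

If `mimic3` is reported missing and you use Windows Speech for text-to-speech, that result can be ignored.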

Configuration

Configure the assistant through environment variables or appsettings.json:

{
  "Recorder": "Picovoice",
  "WakeWordProvider": "Porcupine",
  "SpeechToTextProvider": "Cheetah",
  "TextToSpeechProvider": "WindowsSpeech",
  "Google": {
    "ApiKey": "YOUR_KEY",
    "CustomSearchId": "YOUR_ID"
  },
  "AccuWeather": {
    "ApiKey": "YOUR_KEY"
  },
  "Groq": {
    "ApiKey": "YOUR_KEY",
    "WhisperModel": "whisper-large-v3-turbo",
    "GeneralModel": "llama-3.1-70b-versatile",
    "ToolUseModel": "llama3-groq-70b-8192-tool-use-preview",
    "SystemPrompt": "You are a helpful, child-friendly voice assistant called Nova. Respond concisely to user prompts (received via Automatic Speech Recognition). You have access to the following tools: `google_search`, `youtube_search`, and `check_weather`. Consider carefully when you should use them.\n\n- Use `youtube_search` to find music and audio content that the user may want to listen to.\n- Use `check_weather` to get the current weather conditions or forecasts for a location.\n- Use `google_search` for retrieving live data, for example news or exchange rates\n\nPrioritise safety and well-being, never provide responses that could be harmful, inappropriate, biased, or misleading. If the prompt is ambiguous, unsafe, or you are unable to provide a helpful response using the available tools, respond with \"Sorry, I can't help with that\"."
  },
  "Picovoice": {
    "AccessKey": "YOUR_KEY",
    "Porcupine": {
      "Keywords": ["Hey Nova", "OK Stop"]
    }
  }
}
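
The same settings can be supplied as environment variables. The sketch below assumes NovaVoice uses the standard .NET environment-variable configuration provider with no prefix, which this README does not confirm. Under that provider, nested keys are joined with a double underscore and array elements are addressed by index.

```shell
# Assumption: default .NET environment-variable configuration provider, no prefix.
# "Section__Key" maps to { "Section": { "Key": ... } } in appsettings.json.
export Recorder="Picovoice"
export WakeWordProvider="Porcupine"
export SpeechToTextProvider="Cheetah"
export TextToSpeechProvider="WindowsSpeech"

export Google__ApiKey="YOUR_KEY"
export Google__CustomSearchId="YOUR_ID"
export AccuWeather__ApiKey="YOUR_KEY"
export Groq__ApiKey="YOUR_KEY"
export Picovoice__AccessKey="YOUR_KEY"

# Array elements use a numeric index as the final key segment.
export Picovoice__Porcupine__Keywords__0="Hey Nova"
export Picovoice__Porcupine__Keywords__1="OK Stop"
```

In PowerShell the variable names are the same, set with the `$env:Name = "value"` syntax. Keeping API keys in environment variables rather than in appsettings.json helps avoid committing them to source control.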

Project Structure

Speech/ - Core speech processing components
- Recorder/ - Audio input handling
- SpeechToText/ - Speech recognition implementations
- TextToSpeech/ - Speech synthesis implementations
- WakeWord/ - Wake word detection providers
Tools/ - Extensible tool system implementations
Models/ - Data models and configuration classes
Events/ - Event handling system

Getting Started

1. Clone the repository
2. Configure your API keys in appsettings.json or environment variables
3. Install required external dependencies
4. Build and run the project:

dotnet build
dotnet run

Provider Options

Wake Word Detection

Porcupine
Azure Speech

Speech-to-Text

Built-in (crude implementation using WebRtcVadSharp and Whisper Turbo)
Cheetah

Text-to-Speech

Windows Speech
Mimic3

Recording Engine

NAudio
Picovoice

Tool System

The assistant includes three built-in tools:

google_search - Web search functionality
youtube_search - YouTube content search and playback
check_weather - Weather information retrieval

Acknowledgments

Thanks to UNIVERSFIELD for the sound effects.

mhingston/NovaVoice