AI Voice Keyboard

A Windows application that converts voice to keyboard input using speech recognition technology.

Latest Update (v1.0.3)

Fixed issues with recording state management
Improved floating button synchronization
Enhanced error handling and thread safety
See full release notes

Features

Two Recognition Modes:
- Command Mode: Execute keyboard commands like "up", "down", "control c", etc.
- String Mode: Type text as you speak naturally
AI-Powered Recognition:
- Uses Whisper.net for accurate speech recognition
- Falls back to Windows Speech Recognition when needed
Convenient Interface:
- Floating overlay button for easy access
- System tray integration
- Minimalist design

Getting Started

Prerequisites

Windows 10 or later
.NET 8.0 Runtime
Microphone for voice input

Installation

Download the latest release from the Releases page
Extract the ZIP file to a location of your choice
Run VoiceToKeyboard.exe

First Run

On first launch, AI Voice Keyboard will need to download the Whisper model. If it doesn't download automatically:

Download the model manually from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-medium.bin
Place the model file in the same directory as the application executable

Usage

Basic Controls

Start/Stop Listening: Click the "Start Listening" button or use the floating button
Switch Modes: Select Command or String mode using the radio buttons
Floating Button: Enable/disable the floating overlay for easy access

Voice Commands

Command Mode

Say commands like:

Navigation: "up", "down", "left", "right", "page up", "page down", "home", "end"
Function keys: "f1", "f2", etc.
Common shortcuts: "control c" (copy), "control v" (paste), "control z" (undo)
Switch modes: "command option" or "string option"

String Mode

Simply speak naturally and the system will type what you say
Speech will automatically pause after 1.5 seconds of silence
To stop recognition, click the recording button again

Troubleshooting

No Speech Recognition: Ensure your microphone is properly connected and set as the default recording device
Poor Recognition: Speak clearly and at a moderate pace
Missing Whisper Model: Download the model file manually as described in the First Run section

License

This project is licensed under the GPL-3.0 License - see the LICENSE file for details.

Acknowledgments

Whisper.net for AI speech recognition
NAudio for audio processing
Developed by ShioDev
Powered by Beaver Foundation

Version

See VERSION.md for version history and release notes.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
VoiceToKeyboard		VoiceToKeyboard
.gitignore		.gitignore
README.md		README.md
VERSION.md		VERSION.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Voice Keyboard

Latest Update (v1.0.3)

Features

Getting Started

Prerequisites

Installation

First Run

Usage

Basic Controls

Voice Commands

Command Mode

String Mode

Troubleshooting

License

Acknowledgments

Version

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Voice Keyboard

Latest Update (v1.0.3)

Features

Getting Started

Prerequisites

Installation

First Run

Usage

Basic Controls

Voice Commands

Command Mode

String Mode

Troubleshooting

License

Acknowledgments

Version

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages