I Am Dictator 🎙️

Push-to-talk speech-to-text transcriber for Windows that lets you dictate text anywhere.

Features

🎯 Universal Hotkey: Hold CTRL+ALT to record, release to stop
🤖 Local AI: Runs OpenAI Whisper locally (via faster-whisper) for accurate transcription
🔇 Subtle Audio Feedback: Low-volume beeps indicate recording state
⚡ Batch Processing: Records complete utterances for better accuracy
🎤 Microphone Selection: Choose from available audio input devices
📝 Auto Text Injection: Automatically types transcribed text
🌍 Multi-language: Detects the spoken language automatically, or uses the language chosen in the tray menu
🔒 Privacy-First: All processing happens locally

Quick Start

Run IAmDictator.exe
The app starts in the system tray
Hold CTRL+ALT and speak
Release keys when done
The transcribed text is inserted into the active application automatically
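
The hold-and-release flow above can be pictured as a small state machine. The sketch below is illustrative only and is not the app's actual hotkey engine (which lives in IAmDictator.Core). It assumes that the ArmedDelayMs setting shown under Configuration means "the keys must stay down this long before recording starts"; that meaning, the class name PushToTalk, and the event names are all hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class State(Enum):
    IDLE = auto()
    ARMING = auto()      # keys are down, waiting out the arm delay
    RECORDING = auto()


@dataclass
class PushToTalk:
    # Mirrors Hotkey.ArmedDelayMs; the interpretation is an assumption.
    armed_delay_ms: int = 120
    state: State = State.IDLE
    pressed_at_ms: int = 0
    events: list = field(default_factory=list)

    def on_keys_down(self, now_ms: int) -> None:
        # Start arming only from idle; key-repeat events are ignored.
        if self.state is State.IDLE:
            self.state = State.ARMING
            self.pressed_at_ms = now_ms

    def on_tick(self, now_ms: int) -> None:
        # Promote to RECORDING once the keys have been held long enough.
        held_ms = now_ms - self.pressed_at_ms
        if self.state is State.ARMING and held_ms >= self.armed_delay_ms:
            self.state = State.RECORDING
            self.events.append("start_recording")

    def on_keys_up(self, now_ms: int) -> None:
        # Catch up on the arm delay in case no tick arrived before release.
        self.on_tick(now_ms)
        if self.state is State.RECORDING:
            self.events.append("stop_and_transcribe")
        self.state = State.IDLE
```

Under this assumption, a brief accidental tap of CTRL+ALT (shorter than the delay) produces no recording, while a longer hold produces one start event and one stop event.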

System Tray Menu

Right-click the tray icon to access:

Language: Auto, English, Spanish, French, German, Portuguese
Model: Tiny (fastest) to Large-v3 (most accurate)
Injection Mode: SendInput or Clipboard
Microphone: Select input device
Settings: Advanced configuration
Logs: View application logs

Requirements

Windows 10/11
.NET 8.0 Runtime
Python 3.10+ (for STT server)
Microphone

Architecture

┌─────────────────────┐
│   WPF Application   │
│  (System Tray UI)   │
└──────────┬──────────┘
           │
           ├── Hotkey Engine (CTRL+ALT)
           ├── Audio Capture (NAudio)
           ├── gRPC Client
           │
           v
┌─────────────────────┐
│   Python STT Server │
│  (faster-whisper)   │
└─────────────────────┘
           │
           v
┌─────────────────────┐
│   Text Injection    │
│   (SendInput API)   │
└─────────────────────┘

Components

IAmDictator.App: WPF application with system tray UI
IAmDictator.Core: Core functionality (hotkeys, audio, text injection)
IAmDictator.STT.Server: Python gRPC server with faster-whisper
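
To make the server's role more concrete, here is a minimal sketch of the transcription step using the faster-whisper package. It is not the code in IAmDictator.STT.Server, and it leaves out the gRPC layer entirely. The function names are hypothetical, the CPU/int8 model settings are just one workable choice, and it assumes the Language setting holds either "auto" or a Whisper language code such as "en".

```python
from typing import Iterable, Optional


def to_whisper_language(setting: str) -> Optional[str]:
    """Map "auto" to None, which lets faster-whisper detect the language."""
    value = setting.strip().lower()
    return None if value in ("", "auto") else value


def join_segments(texts: Iterable[str]) -> str:
    """Join per-segment text into one string, skipping empty segments."""
    return " ".join(t.strip() for t in texts if t.strip())


def transcribe_file(wav_path: str, model_name: str = "small",
                    language: str = "auto") -> str:
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    # CPU with int8 weights is an assumption; a GPU setup would differ.
    model = WhisperModel(model_name, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(
        wav_path, language=to_whisper_language(language)
    )
    # segments is a generator; transcription runs as it is consumed.
    return join_segments(segment.text for segment in segments)
```

Calling transcribe_file("utterance.wav") would load the "small" model and return the whole utterance as one string, which matches the batch-style processing described under Features.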

Configuration

Settings are stored in: %APPDATA%\IAmDictator\settings.json

{
  "Hotkey": {
    "Enabled": true,
    "ArmedDelayMs": 120
  },
  "Transcription": {
    "ModelName": "small",
    "Language": "auto"
  },
  "Injection": {
    "Mode": "SendInput"
  },
  "General": {
    "MicrophoneDeviceIndex": 0
  }
}
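
For scripts that need to read this file, the following sketch loads it and fills in any missing keys. It is not how the app itself reads settings. It assumes the values in the example above are also the defaults, and the helper names (settings_path, merge, load_settings) are hypothetical.

```python
import copy
import json
import os
from pathlib import Path
from typing import Optional

# Assumed defaults, copied from the example settings.json above.
DEFAULTS = {
    "Hotkey": {"Enabled": True, "ArmedDelayMs": 120},
    "Transcription": {"ModelName": "small", "Language": "auto"},
    "Injection": {"Mode": "SendInput"},
    "General": {"MicrophoneDeviceIndex": 0},
}


def settings_path() -> Path:
    # %APPDATA% is only set on Windows; fall back to the home directory elsewhere.
    base = os.environ.get("APPDATA") or str(Path.home())
    return Path(base) / "IAmDictator" / "settings.json"


def merge(defaults: dict, overrides: dict) -> dict:
    """Recursively overlay overrides on a copy of defaults."""
    result = copy.deepcopy(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(result.get(key), dict):
            result[key] = merge(result[key], value)
        else:
            result[key] = value
    return result


def load_settings(path: Optional[Path] = None) -> dict:
    """Read settings.json, falling back to DEFAULTS for anything missing."""
    path = path or settings_path()
    if not path.exists():
        return copy.deepcopy(DEFAULTS)
    with path.open(encoding="utf-8") as f:
        return merge(DEFAULTS, json.load(f))
```

With this approach, a settings file that only contains {"Transcription": {"ModelName": "medium"}} still yields a complete settings dictionary.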

Building from Source

Prerequisites

.NET 8.0 SDK
Python 3.10+
Visual Studio 2022 or VS Code

Build Steps

Clone the repository:

git clone https://github.com/yourusername/IAmDictator.git
cd IAmDictator

Build the solution:

dotnet build IAmDictator.sln --configuration Release

Set up the Python STT server:

cd src/IAmDictator.STT.Server
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt

Run the application:

src\IAmDictator.App\bin\Release\net8.0-windows\IAmDictator.App.exe

Troubleshooting

No transcription appearing

Check that the correct microphone is selected in the tray menu
Verify that the STT server is running (check the logs)
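
One quick way to check whether the STT server is listening is to try opening a TCP connection to it. This is a generic sketch, not a tool shipped with the app. The server's port is not documented here, so the 50051 in the usage comment is only a placeholder (it is the port commonly used in gRPC examples); take the real value from the logs or the server configuration.

```python
import socket


def is_port_open(host: str, port: int, timeout_s: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


# Hypothetical usage; replace 50051 with the server's actual port:
# print(is_port_open("127.0.0.1", 50051))
```

A True result only shows that a process is listening on that port, not that it is the STT server or that a model has finished loading.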

Poor transcription quality

Use a better microphone (headset recommended)
Switch to a larger model in the tray menu (small → medium → large-v3)
Speak clearly and at moderate pace
Reduce background noise

Logs Location

%LOCALAPPDATA%\IAmDictator\logs\
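
When reporting a problem, it can help to grab the last few lines of the most recent log. The sketch below does that. The *.log file pattern is an assumption, since the log file names are not documented here, and the helper names are hypothetical.

```python
import os
from collections import deque
from pathlib import Path
from typing import Optional


def logs_dir() -> Path:
    # %LOCALAPPDATA% is only set on Windows; fall back to the home directory.
    base = os.environ.get("LOCALAPPDATA") or str(Path.home())
    return Path(base) / "IAmDictator" / "logs"


def newest_log(directory: Path, pattern: str = "*.log") -> Optional[Path]:
    """Return the most recently modified matching file, or None if none exist."""
    files = [p for p in directory.glob(pattern) if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)


def tail(path: Path, lines: int = 20) -> list:
    """Return the last `lines` lines of a text file without trailing newlines."""
    with path.open(encoding="utf-8", errors="replace") as f:
        return [line.rstrip("\n") for line in deque(f, maxlen=lines)]
```

For example, newest_log(logs_dir()) followed by tail(...) on the result prints the end of the latest log, provided the directory contains files matching the assumed pattern.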

License

MIT License - See LICENSE file for details

Credits

Whisper: OpenAI
faster-whisper: SYSTRAN
NAudio: Mark Heath
gRPC: Google

Made with ❤️ for productivity enthusiasts
