A Windows application that transcribes WhatsApp audio messages to text using Whisper.net.
- Transcribes audio files to text
- Supports automatic language detection
- Uses Whisper.net for high-quality speech recognition
- Simple and intuitive user interface
- Copy an audio file to your clipboard (Ctrl+C on the file in Windows Explorer)
- Click the "Transcribe Audio" button in the application
- The application will convert the audio to the required format using FFmpeg
- The transcription will appear in the text box
- Windows operating system
- .NET 8.0 or higher
- FFmpeg installed and available in your PATH
- CUDA-compatible GPU (optional, for faster processing)
- Clone this repository
- Open the solution in Visual Studio
- Build and run the application
- On first run, the application will automatically download the required Whisper model
- Whisper.net - .NET wrapper for OpenAI's Whisper model
- FFmpeg - For audio conversion
- NVidia GPU with CUDA support
- CUDA Toolkit (>= 12.1)
The application uses:
- Whisper.net for speech recognition
- FFmpeg for audio conversion
- Windows Forms for the user interface
- CUDA acceleration (if available)
The first time you run the application, it will download the large Whisper model (approximately 3GB). This may take some time depending on your internet connection.
This project is open source and available under the MIT License.