Sanskrit Read Story - Whisper.cpp

A Flutter mobile app for Sanskrit reading practice using Whisper.cpp for speech recognition.

Features

Offline Speech Recognition: Uses Whisper.cpp with ggml quantized models
Sanskrit Support: Optimized for Sanskrit pronunciation using the Hindi language model
Read-Along Experience: Interactive word-by-word reading practice
Text-to-Speech: Tap words to hear correct pronunciation
Fully Offline: All processing happens on-device

Technology Stack

Flutter: Cross-platform mobile framework
Whisper.cpp: C/C++ port of OpenAI's Whisper models for fast on-device inference
ggml: Efficient machine learning inference library
Flutter TTS: Text-to-speech for pronunciation guidance

Project Structure

lib/
├── main.dart              # Main app entry point
└── services/
    └── whisper_service.dart  # Whisper.cpp integration service

assets/
└── model/
    └── ggml-sanskrit-q5_1.bin  # Quantized Whisper model for Sanskrit

Setup

Prerequisites

Flutter SDK (>=3.1.0)
Android Studio / Xcode
Whisper ggml model for Sanskrit

Installation

Clone the repository:

git clone <your-repo-url>
cd read_story

Install dependencies:

flutter pub get

Ensure the ggml model is in the assets folder:

assets/model/ggml-sanskrit-q5_1.bin

Run the app:

flutter run

How It Works

Model Loading: On startup, the app copies the ggml model from assets to app storage
Audio Recording: Captures microphone input at 16 kHz (the sample rate Whisper expects)
Streaming Recognition: Accumulates audio into 3-second chunks for processing
Transcription: Whisper.cpp transcribes each chunk as soon as it is complete, in near real time
Word Matching: Compares transcribed text with expected Sanskrit words
Progress Tracking: Advances through the story as words are correctly spoken
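
The word-matching and progress-tracking steps above can be sketched roughly as follows. This is a minimal illustration, not the app's actual code: the names normalizeWord and ReadingProgress are hypothetical, and it assumes exact string comparison after stripping punctuation, which is stricter than a real recognizer's output may warrant.

```dart
/// Strips whitespace and common punctuation, including the Devanagari
/// danda and double danda, so that "काकः।" and "काकः" compare equal.
String normalizeWord(String word) {
  return word.trim().replaceAll(RegExp(r'[\s।॥,.!?;:]'), '');
}

/// Hypothetical helper that tracks which story word is expected next.
class ReadingProgress {
  ReadingProgress(List<String> words)
      : _expected = words.map(normalizeWord).toList();

  final List<String> _expected;
  int _index = 0;

  /// Index of the next word the reader is expected to say.
  int get index => _index;

  /// True once every story word has been matched.
  bool get isFinished => _index >= _expected.length;

  /// Scans a transcript left to right and advances past each expected
  /// word that appears in order. Returns the number of words matched.
  int advance(String transcript) {
    final heard = transcript
        .split(RegExp(r'\s+'))
        .map(normalizeWord)
        .where((w) => w.isNotEmpty);
    var matched = 0;
    for (final word in heard) {
      if (isFinished) break;
      if (word == _expected[_index]) {
        _index++;
        matched++;
      }
    }
    return matched;
  }
}
```

Words in the transcript that do not match the expected word are skipped instead of resetting progress, so a stray misrecognition does not send the reader back to the start.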

Model Details

Format: GGML quantized (Q5_1)
Language: Hindi (used for Sanskrit recognition)
Size: Optimized for mobile deployment
Inference: On-device using the whisper_flutter_new package
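
The model-loading step described under How It Works (copying the ggml file from assets to app storage) might look something like the sketch below. It uses only dart:io so it runs outside Flutter. The function name ensureModelCopied and the loadModelBytes callback are assumptions for illustration; in a Flutter app the callback would typically wrap an asset read such as rootBundle.load.

```dart
import 'dart:io';

/// Copies the model into [targetDir] unless a non-empty copy already exists.
/// [loadModelBytes] is a hypothetical callback that supplies the raw model
/// bytes, for example by reading the bundled asset.
Future<File> ensureModelCopied({
  required Future<List<int>> Function() loadModelBytes,
  required Directory targetDir,
  String fileName = 'ggml-sanskrit-q5_1.bin',
}) async {
  final target = File('${targetDir.path}${Platform.pathSeparator}$fileName');
  if (await target.exists() && await target.length() > 0) {
    return target; // Already copied on an earlier launch.
  }
  await targetDir.create(recursive: true);
  final bytes = await loadModelBytes();
  // flush: true makes sure the bytes reach disk before the path is used.
  await target.writeAsBytes(bytes, flush: true);
  return target;
}
```

Skipping the copy when the file already exists keeps later launches from paying the copy cost again.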

Customization

Change Story Words

Edit the _words list in lib/main.dart:

final List<String> _words = ['एकः', 'काकः', 'पिपासितः', 'आसीत्', 'सः', 'जलार्थम्'];
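
For longer stories it may be easier to derive the list from the story text than to type each word. The helper below is a hypothetical sketch (wordsFromStory is not part of the app) and assumes words are separated by whitespace or by the danda and double danda marks.

```dart
/// Splits a story into words on whitespace and Devanagari sentence marks.
List<String> wordsFromStory(String story) {
  return story
      .split(RegExp(r'[\s।॥]+'))
      .where((w) => w.isNotEmpty)
      .toList();
}
```

Calling wordsFromStory('एकः काकः पिपासितः आसीत्। सः जलार्थम्') yields the same six words as the list above.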

Adjust Buffer Size

Modify the buffer duration in lib/services/whisper_service.dart:

static const int _bufferSizeSeconds = 3; // Seconds of audio to accumulate
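
To see what this setting controls, here is a rough sketch of a fixed-duration chunk buffer. It is not the code in whisper_service.dart: the class name ChunkBuffer is hypothetical, and it assumes 16-bit mono PCM, so a 3-second chunk at 16 kHz is 3 × 16,000 × 2 = 96,000 bytes.

```dart
import 'dart:typed_data';

const int sampleRateHz = 16000; // Whisper expects 16 kHz audio.
const int bytesPerSample = 2; // Assumption: 16-bit mono PCM samples.

/// Hypothetical buffer that collects PCM bytes and emits full chunks.
class ChunkBuffer {
  ChunkBuffer({this.bufferSizeSeconds = 3});

  final int bufferSizeSeconds;
  final BytesBuilder _pending = BytesBuilder(copy: true);

  /// Number of bytes in one complete chunk.
  int get chunkSizeBytes => sampleRateHz * bytesPerSample * bufferSizeSeconds;

  /// Adds recorded bytes and returns every complete chunk now available.
  /// Leftover bytes stay buffered for the next call.
  List<Uint8List> add(List<int> pcmBytes) {
    _pending.add(pcmBytes);
    final chunks = <Uint8List>[];
    if (_pending.length < chunkSizeBytes) return chunks;
    final all = _pending.takeBytes(); // Also clears the builder.
    var offset = 0;
    while (all.length - offset >= chunkSizeBytes) {
      chunks.add(Uint8List.sublistView(all, offset, offset + chunkSizeBytes));
      offset += chunkSizeBytes;
    }
    _pending.add(Uint8List.sublistView(all, offset));
    return chunks;
  }
}
```

A larger buffer gives the recognizer more context per chunk but delays feedback; a smaller one does the opposite.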

Performance

Model Load Time: ~2-5 seconds on modern devices
Transcription Latency: ~1-3 seconds per 3-second audio chunk
Memory Usage: ~200-300 MB with model loaded
Battery Impact: Moderate during active recording
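
One way to read the latency figure is as a real-time factor: processing time divided by audio duration. The small function below is an illustrative sketch (realTimeFactor is a hypothetical name) for checking whether transcription keeps pace with the speaker on a given device.

```dart
/// Real-time factor: processing time divided by audio duration.
/// Values below 1.0 mean transcription finishes before the next
/// chunk of the same length has been recorded.
double realTimeFactor(double processingSeconds, double audioSeconds) {
  if (audioSeconds <= 0) {
    throw ArgumentError.value(audioSeconds, 'audioSeconds', 'must be positive');
  }
  return processingSeconds / audioSeconds;
}
```

With the figures above, a 3-second chunk transcribed in 1 second gives a factor of about 0.33, while 3 seconds gives 1.0, which means the slow end of the range only just keeps up with incoming audio.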

Permissions

The app requires microphone permission for audio recording.

License

This project is licensed under the MIT License.

Acknowledgments

OpenAI for Whisper models
ggerganov for whisper.cpp and ggml
Flutter community for the mobile development framework

