Digitizer

An Android document scanner app that uses AI to extract and digitize text from images.

Features

Camera Capture - Take photos of documents directly
Gallery Import - Select existing images for processing
AI-Powered OCR - Uses Google Gemini AI for accurate text extraction
Markdown Export - Optionally save extracted text as Markdown (.md) files
Batch Processing - Process multiple documents at once
Smart Filenames - AI suggests appropriate filenames based on content
Model Selection - Choose between different Gemini AI models
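
As an illustration of the Smart Filenames feature, a model-suggested title has to be turned into something safe to use as a file name before saving. The sketch below is one possible way to do that; `sanitizeFilename`, its `fallback` value, and the 60-character limit are hypothetical and not taken from the app's source.

```kotlin
// Hypothetical helper (not from the app's source): converts a model-suggested
// title into a lowercase, underscore-separated file name without an extension.
fun sanitizeFilename(
    suggestion: String,
    fallback: String = "document",
    maxLength: Int = 60
): String {
    val cleaned = suggestion
        .trim()
        .lowercase()
        // Collapse every run of characters other than ASCII letters/digits into "_".
        .replace(Regex("[^a-z0-9]+"), "_")
        .trim('_')
        .take(maxLength)
        // Truncation may leave a trailing underscore; drop it.
        .trimEnd('_')
    // Fall back to a generic name if nothing usable is left.
    return cleaned.ifEmpty { fallback }
}
```

With these assumptions, `sanitizeFilename("Invoice #42: ACME Corp. (March 2024)")` yields `invoice_42_acme_corp_march_2024`, and a suggestion with no letters or digits, such as `"???"`, falls back to `document`.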

Screenshots

| Home Screen | Processing | Results |
| --- | --- | --- |
| Camera/Gallery options | AI extraction in progress | View and save extracted text |

How It Works

Tap Camera to capture a document or Gallery to select images
AI processes the image and extracts text
Review the extracted text and its Markdown conversion
Select a target directory and customize the filename
Toggle "Save Markdown" if you want .md files alongside images
Tap Save Document to store the digitized content
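
The final save step above can be sketched in plain Kotlin. This is a minimal illustration that assumes the target directory is an ordinary `java.io.File`; the app itself may well use Android's Storage Access Framework instead. The function name `saveExtractedText` and its parameters are hypothetical.

```kotlin
import java.io.File

// Hypothetical helper (not from the app's source): writes the extracted text as
// "<baseName>.md" in the chosen directory when the "Save Markdown" toggle is on.
// Returns the written file, or null when the toggle is off.
fun saveExtractedText(
    targetDir: File,
    baseName: String,
    markdown: String,
    saveMarkdown: Boolean
): File? {
    if (!saveMarkdown) return null
    // Create the directory (and any missing parents) if it does not exist yet.
    targetDir.mkdirs()
    val output = File(targetDir, "$baseName.md")
    // writeText uses UTF-8 by default and overwrites an existing file.
    output.writeText(markdown)
    return output
}
```

Returning `null` when the toggle is off keeps the caller's logic simple: it can save the image unconditionally and only report a Markdown path when one was actually written.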

Setup

API Key Configuration

This app requires a Google Gemini API key.

Get an API key from Google AI Studio
Create or edit local.properties in the project root:
```
apiKey=YOUR_GEMINI_API_KEY_HERE
```
Build and run the app

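The Secrets Gradle Plugin reads the key from local.properties at build time and exposes it to the app as a BuildConfig field. Keep local.properties out of version control (the standard Android .gitignore excludes it) so the key is never committed.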
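
For reference, wiring up the plugin typically looks like the following in the module-level Gradle script. This is a sketch based on the plugin's documented setup, not a copy of this project's build file; it assumes the plugin version is declared in the root build script.

```kotlin
// app/build.gradle.kts (sketch; the project's actual file may differ)
plugins {
    id("com.android.application")
    id("org.jetbrains.kotlin.android")
    // Reads local.properties and generates BuildConfig fields from its entries.
    // Assumption: the plugin version is declared in the root build.gradle.kts.
    id("com.google.android.libraries.mapsplatform.secrets-gradle-plugin")
}

android {
    buildFeatures {
        // BuildConfig generation must be enabled for the generated field to exist.
        buildConfig = true
    }
}
```

With an `apiKey=...` entry in local.properties, the key is then available in Kotlin code as `BuildConfig.apiKey`.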

Permissions

CAMERA - Required to capture document photos
READ_EXTERNAL_STORAGE - Required to access gallery images
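
On Android 13 (API 33) and higher, the platform replaces READ_EXTERNAL_STORAGE with per-media-type permissions such as READ_MEDIA_IMAGES. The sketch below shows one way to pick the permissions to request for a given API level. Whether the app handles this case is an assumption, and `requiredPermissions` is a hypothetical helper; only the permission strings come from the Android platform.

```kotlin
// Permission names as defined by the Android platform (android.Manifest.permission).
const val CAMERA = "android.permission.CAMERA"
const val READ_EXTERNAL_STORAGE = "android.permission.READ_EXTERNAL_STORAGE"
const val READ_MEDIA_IMAGES = "android.permission.READ_MEDIA_IMAGES"

// Hypothetical helper (not from the app's source): the runtime permissions to
// request for a device running the given API level (Build.VERSION.SDK_INT).
fun requiredPermissions(sdkInt: Int): List<String> {
    // API 33 split READ_EXTERNAL_STORAGE into per-media-type permissions.
    val gallery = if (sdkInt >= 33) READ_MEDIA_IMAGES else READ_EXTERNAL_STORAGE
    return listOf(CAMERA, gallery)
}
```

Taking the API level as a parameter, rather than reading it inside the function, keeps the helper testable off-device.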

Tech Stack

Kotlin - 100% Kotlin codebase
Jetpack Compose - Modern declarative UI
Google Generative AI SDK - Gemini AI integration
Material 3 - Latest Material Design components
Secrets Gradle Plugin - Secure API key management

Requirements

Android 9.0 (API 28) or higher
Google Gemini API key

Installation

From Release

Download the latest APK from Releases
Install and grant camera/storage permissions
Note: Text extraction requires a Gemini API key, so build from source with your own key for full functionality

Build from Source

git clone https://github.com/sunil-dhaka/Digitizer.git
cd Digitizer
# Add your API key to local.properties
echo "apiKey=YOUR_GEMINI_API_KEY" >> local.properties
./gradlew assembleDebug

Project Structure

app/src/main/java/com/example/digitizer/
    MainActivity.kt           # Entry point
    DocumentScannerScreen.kt  # Main UI composable
    BakingViewModel.kt        # AI processing logic
    UiState.kt                # UI state definitions
    FilePicker.kt             # File/directory selection
    PermissionHandler.kt      # Runtime permissions
    ui/theme/                 # Material 3 theming
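
To give a sense of what UiState.kt might contain, here is a minimal sketch of a sealed state type for the capture, process, and review flow. The state names and fields are assumptions for illustration, not the file's actual contents, and `statusLine` is a hypothetical consumer.

```kotlin
// Assumed shape of the UI state (illustrative; the real UiState.kt may differ).
sealed interface UiState {
    object Initial : UiState                                  // nothing captured yet
    object Loading : UiState                                  // extraction in progress
    data class Success(val extractedText: String) : UiState   // text ready for review
    data class Error(val message: String) : UiState           // extraction failed
}

// Hypothetical consumer: maps each state to a status message. The `when` is
// exhaustive, so adding a new state forces this function to be updated.
fun statusLine(state: UiState): String = when (state) {
    UiState.Initial -> "Choose Camera or Gallery"
    UiState.Loading -> "Extracting text..."
    is UiState.Success -> "Extracted ${state.extractedText.length} characters"
    is UiState.Error -> "Failed: ${state.message}"
}
```

A sealed type suits this kind of screen because the composable can switch on the state exhaustively instead of tracking several independent flags.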

AI Models

The app supports multiple Gemini models:

Gemini 1.5 Flash - Fast processing (recommended)
Gemini 1.5 Pro - Higher accuracy for complex documents
Gemini 2.0 Flash - Latest model with improved capabilities
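
One way to represent this choice in code is an enum that pairs each display name with the model identifier used by the Gemini API. The enum and its `fromId` helper are hypothetical; the identifiers are the publicly documented model names, but the exact strings the app uses are an assumption.

```kotlin
// Hypothetical model registry (not from the app's source).
enum class GeminiModel(val displayName: String, val modelId: String) {
    FLASH_1_5("Gemini 1.5 Flash", "gemini-1.5-flash"),
    PRO_1_5("Gemini 1.5 Pro", "gemini-1.5-pro"),
    FLASH_2_0("Gemini 2.0 Flash", "gemini-2.0-flash");

    companion object {
        // The README recommends Gemini 1.5 Flash, so it serves as the default here.
        val DEFAULT: GeminiModel = FLASH_1_5

        // Resolves a stored identifier, falling back to the default if it is unknown.
        fun fromId(id: String): GeminiModel =
            values().firstOrNull { it.modelId == id } ?: DEFAULT
    }
}
```

The selected `modelId` would then be passed as the `modelName` argument when constructing the SDK's `GenerativeModel`.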

License

MIT License - feel free to use, modify, and distribute.

Author

Built with Jetpack Compose by sunil-dhaka
