ScreenScribe

Capture. Extract. Format.

A macOS menu bar application for capturing screen regions and extracting content using AI-powered prompts. It includes built-in support for LaTeX and Markdown extraction and lets you create your own custom prompts.

Demo

Watch the application in action:

Menu Bar Access

Settings Panel

Features

Menu Bar Convenience: Lives in your menu bar for quick access.
Screen Capture: Use global keyboard shortcuts or the menu bar to capture any portion of your screen.
Text Extraction (Vision OCR): Uses Apple's built-in Vision framework for fast, offline text recognition.
AI-Powered Extraction: Leverages the Google Gemini API with customizable prompts for intelligent content extraction.
Built-in Prompts:
- LaTeX: Convert mathematical equations and formatted content to LaTeX code.
- Markdown: Extract and convert content to clean Markdown format.
Custom Prompts: Create, edit, and save your own prompts for specialized extraction needs.
Per-Prompt Output Format: Configure how each prompt formats output (line breaks, spaces, or LaTeX newlines).
Default Prompt: Set any prompt as the default for quick keyboard access.
Gemini Model Selection: Choose between different Gemini models to optimize for speed, cost, or accuracy.
Clipboard Integration: Automatically copies extracted content to your clipboard.
Customizable Shortcuts: Set global keyboard shortcuts for text extraction and your default prompt.
Recent History: Access recently captured results directly from the menu bar.
API Key Management: Securely enter and store your Google Gemini API key via the Settings panel.
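
To illustrate the offline text extraction feature, here is a minimal sketch of on-device OCR with Apple's Vision framework. It is not taken from the app's source: the helper name `recognizeTextLines` and the chosen recognition settings are assumptions, and the block is guarded because Vision exists only on Apple platforms.

```swift
#if canImport(Vision)
import Vision
import CoreGraphics

/// Runs on-device text recognition on `image` and returns one string per
/// detected line. `recognizeTextLines` is a hypothetical helper name.
func recognizeTextLines(in image: CGImage) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate   // assumed setting: favor accuracy over speed
    request.usesLanguageCorrection = true  // assumed setting

    // perform(_:) is synchronous, so results are populated once it returns.
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])

    // Keep the highest-confidence candidate for each detected line.
    let observations = request.results ?? []
    return observations.compactMap { $0.topCandidates(1).first?.string }
}
#endif
```

No network access is involved in this path, which is why text extraction works without an API key.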

Requirements

macOS: Version 14.0 (Sonoma) or later.
Google Gemini API Key: Required for AI-powered extraction (LaTeX, Markdown, and custom prompts). Get a free key from Google AI Studio.
Xcode: Version 16.0 or later (if building from source).

Gemini API Models and Rate Limits

ScreenScribe allows you to choose between the following Gemini models to optimize for your specific needs:

Model	Description
Gemini 3 Flash	Best balance of speed, cost, and accuracy
Gemini 3 Pro	Most capable model for complex content
Gemini 2.5 Flash-Lite	Fastest and most cost-effective option

Note: All models are available on the free tier, whose usage limit should be more than sufficient for personal use.
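
For readers curious how a model choice might translate into an API call, the sketch below builds (but does not send) a request to the Gemini `generateContent` REST endpoint with a prompt and a PNG capture. The function name `makeGeminiRequest` and its parameters are hypothetical, and the app's actual GeminiService may be structured differently. The model identifier is passed in as a string, and the caller is assumed to know the exact ID for the chosen model.

```swift
import Foundation
#if canImport(FoundationNetworking)
import FoundationNetworking  // URLRequest lives here on non-Apple platforms
#endif

/// Builds a generateContent request for the given model.
/// `makeGeminiRequest` is a hypothetical helper, not the app's API.
func makeGeminiRequest(prompt: String, pngData: Data,
                       model: String, apiKey: String) throws -> URLRequest {
    let endpoint = "https://generativelanguage.googleapis.com/v1beta/models/\(model):generateContent"
    guard let url = URL(string: endpoint) else { throw URLError(.badURL) }

    var request = URLRequest(url: url)
    request.httpMethod = "POST"
    request.setValue("application/json", forHTTPHeaderField: "Content-Type")
    request.setValue(apiKey, forHTTPHeaderField: "x-goog-api-key")

    // One text part (the prompt) and one inline image part (the capture).
    let textPart: [String: Any] = ["text": prompt]
    let imagePart: [String: Any] = [
        "inline_data": ["mime_type": "image/png",
                        "data": pngData.base64EncodedString()]
    ]
    let body: [String: Any] = ["contents": [["parts": [textPart, imagePart]]]]
    request.httpBody = try JSONSerialization.data(withJSONObject: body)
    return request
}
```

Switching models in this sketch only changes the URL path, so the same prompt and image payload can be reused across models.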

Installation

Quick Install

Download the latest .dmg from Releases
Open the DMG and drag ScreenScribe to Applications
Right-click the app and select "Open" (required for first launch)

Opening for the First Time

Because this app is not notarized by Apple, macOS will show a security warning. Here's how to open it:

Option 1: Right-click to Open (Recommended)

Open Finder and go to Applications
Right-click (or Control-click) on ScreenScribe
Select "Open" from the context menu
Click "Open" in the dialog that appears

Option 2: System Settings

Try to open the app normally (it will be blocked)
Go to System Settings > Privacy & Security
Scroll down to find the message about ScreenScribe being blocked
Click "Open Anyway"

This only needs to be done once. After the first successful launch, the app will open normally.

First Launch

When you first open ScreenScribe, a setup wizard will guide you through:

Screen Recording Permission - Required for screen capture functionality
API Key Setup - Optional, enables AI-powered extraction (LaTeX, Markdown, custom prompts)

The app will appear in your menu bar after setup is complete.

Usage

Capture:

Click the menu bar icon and select "Extract Text" for offline OCR, or choose a prompt (LaTeX, Markdown, or custom)
Use keyboard shortcuts: Cmd+T for text, Cmd+L for your default AI prompt

Select Area: Your cursor will turn into a crosshair. Click and drag to select the screen region.

Result:

A sound plays on successful capture
Content is automatically copied to your clipboard
Menu bar icon briefly shows a checkmark

History: Access recent captures from the menu bar under "Recent Captures"

Settings:

API Key: Enter your Google Gemini API Key for AI-powered extraction
Gemini Model: Select your preferred model (speed vs. accuracy trade-off)
Shortcuts: Customize keyboard shortcuts
Prompts: Create custom prompts, edit copy formats, set your default

Custom Prompts

Create your own prompts for specialized extraction:

Open Settings from the menu bar.
In the Prompts section, click the + button.
Enter a name and write your prompt instructions.
Choose your preferred copy format (line breaks, spaces, or LaTeX newlines).
Click Create to save.
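
The three copy formats can be pictured as different separators placed between the lines of the extracted result. The sketch below is one plausible interpretation, not the app's implementation: the `CopyFormat` type, its case names, and the whitespace trimming are all assumptions.

```swift
import Foundation

/// The three copy formats named in the steps above (hypothetical type and case names).
enum CopyFormat {
    case lineBreaks, spaces, latexNewlines

    /// Separator placed between consecutive lines of extracted output.
    var separator: String {
        switch self {
        case .lineBreaks:    return "\n"
        case .spaces:        return " "
        case .latexNewlines: return " \\\\\n"  // LaTeX line break (two backslashes) plus a newline
        }
    }

    /// Splits `text` into lines, trims each one, drops empty lines,
    /// and rejoins the rest with this format's separator.
    func apply(to text: String) -> String {
        text.split(whereSeparator: \.isNewline)
            .map { $0.trimmingCharacters(in: .whitespaces) }
            .filter { !$0.isEmpty }
            .joined(separator: separator)
    }
}
```

For example, `CopyFormat.spaces.apply(to:)` would flatten a multi-line result onto a single line, which suits pasting into a one-line text field.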

You can set any prompt as your default by selecting it and clicking Set as Default. The default prompt will be triggered by the keyboard shortcut (Cmd+L by default).

Note: Built-in prompts (LaTeX and Markdown) cannot be deleted or have their content modified, but you can change their copy format.
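
The rules above (any prompt can be the default; built-in prompts cannot be deleted or have their content changed, but their copy format can) can be sketched as a small value-type store. Everything here is illustrative: `Prompt`, `PromptStore`, `PromptError`, and the field names are hypothetical and may not match the app's Prompt.swift or PromptManager.swift.

```swift
import Foundation

/// Hypothetical prompt model; field names are assumptions.
struct Prompt: Codable, Equatable, Identifiable {
    let id: UUID
    var name: String
    var instructions: String
    var copyFormat: String   // e.g. "lineBreaks", "spaces", "latexNewlines"
    let isBuiltIn: Bool
}

enum PromptError: Error { case builtInIsReadOnly, notFound }

struct PromptStore {
    private(set) var prompts: [Prompt]
    private(set) var defaultPromptID: UUID?

    init(prompts: [Prompt]) {
        self.prompts = prompts
        self.defaultPromptID = prompts.first?.id
    }

    private func index(of id: UUID) throws -> Int {
        guard let i = prompts.firstIndex(where: { $0.id == id }) else { throw PromptError.notFound }
        return i
    }

    /// Any existing prompt, built-in or custom, may become the default.
    mutating func setDefault(_ id: UUID) throws {
        _ = try index(of: id)
        defaultPromptID = id
    }

    /// Built-in prompts cannot be deleted.
    mutating func delete(_ id: UUID) throws {
        let i = try index(of: id)
        guard !prompts[i].isBuiltIn else { throw PromptError.builtInIsReadOnly }
        prompts.remove(at: i)
        if defaultPromptID == id { defaultPromptID = prompts.first?.id }
    }

    /// Built-in prompt content is read-only.
    mutating func updateInstructions(_ id: UUID, to text: String) throws {
        let i = try index(of: id)
        guard !prompts[i].isBuiltIn else { throw PromptError.builtInIsReadOnly }
        prompts[i].instructions = text
    }

    /// The copy format may be changed on every prompt, including built-ins.
    mutating func updateCopyFormat(_ id: UUID, to format: String) throws {
        let i = try index(of: id)
        prompts[i].copyFormat = format
    }
}
```

Falling back to the first remaining prompt when the default is deleted is a design choice of this sketch; the app may handle that case differently.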

Building from Source

If you prefer to build the application yourself:

Clone the repository:

```
git clone https://github.com/SamuelZ12/screen-scribe.git
cd screen-scribe
```

Open in Xcode:
```
open ScreenScribe.xcodeproj
```
Select Scheme: Ensure the ScreenScribe scheme is selected.
Build/Run: Press Cmd+B to build or Cmd+R to run the application directly on your Mac. (Apps you build yourself typically don't trigger the same Gatekeeper warnings on your own machine.)
(Required for AI extraction) Configure API Key and Model: After running the built app, open its Settings panel from the menu bar icon, enter your Google Gemini API key, and select your preferred model.

Code Structure Overview

ScreenScribe/Sources/App.swift: Main application delegate, menu bar setup, capture initiation, and result handling.
ScreenScribe/Sources/Recognizer.swift: Handles text OCR using Apple's Vision framework.
ScreenScribe/Sources/Models/Prompt.swift: Prompt data model with built-in LaTeX and Markdown prompts.
ScreenScribe/Sources/Services/GeminiService.swift: Manages interaction with the Google Gemini API.
ScreenScribe/Sources/Services/PromptManager.swift: CRUD operations for prompts and persistence.
ScreenScribe/Sources/Settings/: Contains SwiftUI views for settings and prompt management.
ScreenScribe/Sources/Extensions/: Utility extensions for various AppKit/Foundation classes.
ScreenScribe/Info.plist: Application metadata and permission descriptions.
ScreenScribe.xcodeproj: Xcode project file.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Built on top of TextGrabber2 by cyanzhong
