TextTractor

A Swift package to extract text from images and PDFs using Apple's Vision framework.

This package provides both a command-line interface (CLI) for direct use and a core library (TextTractorCore) that can be easily integrated into larger macOS or iOS* GUI applications.

*Note: iOS compatibility is possible but the package is currently configured for macOS. See Package.swift.

Features

Extract text from various image formats (PNG, JPEG, etc.).
Extract text from PDF documents (multi-page supported).
Simple command-line interface.
Lightweight, easy-to-integrate library for use in other Swift projects.

Requirements

macOS 13.0 or later.
Swift 5.9 or later.
Xcode 14 or later.

CLI Usage

1. Build the CLI

First, build the project from the root directory:

swift build -c release

The executable text-tractor will be located in .build/release/.

2. Run the Executable

The CLI takes a single required argument: the path to the input file. It can either print the extracted text to the console or save it to a file.

Syntax:

.build/release/text-tractor <path-to-image-or-pdf> [options]

Arguments:

inputPath: The path to the input image or PDF file.

Options:

-o, --output-path <path>: The path to save the output .txt file. If omitted, text is printed to standard output.

Examples:

Extract text and print to console:

.build/release/text-tractor /path/to/my/image.png

Extract text and save to a file:

.build/release/text-tractor /path/to/my/document.pdf -o /path/to/output/extracted.txt

Library Integration for GUI Apps

You can easily use TextTractorCore in your own Xcode project (e.g., a SwiftUI or AppKit app).

1. Add Package Dependency

In Xcode, go to File > Add Packages... and enter the repository URL for this project.

https://github.com/inspirationull/texttractor

Xcode will fetch the package. When prompted to "Choose Package Products for TextTractor", select TextTractorCore and add it to your app's target.

2. Use in Code

Now you can import TextTractorCore and use the OCRService to process files.

import SwiftUI
import TextTractorCore

struct ContentView: View {
    @State private var extractedText = "Loading..."
    private let ocrService = OCRService()

    var body: some View {
        ScrollView {
            Text(extractedText)
                .padding()
        }
        .onAppear(perform: processFile)
    }

    private func processFile() {
        // Get a URL to a local image or PDF file
        guard let fileURL = Bundle.main.url(forResource: "my-document", withExtension: "pdf") else {
            self.extractedText = "Error: File not found."
            return
        }

        Task {
            do {
                let text = try await ocrService.processFile(at: fileURL)
                DispatchQueue.main.async {
                    self.extractedText = text
                }
            } catch {
                DispatchQueue.main.async {
                    self.extractedText = "An error occurred: \(error.localizedDescription)"
                }
            }
        }
    }
}

API

The public API of the TextTractorCore library is simple and focused.

`OCRService`

This is the main struct you will interact with.

public struct OCRService {
    public init() {}

    /// Main entry point: Process a file at a URL (Image or PDF)
    public func processFile(at url: URL) async throws -> String
}

init(): Creates a new instance of the service.
processFile(at: URL): Asynchronously processes the file at the given URL. It automatically detects whether the file is an image or a PDF. It returns the extracted text as a String or throws an OCRError.

`OCRError`

A custom error enum for handling processing failures.

public enum OCRError: Error {
    case invalidFile
    case pdfConversionFailed
    case processingFailed(Error) // Wraps an underlying system error
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Sources		Sources
.gitignore		.gitignore
LICENSE		LICENSE
Package.resolved		Package.resolved
Package.swift		Package.swift
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TextTractor

Features

Requirements

CLI Usage

1. Build the CLI

2. Run the Executable

Library Integration for GUI Apps

1. Add Package Dependency

2. Use in Code

API

`OCRService`

`OCRError`

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TextTractor

Features

Requirements

CLI Usage

1. Build the CLI

2. Run the Executable

Library Integration for GUI Apps

1. Add Package Dependency

2. Use in Code

API

OCRService

OCRError

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`OCRService`

`OCRError`

Packages