#

text-extraction

Here are 13 public repositories matching this topic...

datashare

ICIJ / datashare

A self-hosted search engine for documents.

docker elasticsearch extract text-extraction named-entity-recognition web-gui datashare investigative-journalism

Updated Nov 14, 2024
Java

Arxa / video_text_detection

Bachelor Thesis | Text extraction from complex video scenes

opencv video gradle javafx image-processing text-extraction junit testfx

Updated Mar 15, 2019
Java

senurah / java-tess4J-ocr

Tess4J CLI OCR Tool is a command-line application that extracts text from images and PDFs using the Tess4J library, with support for multiple languages. The extracted text is automatically copied to the clipboard for easy access.

java open-source pdf ocr image-processing text-extraction tesseract-ocr tess4j java-cli

Updated Sep 9, 2024
Java

mkalus / tika-page-extractor

Tika per page PDF extractor server returning content as JSON.

metadata pdf json tika text-extraction

Updated Mar 16, 2016
Java

FileFormatInfo / ff-pdf2txt

Simple server to extract text from a PDF

pdf text conversion text-extraction file-converter

Updated Dec 26, 2021
Java

matrix-maeny / Text-Detector

Extract Text from An Image.

text-extraction text-detection

Updated Jul 1, 2022
Java

hyuseinleshov / ocr-exporter-api

A Spring Boot-based OCR Exporter tool that extracts text from image or PDF files using the OCR Space API and exports the results to various formats such as PDF, text, Word, or a database.

ocr spring-boot text-extraction file-processing word-export pdf-processing text-export ocr-space-api

Updated Oct 27, 2024
Java

arachnio / arachnio4j

Arachnio client library for Java 11+

text-extraction web-scraping data-extraction article-extractor news-scraping web-scraping-java arachnio

Updated May 31, 2023
Java

KNiemeijer / Thesis-Text-Extraction

Text extraction: a highway to systematically process car reviews

nlp car text-analysis text-extraction opinion-mining corenlp

Updated Jun 17, 2017
Java

yadoc2text

aledbetter / yadoc2text

Yet Another Document 2 Text for pdf/doc/html/rft/etc - Extract text - or - convert to simplified HTML to retain layout information

natural-language-processing text-mining text-extraction text-extract text-extractor

Updated Apr 14, 2023
Java

jhecking / tika-lambda

Run Apache Tika as a service in AWS Lambda by scanning documents in S3 and storing the extracted text back to S3

lambda serverless text-extraction apache-tika aws-sam

Updated Jan 21, 2019
Java

Kajal-ghadage2000 / Text-Recognition-and-Extraction-Android-App

Extract and detect text from the captured image and also selected images from the gallery.

text-extraction text-recognition android-app mlkit-android text-extraction-from-image text-recognition-from-image

Updated May 31, 2021
Java

eitanflor / ShellHacks-2020

A Cloud-Native Infrastructure for License Plate Recognition and Text Extraction with Python Integration

python java machine-learning javafx text-extraction artificial-intelligence sqlserver google-cloud-platform cloud-sql cloudvision license-plate-recognition

Updated Oct 26, 2020
Java

Improve this page

Add a description, image, and links to the text-extraction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-extraction topic, visit your repo's landing page and select "manage topics."