Java ML Text Processor

This project is an educational demo of text processing with Apache OpenNLP, presented through a simple interactive JavaFX GUI. The application performs tokenization, sentence detection, part-of-speech tagging, and named entity recognition using machine learning models, covering the basic steps of an NLP pipeline.

Features

Tokenization: Splits text into individual tokens (words and punctuation marks).
Sentence Detection: Identifies and separates sentences within the text.
Part-of-Speech Tagging: Tags each token with its corresponding part of speech.
Named Entity Recognition (NER): Detects and labels named entities, such as people and organizations.
Export Results: Allows users to export processed text or results to a local file.

Prerequisites

Java 17 or higher
JavaFX SDK 22.0.1
Apache OpenNLP library (the repository includes apache-opennlp-2.3.3-bin.tar.gz)

Setup Instructions

Step 1: Clone the Repository

git clone https://github.com/vdrvar/java_ml_text_processor.git
cd java_ml_text_processor

Step 2: Set Up JavaFX

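Download the JavaFX SDK (version 22.0.1) from Gluon and extract it to a directory of your choice. If the run scripts cannot find the SDK, check the JavaFX path they reference and set any environment variable they expect so that it points at your install location.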

Step 3: Run the Application

Using the Shell Script (Linux/macOS)

Make the script executable:

chmod +x run.sh

Run the script:

./run.sh

Using the Batch Script (Windows)

Run the batch script:

run.bat

The script compiles the Java source files and then launches the application.

Usage

Input Text

Enter the text you want to process in the input area.

Tokenize

Click the Tokenize button to split the input text into individual tokens.
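
To show the kind of output tokenization produces, here is a minimal, dependency-free sketch. It is not the application's code: the application relies on OpenNLP, whereas this stand-in uses java.text.BreakIterator from the Java standard library, and the class and method names (TokenizeSketch, tokenize) are made up for the illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for tokenization; the application itself uses OpenNLP.
class TokenizeSketch {

    // Splits text at word boundaries and drops whitespace-only segments.
    static List<String> tokenize(String text) {
        BreakIterator words = BreakIterator.getWordInstance(Locale.US);
        words.setText(text);
        List<String> tokens = new ArrayList<>();
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            String piece = text.substring(start, end);
            if (!piece.isBlank()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Punctuation marks come out as separate tokens.
        System.out.println(tokenize("Hello, world!"));
    }
}
```

A model-based tokenizer may split some inputs differently from this rule-based one, so treat the sketch only as an illustration of the input and output shape.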

Detect Sentences

Click the Detect Sentences button to identify and separate sentences within the input text.
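
Sentence detection can be illustrated in the same dependency-free way. The sketch below again swaps in java.text.BreakIterator for the OpenNLP sentence detector the application uses; SentenceSketch and detect are hypothetical names.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for sentence detection; the application itself uses OpenNLP.
class SentenceSketch {

    // Returns each sentence with surrounding whitespace trimmed.
    static List<String> detect(String text) {
        BreakIterator sentences = BreakIterator.getSentenceInstance(Locale.US);
        sentences.setText(text);
        List<String> result = new ArrayList<>();
        int start = sentences.first();
        for (int end = sentences.next(); end != BreakIterator.DONE; start = end, end = sentences.next()) {
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                result.add(sentence);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        for (String sentence : detect("This is one. This is two.")) {
            System.out.println(sentence);
        }
    }
}
```

Rule-based boundary detection tends to stumble on abbreviations such as titles, which is one reason a trained model is used for this step.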

POS Tagging

Click the POS Tagging button to tag each token with its corresponding part of speech.
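
A tagger yields one tag per token, so the result is naturally a pair of parallel arrays. The sketch below only shows how such arrays could be combined for display; it does no tagging itself. The class name, the token_TAG output format, and the hand-written tags in main are assumptions for illustration, not output from the application or its models.

```java
// Hypothetical helper that joins tokens with their tags for display.
class PosPairSketch {

    // Formats parallel token and tag arrays as "token_TAG" pairs separated by spaces.
    static String pair(String[] tokens, String[] tags) {
        if (tokens.length != tags.length) {
            throw new IllegalArgumentException("Expected one tag per token");
        }
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < tokens.length; i++) {
            if (i > 0) {
                out.append(' ');
            }
            out.append(tokens[i]).append('_').append(tags[i]);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Hand-written example tags, not produced by a model.
        String[] tokens = {"The", "cat", "sleeps"};
        String[] tags = {"DT", "NN", "VBZ"};
        System.out.println(pair(tokens, tags));
    }
}
```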

NER

Click the NER button to detect and label named entities in the input text.
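
Name finders typically report entities as spans over the token array (a start index, an exclusive end index, and a type) rather than as strings. The sketch below shows how such spans can be turned into labeled text. EntitySpan is a hypothetical record written for this example, and the spans in main are hand-written, not model output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper that turns token-index spans into labeled entity strings.
class NerSpanSketch {

    // Illustrative span type: token indices, with end exclusive.
    record EntitySpan(int start, int end, String type) {}

    // Joins the tokens covered by each span and prefixes them with the entity type.
    static List<String> label(String[] tokens, List<EntitySpan> spans) {
        List<String> labeled = new ArrayList<>();
        for (EntitySpan span : spans) {
            String[] covered = Arrays.copyOfRange(tokens, span.start(), span.end());
            labeled.add(span.type() + ": " + String.join(" ", covered));
        }
        return labeled;
    }

    public static void main(String[] args) {
        String[] tokens = {"Ada", "Lovelace", "joined", "Acme", "."};
        // Hand-written spans for illustration only.
        List<EntitySpan> spans = List.of(
                new EntitySpan(0, 2, "person"),
                new EntitySpan(3, 4, "organization"));
        label(tokens, spans).forEach(System.out::println);
    }
}
```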

Export Results

Click the Export Results button to save the processed results to a file.
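
Saving results can be as simple as writing one result per line to a text file. The following is a minimal sketch under that assumption; the class name, the one-line-per-result format, and the use of a temporary file in main are illustrative choices, not a description of how the application stores its output.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Hypothetical export helper: writes one result per line as UTF-8 text.
class ExportSketch {

    // Creates or overwrites the target file with the given lines.
    static void export(List<String> lines, Path target) throws IOException {
        Files.write(target, lines, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // A temporary file keeps the demo from touching the working directory.
        Path target = Files.createTempFile("results", ".txt");
        export(List.of("Hello", ",", "world", "!"), target);
        System.out.println("Wrote " + Files.readAllLines(target).size() + " lines to " + target);
    }
}
```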

License

This project is licensed under the MIT License. See the LICENSE file for details.
