Master thesis: The task of image retrieval in historical publications

Digitized historical publications (magazines, books, documents) abound in iconography (engravings, illustrations, photographs, diagrams). In view of this, a natural need arises to search for images that fulfill a given information need.

This masters thesis is aimed at creating an application to search for visual content in newspapers originally published by the Chronicling America project. The area of work included:

Acquisition of high-resolution images using scraper.
Data preprocessing.
Training and evaluations of the object detection model.
Prediction and visualization of results.
Cropping visual content from test set of original images using resulting bounding boxes predictions, apply OCR on them and store results in SQLite database.
Implementation of full-text search engine.
Creating GUI that selects appropriate visual content according to the user's text query based on predictions and OCR results.

Dataset described here: https://news-navigator.labs.loc.gov

Instruction:

Before use:
- clone the repository,
- install requirements,
- install Tesseract OCR using homebrew (run "brew install tesseract"),
- install spaCy language core for english (run "python -m spacy download en_core_web_sm"),
- run "python setup.py install"
Run "python scraper_runner.py" to obtain high-resolution images from the Newspaper Navigator project.
Run "python preprocessing_runner.py" to create model input data from source annotations files.
Run "python model_runner.py" to start training, evaluation or both (feel free to try various argument values).
Run "python metric_runner.py" to calculate the average precision (AP) for each class, as well as its mean value (mAP).
Run "python visualization_runner.py" to visualize several random model predictions.
Run "python predict_runner.py" to make prediction on your own single newspaper image.
Run "python ocr_runner.py" to crop visual content from test set of original images using resulting bbox predictions, apply OCR on them and store results in SQLite database.
Run "python gui_runner.py" to launch GUI for full-text search through OCR results on predicted visual content.

IMPORTANT:

If you intend to use GPU install Pytorch using following command: "pip3 install torch==1.10.2+cu113 torchvision==0.11.3+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html".
Each script in directory named 'runner' is a command line application (despite of 'constants.py', where you can edit default arguments). Run each with argument '--help' to see the description of the other arguments.
Valid paths are generated automatically, but you can provide specific ones using click arguments in the command line for each runner.

Name		Name	Last commit message	Last commit date
Latest commit History 236 Commits
.github/workflows		.github/workflows
data		data
gui_graphics		gui_graphics
logs		logs
model_config		model_config
ocr_database		ocr_database
source_annotations		source_annotations
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

data

data

gui_graphics

gui_graphics

logs

logs

model_config

model_config

ocr_database

ocr_database

source_annotations

source_annotations

src

src

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Master thesis: The task of image retrieval in historical publications

Instruction:

Gallery:

Welcoming window:

Search results window:

Detailed single result window:

About

Releases

Packages

Languages

yngalxx/Master_degree

Folders and files

Latest commit

History

Repository files navigation

Master thesis: The task of image retrieval in historical publications

Instruction:

Gallery:

Welcoming window:

Search results window:

Detailed single result window:

About

Topics

Resources

Stars

Watchers

Forks

Languages