img_pdf_retrieval

Requirements

Git
Anaconda
Python 3.7.2

Setup

Installation

Type the commands below in the Anaconda Prompt, in order.

git clone https://gitlab.com/naoki.ohsugi/img_pdf_retrieval.git
cd img_pdf_retrieval
conda create -n img_pdf_retrieval python==3.7.2
conda activate img_pdf_retrieval
pip install -r requirements.txt

Install `haarcascade_frontalface_default.xml`

Download haarcascade_frontalface_default.xml and place it in the img_pdf_retrieval folder.

Setup for PDF search

If you do not need to search PDF files, you can skip this step. Otherwise, download poppler-windows from @oschwartz10612's repo. Then either add its bin/ folder to PATH or pass poppler_path = r"C:\path\to\poppler-xx\bin" as an argument to convert_from_path.
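The two options above (PATH versus an explicit poppler_path) can be sketched as follows. This is a minimal illustration, not the project's own code: the helper names poppler_kwargs and pdf_to_images and the example paths are hypothetical, and only convert_from_path and its poppler_path argument come from pdf2image.

```python
def poppler_kwargs(poppler_bin=None):
    """Build the optional keyword arguments for pdf2image.convert_from_path."""
    # If poppler's bin/ folder is already on PATH, nothing extra is needed.
    if not poppler_bin:
        return {}
    return {"poppler_path": poppler_bin}


def pdf_to_images(pdf_path, poppler_bin=None):
    """Render each page of a PDF as a PIL image (hypothetical helper)."""
    # Imported here so poppler_kwargs can be used without pdf2image installed.
    from pdf2image import convert_from_path

    return convert_from_path(pdf_path, **poppler_kwargs(poppler_bin))


# Example with hypothetical paths:
# pages = pdf_to_images(r"C:\docs\sample.pdf", r"C:\path\to\poppler-xx\bin")
```

Leaving poppler_bin unset relies on PATH, so the same call works on machines where poppler is installed system-wide.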

Update [FOLDERS] section in config.ini

Setup Target Folders

Open config.ini and edit or add the target folders from which image/PDF files are retrieved. You can specify multiple folders, keyed by serial numbers, as follows.

[FOLDERS]
0 = C:\Users\<User Name>\img_pdf_retrieval\data\targets\
1 = C:\...
2 = ...
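For illustration, a [FOLDERS] section in this shape could be read with the standard configparser module. This is only a sketch under assumptions: the function name parse_target_folders is hypothetical, and the project's own config_reader.py may work differently.

```python
import configparser


def parse_target_folders(config_text):
    """Return the folders listed under [FOLDERS], ordered by serial number."""
    # Interpolation is disabled so that '%' in Windows paths is kept literally.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(config_text)
    if not parser.has_section("FOLDERS"):
        return []
    # Keep only entries whose key is a serial number such as 0, 1, 2.
    numbered = [
        (int(key), value)
        for key, value in parser.items("FOLDERS")
        if key.isdigit()
    ]
    return [path for _, path in sorted(numbered)]


# Hypothetical sample content, mirroring the layout shown above.
sample = "[FOLDERS]\n0 = C:\\data\\targets\\\n1 = D:\\scans\\\n"
folders = parse_target_folders(sample)
```

Sorting by the integer key keeps the folders in serial order even if the lines in config.ini are rearranged.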

Indexing

Before starting the server, index the target folders.

python indexing.py

Indexing can take a few hours, depending on the registered folders and the number of files found in them.
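Because the indexing time depends on how many files the target folders contain, it can help to count them first. The sketch below is not part of the project: count_target_files is a hypothetical helper, and the extension list is an assumption about which file types get indexed.

```python
import os

# Assumed extensions; the project's real list may differ.
TARGET_EXTENSIONS = {".jpg", ".jpeg", ".png", ".pdf"}


def count_target_files(folders, extensions=TARGET_EXTENSIONS):
    """Count image/PDF files under the given folders, including subfolders."""
    total = 0
    for folder in folders:
        for _root, _dirs, files in os.walk(folder):
            for name in files:
                # Compare extensions case-insensitively (.JPG equals .jpg).
                if os.path.splitext(name)[1].lower() in extensions:
                    total += 1
    return total
```

Calling count_target_files with the folders from config.ini gives a rough sense of how long a run of indexing.py might take.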

Launching the search server and opening the screen

python search_server.py

Open http://127.0.0.1:5000/ in a browser.
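If the page does not load, a quick check from Python can tell whether the server is listening at all. This is a minimal sketch using only the standard library; is_server_up is a hypothetical helper, not something the project provides.

```python
import urllib.error
import urllib.request


def is_server_up(url="http://127.0.0.1:5000/", timeout=2.0):
    """Return True if an HTTP server answers at url, False otherwise."""
    # Bypass any configured proxy, since the server runs on localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even if with an error status.
        return True
    except OSError:
        # Covers URLError, refused connections, and timeouts.
        return False
```

A False result usually means search_server.py is not running yet or is listening on a different port.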

Search Server Screen

Push the button and select an image to search for.
Push the Submit button.
The input image is shown in area (3).
The images found are shown in area (4).

TODO

Record the timestamps of the source files in the database, and skip files that are already indexed and have not been updated.
Run in the system tray and execute indexing periodically (e.g. every 12 hours).
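The first TODO item could be approached by comparing each file's modification time with the one recorded at its last indexing. The sketch below is one possible shape, not the planned implementation: needs_indexing and mark_indexed are hypothetical names, and a plain dict stands in for the database.

```python
import os


def needs_indexing(path, indexed_mtimes):
    """Return True if path is new or was modified after it was last indexed."""
    current = os.path.getmtime(path)
    previous = indexed_mtimes.get(path)
    return previous is None or current > previous


def mark_indexed(path, indexed_mtimes):
    """Record the file's current modification time after indexing it."""
    # indexed_mtimes is a dict here; a real version would write to the database.
    indexed_mtimes[path] = os.path.getmtime(path)
```

An indexing loop would call needs_indexing for each file, extract features only when it returns True, and then call mark_indexed.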

Credit

https://github.com/danghieuan/image-retrieval-system
