Skip to content

fabraz/fastagger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fastagger

Fastagger is a stream segmentation label aiding tool.

Operation

Fastagger exports a pdf file pages to images and it serves them in its user interface, where you can label each page of the document, using the shorkeys (N - next page, P - previous page, 1 - new document, 2 - same document). You might want to download the json file with labels you have given to pages, by clicking on save button.

In the picure bellow you can see Fastagger UI.

fastagger

How to run

Clone the repo

git clone https://github.com/fabraz/fastagger

Copy your pdfs

Every pdf you copy to the path `./pdfs will be available for labelling.

Docker run

docker-compose up -d --build

Check out Fastagger UI at http://localhost:3000

Citation

Fabricio Ataides Braz, Nilton Correia da Silva, Jonathan Alis Salgado Lima, Leveraging effectiveness and efficiency in Page Stream Deep Segmentation, Engineering Applications of Artificial Intelligence, Volume 105, 2021, 104394, ISSN 0952-1976, https://doi.org/10.1016/j.engappai.2021.104394. (https://www.sciencedirect.com/science/article/pii/S0952197621002426)

Latex

@article{BRAZ2021104394,
title = {Leveraging effectiveness and efficiency in Page Stream Deep Segmentation},
journal = {Engineering Applications of Artificial Intelligence},
volume = {105},
pages = {104394},
year = {2021},
issn = {0952-1976},
doi = {https://doi.org/10.1016/j.engappai.2021.104394},
url = {https://www.sciencedirect.com/science/article/pii/S0952197621002426},
author = {Fabricio Ataides Braz and Nilton Correia {da Silva} and Jonathan Alis Salgado Lima},
keywords = {Page Stream Segmentation, Classification},    
}

How to Contribute

Just open your issues and/or pull requests, all are welcome 😃!