Prerequisites

brew install ffmpeg

A lot of the code especially processing SmartDoc dataset was initially taken from this repo, all the credits goes there.

Dataset

You need to download and adjust the command below accordingly. The first parameter is the path to raw dataset and second path is where the processed results will be placed into. You can find the dataset here

python main.py process-smartdoc ../datasets/SmartDoc/ ../data-doc/SmartDocProcessed/

Generate dataset from processed smartdoc

python main.py document-data-generator ../data-doc/SmartDocProcessedTrain/ ../data-doc/SmartDocMaskedTrain/
python main.py document-data-generator ../data-doc/SmartDocProcessedValid/ ../data-doc/SmartDocMaskedValid/

python main.py train --name t1 --train-dir ../data-doc/SmartDocMaskedTrain --valid-dir ../data-doc/SmartDocMaskedValid

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.vscode		.vscode
dataprocessor		dataprocessor
notebooks		notebooks
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
experiment.py		experiment.py
main.py		main.py
model.py		model.py
trainer.py		trainer.py
transforms.py		transforms.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prerequisites

Dataset

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Prerequisites

Dataset

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages