jwr0218/PV2DOC

PV2DOC - PresentationVideo2Document

VL2D framework

Environment (Docker image)

docker pull tensorflow/tensorflow:2.7.0-gpu
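After pulling the image, a container still has to be started from it. The following is a minimal sketch, assuming the NVIDIA Container Toolkit is installed on the host (needed for `--gpus all`) and that the repository is checked out in the current directory. The function name and the `/workspace` mount point are illustrative and do not come from this repository.

```shell
# Image name taken from the `docker pull` command in this README.
PV2DOC_IMAGE="tensorflow/tensorflow:2.7.0-gpu"

# Hypothetical helper: start an interactive GPU container with the
# current directory mounted at /workspace (an assumed path).
run_pv2doc_container() {
  docker run --gpus all -it --rm \
    -v "$(pwd)":/workspace \
    -w /workspace \
    "$PV2DOC_IMAGE" bash
}

# Usage, from the repository root:
#   run_pv2doc_container
```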

Environment (apt packages)

Install the following packages with apt-get:
ffmpeg=7:4.2.7-0ubuntu0.1
libgl1-mesa-glx=21.2.6-0ubuntu0.1~20.04.2
libglib2.0-0=2.64.6-1~ubuntu20.04.6
lmodern=2.004.5-6
pandoc=2.5-3build2
python-dev
tesseract-ocr=4.1.1-2build2
texlive-xetex=2019.20200218-1
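The package list above can be installed in one step. This is a sketch under assumptions: it is meant to run as root inside the container, the function and variable names are illustrative, and the pinned versions may no longer be available in the package archive, in which case dropping the `=version` suffix is one possible fallback.

```shell
# Pinned packages copied from the list in this README.
PV2DOC_APT_PACKAGES="
ffmpeg=7:4.2.7-0ubuntu0.1
libgl1-mesa-glx=21.2.6-0ubuntu0.1~20.04.2
libglib2.0-0=2.64.6-1~ubuntu20.04.6
lmodern=2.004.5-6
pandoc=2.5-3build2
python-dev
tesseract-ocr=4.1.1-2build2
texlive-xetex=2019.20200218-1
"

# Hypothetical helper: refresh the package index, then install everything.
# The variable is left unquoted on purpose so the shell splits it into
# one argument per package.
install_pv2doc_packages() {
  apt-get update &&
    DEBIAN_FRONTEND=noninteractive apt-get install -y $PV2DOC_APT_PACKAGES
}

# Usage, as root inside the container:
#   install_pv2doc_packages
```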

Dependency

pip install -r requirements.txt

Download Mask-RCNN Model (Figure & Formula Detection)

The Mask-RCNN model is already uploaded to this Git repository. If you have trouble downloading it from there, you can download the models from the links below.

MRCNN model:
https://drive.google.com/file/d/1PTzFMJp-pF2Tt-EwPyibfj2w0KMfm9Mi/view?usp=sharing
YOLO model:
https://drive.google.com/file/d/1xwnx3B290BWID0JfhJU87ya82yC0mnUT/view?usp=sharing
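The two Google Drive files can also be fetched from the command line. The sketch below assumes the third-party `gdown` tool (installable with `pip install gdown`), which this README does not mention. The function name is illustrative, and the files are saved to the current directory because the README does not state where the code expects them.

```shell
# File IDs taken from the two Google Drive links in this README.
MRCNN_FILE_ID="1PTzFMJp-pF2Tt-EwPyibfj2w0KMfm9Mi"
YOLO_FILE_ID="1xwnx3B290BWID0JfhJU87ya82yC0mnUT"

# Hypothetical helper, assuming gdown is installed. Files are written to
# the current directory under the names Google Drive reports.
download_pv2doc_models() {
  gdown "https://drive.google.com/uc?id=${MRCNN_FILE_ID}" &&
    gdown "https://drive.google.com/uc?id=${YOLO_FILE_ID}"
}

# Usage:
#   download_pv2doc_models
```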

Run Our Solution

python main.py [file_name]
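To process several inputs, the command above can be wrapped in a loop. This is a sketch under assumptions: it treats `[file_name]` as a path to a video file and assumes the inputs have an `.mp4` extension, neither of which the README confirms. The function name is illustrative.

```shell
# Hypothetical helper: run main.py once per .mp4 file in a directory.
# Assumes main.py accepts a file path as its single argument.
convert_all_videos() {
  dir="$1"
  for video in "$dir"/*.mp4; do
    # If nothing matches, the glob stays literal; skip it.
    [ -e "$video" ] || continue
    python main.py "$video"
  done
}

# Usage, from the repository root:
#   convert_all_videos ./videos
```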

Mask-RCNN Custom

code

git lfs pull
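If `git lfs pull` has not run successfully, large files in the working tree are small text pointers rather than the real model weights. The check below is a minimal sketch that relies on the Git LFS pointer format, whose first line starts with `version https://git-lfs.github.com/spec/v1`. The function name and the example path are illustrative and do not come from this repository.

```shell
# Hypothetical helper: succeed if the given file is still a Git LFS
# pointer, which means the real content has not been downloaded yet.
is_lfs_pointer() {
  head -n 1 "$1" | grep -q '^version https://git-lfs\.github\.com/spec/v1'
}

# Usage (the path is a placeholder):
#   if is_lfs_pointer path/to/model_file; then
#     git lfs install   # one-time setup of the LFS hooks
#     git lfs pull      # fetch the real file contents
#   fi
```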
