Skip to content

Parses IB Math PDF Papers into usable PNGs using correct directory structure

Notifications You must be signed in to change notification settings

ja5087/paper-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

paper-parser

prereqs

Please make sure to have installed:

  1. opencv (2.4) according to node_opencv and set OPENCV_DIR and PATH=%OPENCV_DIR%\bin properly
  2. tesseract (latest) and set TESSDATA_PREFIX to /tessdata where you installed it
  3. cpdf (latest)
  4. ImageMagick + Legacy Tools (convert)

usage

  1. install using npm install for a private github
  2. var paperparser = require('paper-parser')
  3. paperparser() everytime you want to complete rebuild (because there's no checking for existing files soz);

Parses all PDFs in /input as IB Maths Papers

Dirty I know.

Doesn't work on some questions.

Tested to work on basically nothing.

Will output images in folder structure:

/static/year/group/subject/paper/language/timezone

Example:

__dirname/static/M13/5/MATME/SP2/ENG/TZ1

About

Parses IB Math PDF Papers into usable PNGs using correct directory structure

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published