Skip to content

This repository will automate the process of saving snapshots of highlighted content within multiple pdf files.

Notifications You must be signed in to change notification settings

Erdos1729/automated-snapshot-of-annotated-content-in-pdfs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Automated snapshot of annotated content in pdfs

This repository will automate the process of saving snapshots of highlighted content within multiple pdf files.

Instructions

  • pip install -r requirements
  • Run snapshot_ext.py

Reference

I devised the solution from the following pages of the documentation:

  • Tutorial page for introduction to the fitz liabrary
  • page.searchFor to solve the return type for searchFor method
  • fitz.Rect to identify what the returned objects from page.searchFor
  • fitz.Point to provide parameters to create a cropping box around the highlighted content
  • Collection of Recipes page to implement cropping, saving and working with images as spnapshots

🚀 Quickstart

Installing the requirements will automatically install all dependencies. Make sure you install the requirements before running the code. Also note that this requires Python 3.6+.

About

This repository will automate the process of saving snapshots of highlighted content within multiple pdf files.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages