Skip to content

Example script that recovers metadata from PDF and DOCX files

License

Notifications You must be signed in to change notification settings

hartek/metadatos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

metadatos

Script that recovers metadata from PDF, DOCX, JPEG, PNG, GIF, BMP and TIFF files.

Module dependencies (installable via pip):

  • PyPDF2
  • python-docx
  • PIL (pillow)
  • exifread
  • libxmp (python-xmp-toolkit)
  • termcolor

This command will install the dependencies via Pip:

pip install -r requirements.txt

How to use

You just have to execute the script with the folder containing the files to be analysed as an argument.

For instance, this will analyse all files inside ./samples.

python metadatos.py ./samples

If you want a colored output, you can use -c argument.

python metadatos.py -c ./samples 

About

Example script that recovers metadata from PDF and DOCX files

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages