Skip to content

This python script allows to automate PDF downloads from web-pages and html files. It automatically searches all the PDF links from the given URL page and starts downloading them. It can automatically use up to Four Threads if there are more number of PDF files and helps in fast download.

Notifications You must be signed in to change notification settings

gassantos/Automate-pdf-Downloads

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Automate-pdf-Downloads

This python script allows to automate PDF downloads from web-pages and html files. It automatically searches all the PDF links from the given URL page and starts downloading them. It can automatically use up to Four Threads if there are more number of PDF files and helps in fast download.

Requirements:

This script uses all the built-in libraries provided by python except for two libraries which are beautifulsoup and requests. They can be installed through pip using the command 'pip install beautifulsoup4' and 'pip install requests'. These can also be installed by running the requirements.py file which is present in this repository by running 'python requirements.py'.

Using the Script:

  • From Command Line Argument:

python getpdf.py, url

python getpdf.py, url1, url2, url3....

python getpdf.py, afile.html

python getpdf.py, afile1.html, afile2.html, afile3.html...

  • Passing argument after running the script: python getpdf.py

-- Now arguments can be passed in the input prompt

About

This python script allows to automate PDF downloads from web-pages and html files. It automatically searches all the PDF links from the given URL page and starts downloading them. It can automatically use up to Four Threads if there are more number of PDF files and helps in fast download.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%