we have one week left... pokemon... gotta download them all...
first you'll need to have python3 installed and clone this repo:
https://github.com/MiniGlome/Archive.org-Downloader
and / or : https://github.com/vmbrasseur/iadownload
then you can add your archive.com credentials in the .txt file
and run my script ... paste the book url that you want to download. magic.
Python3 script to download archive.org books in PDF format
There are many great books available on https://openlibrary.org/ and https://archive.org/, however, you can only borrow them for 1 hour to 14 days and you don't have the option to download it as a PDF to read it offline or share it with your friends. I created this program to solve this problem and retrieve the original book in pdf format for FREE!
Of course, the download takes a few minutes depending on the number of pages and the quality of the images you have selected. You must also create an account on https://archive.org/ for the script to work.
To get started you need to have python3 installed. If it is not the case you can download it here : https://www.python.org/downloads/
Make sure you've already git installed. Then you can run the following commands to get the scripts on your computer:
git clone https://github.com/MiniGlome/Archive.org-Downloader.git
git clone https://github.com/yotsugi33/archive.it
cd archive.itThe script requires the modules requests, tqdm and img2pdf, you can install them all at once with this command:
pip install -r requirements.txtusage: archive-org-downloader.py [-h] -e EMAIL -p PASSWORD [-u URL] [-d DIR] [-f FILE] [-r RESOLUTION] [-t THREADS] [-j]
optional arguments:
-h, --help show this help message and exit
-e EMAIL, --email EMAIL
Your archive.org email
-p PASSWORD, --password PASSWORD
Your archive.org password
-u URL, --url URL Link to the book (https://archive.org/details/XXXX). You can use this argument several times
to download multiple books
-d DIR, --dir DIR Output directory
-f FILE, --file FILE File where are stored the URLs of the books to download
-r RESOLUTION, --resolution RESOLUTION
Image resolution (10 to 0, 0 is the highest), [default 3]
-t THREADS, --threads THREADS
Maximum number of threads, [default 50]
-j, --jpg Output to individual JPG's rather than a PDF
-m, --meta Output the metadata of the book to a json fileThe email and password fields are required, so to use this script you must have a registered account on archive.org.
The -r argument specifies the resolution of the images (0 is the best quality).
The PDF are downloaded in the current folder
This command will download the 3 books as pdf in the best possible quality. To only download the individual images you can use --jpg.
python3 archive-org-downloader.py -e myemail@tempmail.com -p Passw0rd -r 0 -u https://archive.org/details/IntermediatePython -u https://archive.org/details/horrorgamispooky0000bidd_m7r1 -u https://archive.org/details/elblabladelosge00gaut If you want to download a lot of books, you can paste the urls of the books in a .txt file (one per line) and use --file
python3 archive-org-downloader.py -e myemail@tempmail.com -p Passw0rd --file books_to_download.txtIf you want to support my work, you can send 2 or 3 solana 🙃 to this address:
4C3Tuiv9KkqVSqtCNWgSeS3dzfCSBaGCB3mUpL968Ujd
