Skip to content
/ DriveIt Public
forked from XIAZY/DriveIt

A New Crawler Supports Multiple Websites

License

Notifications You must be signed in to change notification settings

bllli/DriveIt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DriveIt

DriveIt is a new crawler supports multiple websites, for now it supports

Overview

This project is still under development. More features will be added later.

Usage

Simply run it with Python 3. You may need to install some dependencies from PyPi. Make sure to install a JavaScript runtime before you start (like Node.js).

sudo pip3 install PyExecJS beautifulsoup4

Then you should be able to run it happily. To start, type

python3 driveit.py -u <FlyleafURL>

If you want to download a given number of latest chapters, use

python3 driveit.py -u <FlyleafURL> -l <num>

For example: eg

Or if you prefer GUI to CLI:

python3 driveit-gui.py

Note you need to have PyQt5 installed to use the GUI version. For Mac users, you can install it via

brew install pyqt5

For example: eg_gui

It can automatically creates subfolders followed by chapters, fetched picture will be stored in the proper location. For instance, chapter 1 page 1 will be stored in /name of the comic/Chapter 1/1.jpg.

Complied versions for Mac and Windows are available under Releases.

New websites can be easily supported. I'm now working on it.

By The Way

  • A flyleaf page means the index page of the comic. For instances:

  • Reading-driven development. Update frequency may be unstable depends on how far I read.

  • Note that the ck101 website is blocked in Mainland China. You may need a global VPN or Proxychains to fetch comics from it.

  • If you want to fetch comics from DMZJ, make sure the flyleaf address begins with www.dmzj.com instead of manhua.dmzj.com. The logic to fetch comics from these two domains are different.

  • Sometimes you will receive connection reset if you try to fetch comics from eHentai if you are in Mainland China. Use a global VPN or Proxychains instead.

  • Personally, I'll recommend you to fetch comics from DMZJ. For me this website is the fastest one.

License

Copyright 2016 XIAZY

Licensed under the WTFPL License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.wtfpl.net/

About

A New Crawler Supports Multiple Websites

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%