lim4349/Comics-Crawler

Don't crawl several comics at the same time.

I have not yet worked out separate directories for each comic.

This is made for crawling images from 'https://readcomiconline.to'.

I tested it with 'Walking Dead' and 'Batman-Damned'.

But I am pretty sure the site uses the same format for all comics.

If you want to use this for other comics, change the URL in the second block.

The default image quality is the high-quality option.

If you want low-quality pictures, change 'hq' to 'lq' in the URL.
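
The quality switch described above can be sketched in Python as follows. This is only a sketch under assumptions: `set_quality` is a hypothetical helper that is not part of the crawler, the example URL path is made up, and it assumes 'hq' appears as a query-string value in the URL.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def set_quality(url: str, quality: str = "lq") -> str:
    """Return url with any query value 'hq' or 'lq' replaced by quality.

    Hypothetical helper: assumes the quality marker is a query-string value.
    """
    if quality not in ("hq", "lq"):
        raise ValueError("quality must be 'hq' or 'lq'")
    parts = urlsplit(url)
    # Rewrite only the values that are exactly 'hq' or 'lq'; keep the rest.
    query = [
        (key, quality if value in ("hq", "lq") else value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
    ]
    return urlunsplit(parts._replace(query=urlencode(query)))


# Example with a made-up issue URL (the path and parameters are assumptions).
low_quality_url = set_quality(
    "https://readcomiconline.to/Comic/Example/Issue-1?id=1&quality=hq"
)
```

Replacing only whole query values avoids changing an 'hq' that happens to appear inside a comic name.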

Images are saved as issue-name/index.jpg.
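
The issue-name/index.jpg layout can be sketched like this. `image_path` and `save_image` are hypothetical names, not functions from the notebook, and whether the index starts at 0 or 1 or is zero-padded is an assumption here.

```python
from pathlib import Path


def image_path(issue_name: str, index: int, root: Path = Path(".")) -> Path:
    """Build root/issue-name/index.jpg (index used as-is, no zero padding assumed)."""
    return root / issue_name / f"{index}.jpg"


def save_image(data: bytes, issue_name: str, index: int, root: Path = Path(".")) -> Path:
    """Write raw image bytes to issue-name/index.jpg, creating the folder if needed."""
    target = image_path(issue_name, index, root)
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(data)
    return target
```

For example, `image_path("Issue-1", 3)` points at `Issue-1/3.jpg` relative to the current folder.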

Only tested on macOS with Chrome.

Requires the Chrome browser, Selenium, and ChromeDriver.

ChromeDriver link: http://chromedriver.chromium.org/downloads

ChromeDriver must be in the same folder as crawler.ipynb.
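
A minimal sketch of starting Chrome with a ChromeDriver that sits next to the notebook is shown below. It assumes the Selenium 4 `Service` API, a driver file named `chromedriver`, and that the notebook runs from its own folder; `find_chromedriver` and `start_chrome` are hypothetical helpers, not the crawler's own code.

```python
from pathlib import Path


def find_chromedriver(folder=None) -> Path:
    """Return the path to 'chromedriver' in folder (default: current folder)."""
    base = Path(folder) if folder is not None else Path.cwd()
    candidate = base / "chromedriver"  # assumed file name on macOS
    if not candidate.is_file():
        raise FileNotFoundError(f"chromedriver not found in {base}")
    return candidate


def start_chrome(folder=None):
    """Start Chrome through Selenium using the local chromedriver binary."""
    # Imported here so the path check above works even without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    driver_path = find_chromedriver(folder)
    return webdriver.Chrome(service=Service(executable_path=str(driver_path)))
```

Checking for the file first gives a clearer error than a failed browser launch when the driver is missing.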

Change Log

Update one page crawler.

Add functions to avoid CAPTCHA.

Expand to other comics.
