Skip to content

A short and simple python crawler, that uses Webkit and executes Javascript

Notifications You must be signed in to change notification settings

invernizzi/js-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

js-crawler

A short and simple web crawler written in Python, that uses Webkit and executes Javascript.

How to use

crawler = Crawler(gui=True,                                                 # To see the crawler in action
                  is_link_interesting=lambda url, text: 'download' in url)  # Follow every link containing
                                                                            #  "download" in the url
crawler.crawl('http://firefox.com')
crawler.close()

About

A short and simple python crawler, that uses Webkit and executes Javascript

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages