Skip to content

2shou/PhantomjsFetcher

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

PhantomjsFetcher

A python web fetcher using phantomjs and tornado to mock browser.

Before using

  1. install phantomjs and start with:

$ phantomjs phantomjs_fetcher.js [port]

  1. install tornado with pip:

$ pip install tornado

Sample Code

from tornado_fetcher import Fetcher

# create a fetcher
>>> fetcher=Fetcher(
  user_agent='phantomjs', # user agent
  phantomjs_proxy='http://localhost:12306', # phantomjs url
  pool_size=10, # max httpclient num
  async=False
  )
# fetch html after rendering javascript from url
>>> fetcher.fetch(url)
# or execute additional javascript after rendering end, which must be a function
>>> fetcher.fetch(url, js_script='setTimeout("function(){window.scrollTo(0,100000)}", 1000)')

Reference

pyspider

About

A python web fetcher using phantomjs to mock browser

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages