manhtai/vne-scrapy

VnExpress crawler

Items

vnexpress.items.VnexpressItem

Spider

sohoa

  • Request content directly, then
  • Send GET requests to get comments from another server
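The two-step flow above can be sketched with plain helper functions. This is a minimal illustration, not the repository's actual spider: the endpoint URL, the query parameter names (`objectid`, `limit`), and the JSON response shape are all hypothetical, since the real comment server's interface is not described here.

```python
import json
from urllib.parse import urlencode

# Hypothetical endpoint, for illustration only; the real comment server differs.
COMMENT_ENDPOINT = "https://comments.example.com/api/list"


def build_comment_url(article_id, limit=20):
    """Build the GET URL for the second request (comments for one article)."""
    # Parameter names are assumptions, not the real API.
    query = urlencode({"objectid": article_id, "limit": limit})
    return "{}?{}".format(COMMENT_ENDPOINT, query)


def attach_comments(item, response_body):
    """Merge comments from a JSON response body into a partially filled item."""
    payload = json.loads(response_body)
    # Assumed response shape: {"comments": [{"content": "..."}, ...]}
    merged = dict(item)
    merged["comments"] = [c["content"] for c in payload.get("comments", [])]
    return merged
```

In a Scrapy spider, the first callback would typically yield a `scrapy.Request` for the comment URL and carry the partially filled item along in `meta`, and the second callback would finish the item before yielding it.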

Pipelines

  • Require certain item fields
  • Convert lists to strings for saving to MySQL
  • Save items to MySQL (you have to create the table first)
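The three pipeline steps can be sketched as small, driver-independent helpers. This is a rough sketch under stated assumptions: the field names in `REQUIRED_FIELDS` and the function names are hypothetical, because the actual fields of `VnexpressItem` are not listed here. The `%s` placeholder style matches what common MySQL drivers for Python expect.

```python
# Hypothetical field names; the real VnexpressItem fields are not listed here.
REQUIRED_FIELDS = ("title", "url")


def missing_fields(item, required=REQUIRED_FIELDS):
    """Return the required fields that are absent or empty in the item."""
    return [name for name in required if not item.get(name)]


def flatten_lists(item, separator=", "):
    """Return a copy of the item with list/tuple values joined into strings."""
    flat = {}
    for key, value in item.items():
        if isinstance(value, (list, tuple)):
            flat[key] = separator.join(str(v) for v in value)
        else:
            flat[key] = value
    return flat


def build_insert(table, item):
    """Build a parameterized INSERT statement plus its values.

    Table and column names are interpolated directly, so they must come
    from trusted code, never from scraped data. Values go through
    placeholders and are escaped by the database driver.
    """
    columns = sorted(item)
    placeholders = ", ".join(["%s"] * len(columns))
    sql = "INSERT INTO {} ({}) VALUES ({})".format(
        table, ", ".join(columns), placeholders
    )
    return sql, [item[c] for c in columns]
```

Inside a Scrapy pipeline, `process_item` could call `missing_fields` and raise `scrapy.exceptions.DropItem` when the result is non-empty, then pass the flattened item to `cursor.execute(sql, values)` on an open MySQL connection.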

Alternatives

  • To get AJAX content: Selenium+PhantomJS, Splash+ScrapyJS
  • To save data: MongoDB, ElasticSearch

About

A simple crawler using Scrapy
