Skip to content

princhenee/IWCT_Weibo_Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IWCT_Weibo_Crawler

This repository designing a sina weibo crawler is dedicated to the research program of IWCT,SJTU

=======

#Requirements:

  • Scrapy >= 0.14
  • redis-py (tested on 2.4.9)
  • redis server (tested on 2.4-2.6)
  • BeautifulSoup
  • pymongo

Installation

$ sudo apt-get install redis-server
$ sudo pip install requirements.txt

IWCT_Weibo_Crawler的功能

  1. 微博模拟登录

  2. 抓取任务接口(用户资料/朋友网/微博内容等)

  3. 页面内容解析

  4. 数据存储(Redis/MongoDB)

IWCT_Weibo_Crawler Provides

  1. WEIBO Login Simulator

  2. Extraction Task Interface(user profile/social network/weibos etc.)

  3. Weibo Page Parser

  4. Data Storage(Redis/MongoDB)

How to Use IWCT_Weibo_Crawler

  • run command **$ scrapy crawl weibospider ** on your console
  • under current directory

About

This repository designing a sina weibo crawler is dedicated to the research program of IWCT,SJTU

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published