捉去豆瓣电影,用于分析
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
douban_movie
job
lagou
.gitignore
README.md
requirements.txt

README.md

dalmatian -- 斑点狗

一个爬虫系统,捉去豆瓣电影,用于数据分析,然后会通过推荐系统推送给用户

开发环境:

  • 数据库:mysql
  • 爬虫框架:scrapy
  • 系统:mac osx

douban_movie

使用需要修改douban_movie/douban_movie/settings.py文件夹里面的数据库配置:host, port, username, password

douban_movie目录下面,跑命令:

scrapy crawl now_playing_movie