spider for exporting my favorites
https://github.com/geekan/scrapy-examples
需求:专栏图片未存储
参考:
https://github.com/shanelau/zhihu nodejs版本 https://github.com/xchaoinfo/fuck-login 所有网站的登陆 https://github.com/moxiegushi/zhihu 爬虫包含验证码破解 https://github.com/wycm/zhihu-crawler java版本 https://github.com/mtianyan/ArticleSpider 一直有更新 scrapy爬虫 https://github.com/AlexTan-b-z/ZhihuSpider https://github.com/xpisme/scrapy_zhihu 有爬出的1w用户数据
学习:
http://cuiqingcai.com/1319.html bs模块用法 http://www.jianshu.com/p/5017d8342dd2 python unicode http://www.pythonsite.com/?p=358 scrapy
评论、链接 API: http://open.weibo.com/wiki/API#.E6.94.B6.E8.97.8F https://github.com/michaelliao/sinaweibopy