Small Python 3.x crawler projects

Data I scraped while doing everyday data analysis, kept here as crawler practice 😸

Environment setup and walkthrough

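The data is fetched through a simulated login, because the parameters required by the shuoshuo (说说) request URL are taken from the cookies. The cookies can of course be obtained in other ways as well. The g_qzonetoken value is read from the page source.
To fetch the posts, analyze the shuoshuo request URL, build the parameters, and pass them in.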
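
The token extraction and parameter construction described above could look roughly like the sketch below. It is only an illustration under stated assumptions: the regular expression assumes the page source assigns g_qzonetoken through a function that returns a quoted string, and the helper names, the `uin` cookie key, the `qzonetoken` query parameter name, and the example URL are hypothetical placeholders rather than values confirmed by this repository.

```python
import re
from urllib.parse import urlencode

# Assumption: the page source contains something like
#   window.g_qzonetoken = (function(){ try{return "TOKEN";} ... })();
# Adjust the pattern if the real page is shaped differently.
TOKEN_PATTERN = re.compile(r'g_qzonetoken\s*=.*?return\s*"([^"]+)"', re.S)


def extract_qzonetoken(html):
    """Return the g_qzonetoken value found in the page source, or None."""
    match = TOKEN_PATTERN.search(html)
    return match.group(1) if match else None


def build_request_url(base_url, cookies, qzonetoken, extra=None):
    """Build the request URL from cookie values and the extracted token.

    The parameter names used here are hypothetical; take the real ones
    from the request observed in the browser's developer tools.
    """
    params = {
        "uin": cookies.get("uin", ""),  # hypothetical cookie key
        "qzonetoken": qzonetoken,       # hypothetical parameter name
    }
    if extra:
        params.update(extra)
    return base_url + "?" + urlencode(params)
```

In practice the `cookies` dict would come from the logged-in session (for example a browser-automation login), and the resulting URL would be requested with those same cookies attached.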

Maoyantop100
aqistudy
baidu
bing
concurrentSpider
douban
douyin
downloader
geetest
github
image
jingdong
novel
qq
saike
scrapyLearnings
starbucks
strong
ted
vip_downloader
wangyiMusic
weixin
wifi_crack
xpath
.gitignore
README.md