[TODO] Sogou login #53

Open
chyroc opened this issue Jan 17, 2017 · 3 comments

Comments

chyroc (Owner) commented Jan 17, 2017

No description provided.

ax4 commented Jan 27, 2017

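An alternative idea: use a JS crawler as an indirect workaround for the Sogou login problem. The user workflow would be:
step 1. The developer opens weixin.sogou.com in their own browser.
step 2. They log in manually on the page.
step 3. On the logged-in weixin.sogou.com page, they open the browser console.
step 4. They run a piece of JS crawler code that we have written in advance, which collects the temporary links of the articles.
step 5. The temporary links can be bundled and downloaded, then handed to our Python crawler, which fetches and saves the content.

@hczhcz proposed this idea when we discussed the problem in March and April 2016.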
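
For step 5, a minimal Python sketch might look like the following. It assumes the JS side exports a file containing a JSON array of URL strings. That export format and the names `load_links`, `fetch`, and `save_articles` are hypothetical, made up for illustration, and not taken from any existing code in this project.

```python
import hashlib
import json
import urllib.request
from pathlib import Path
from typing import Callable, Iterable, List


def load_links(path: Path) -> List[str]:
    """Read the exported link file (assumed format: JSON array of URL strings)."""
    links = json.loads(path.read_text(encoding="utf-8"))
    # Drop duplicate links while keeping the original order.
    return list(dict.fromkeys(links))


def fetch(url: str, timeout: float = 10.0) -> bytes:
    """Download one page and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def save_articles(
    links: Iterable[str],
    out_dir: Path,
    fetcher: Callable[[str], bytes] = fetch,
) -> List[Path]:
    """Fetch every link and write each response to its own file in out_dir."""
    out_dir.mkdir(parents=True, exist_ok=True)
    saved = []
    for url in links:
        # Name the file after a hash of the URL so names are filesystem-safe.
        name = hashlib.sha1(url.encode("utf-8")).hexdigest()[:16] + ".html"
        target = out_dir / name
        target.write_bytes(fetcher(url))
        saved.append(target)
    return saved
```

Calling `save_articles(load_links(Path("links.json")), Path("articles"))` would then save one HTML file per link. The `fetcher` parameter is only there so the download step can be swapped out, for example in tests or to add retries.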

ax4 commented Jan 29, 2017

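I've started a repo for this. Hopefully I can finish it soon (~~~though I somehow doubt my own track record for finishing things~~~)
https://github.com/ax4/WechatSogouJS

Known issues:

  • Sogou Weixin article search only shows up to 100 pages of results (only the first 10 pages when not logged in, 100 pages after logging in)
  • The JS crawler still triggers CAPTCHAs. Trying to add Ruokuai to handle them.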

3 participants
@ax4 @chyroc and others