Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

生成微博词云 - februarysea.com #17

Open
februarysea opened this issue Feb 14, 2020 · 0 comments
Open

生成微博词云 - februarysea.com #17

februarysea opened this issue Feb 14, 2020 · 0 comments

Comments

@februarysea
Copy link
Owner

https://februarysea.com/2019/10/30/%E7%94%9F%E6%88%90%E5%BE%AE%E5%8D%9A%E8%AF%8D%E4%BA%91/

今天花了一天写了一个新浪微博的爬虫,爬取微博指定用户微博内容,然后生成词云,以@带带大师兄为例,这是带带大师兄微博的图云。 具体实现思路是:手机端网页的微博内容比较容易获取,于是通过爬虫访问手机端微博网页m.weibo.com获取某人的微博信息,然后将微博信息构成一个字符串进行词语分割,最后用分割的词语生成词云。 构建请求头:主要是为了微博把我们的爬虫识别为浏览器。 123456headers

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant