weiboHotWord

基于hadoop和hive的微博热词跟踪系统

对应的blog地址为:

http://blog.csdn.net/gamer_gyt/article/details/51940211
<1>首先是利用微博的api得到每天的微博数据
<2>编写hadoop项目对微博内容进行分词统计，设置一个阀值，当一个词的出现的数目超过这个阀值时就将其加入到热词列表里，在以后的每天就对其进行统计
<3>将处理后的数据写入hive

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
file源文件/20160221		file源文件/20160221
hot_word_code		hot_word_code
利用API获取微博数据example		利用API获取微博数据example
.gitattributes		.gitattributes
.gitignore		.gitignore
20160221-r-00000		20160221-r-00000
README.md		README.md
hiveSQL		hiveSQL

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

weiboHotWord

对应的blog地址为:

About

Releases

Packages

Languages

Thinkgamer/weiboHotWord

Folders and files

Latest commit

History

Repository files navigation

weiboHotWord

对应的blog地址为:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages