This paper first presents a novel semi-automated technique that derives semantically relevant hashtags using domain-specific knowledge base of topic concepts and combines them with the existing tweet-based-hashtags to produce Hybrid Hashtags. Further, to deal with the speed and volume of Big Data Streams of tweets, we present the online approach that updates the preprocessing and learning model incrementally in a real-time streaming environment using distributed framework Apache Storm. Finally, to fully exploit the batch and stream environment performance advantages, we propose a comprehensive framework (Hybrid Hashtag based Tweet topic classification framework (HHTC)) that combines batch and online mechanisms in the most effective way.
-
Notifications
You must be signed in to change notification settings - Fork 0
vibhutittu/Real-time-Tweet-Analytics-using-Hybrid-Hashtags-on-Twitter-Big-Data-Streams
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published