Twitter Trendiness Score Computation

The objective of this project is to compute the trendiness scores of specific words and phrases (two consecutive words) appearing in Twitter.

What is a "Trend"?
Spikes in the likelihood of seeing a word/phrase relative to its usual likelihood.

“Trendiness Score” Formula:
The trendiness of a word/phrase p at time t is computed as follows:

Here,

Approach

Tweets are obtained from the Twitter API.

Each individual tweet along with its timestamp is transformed according to our needs and pushed to a Kakfa Queue.

At the consumer end, the tweets are consumed and loaded onto a Tweets table in a PostgreSQL Database.

Now, when a user wants to find out the trendiness score of a word/phrase at any specific time, the user runs the trendiness_kafka.py script with the word/phrase as input.

The trendiness score of the word/phrase is computed using the formula shown above and displayed.

This process is executed every minute until the code is force stopped.

Finally, trendiness scores are plotted across each minute.

The same is shown below:

Name		Name	Last commit message	Last commit date
Latest commit History 165 Commits
Milestone_1		Milestone_1
Milestone_2		Milestone_2
Milestone_3_Kafka		Milestone_3_Kafka
Code_Execution_Procedure.md		Code_Execution_Procedure.md
PostgreSQL_Schema.md		PostgreSQL_Schema.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Trendiness Score Computation

Approach

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Twitter Trendiness Score Computation

Approach

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages