twitter_data_analysis

The network analysis notebook does the following tasks:

Use Twitter API to collect 1000 tweets in which keyword ‘narendra modi’ appears, save the collected tweets in nm.txt
Convert the collected tweets into BoW vectors and find cosine similarity of a pair of tweets and print the top-10 most similar tweet pairs, print these pairs
Do the same using TF-IDF vectors
Find out unique users (N) who have posted these 1000 tweets. (N <= 1000) U1, u2, ….., uN
Find the followers and followee of each user from the N users obtained before
- u1 - [followers list] [followee list]
- u2 - [followers list] [followee list]
- .
- .
- uN - [followers list] [followee list]
Followers and followees are also users, so create a follower-followee directed graph among them, G. (ui → uj) iff ui is followed by uj
Find popular users in this G based on
- Degree centrality
- Betweeness centrality
- Closeness centrality

Run notebooks on local machine:

Create a twitter developer account from here

cd to the repository and create a virtual environment and activate it:

virtualenv -p usr/bin/python3.8 env
source env/bin/activate

Install the libraries from requirements.txt:

pip install -r requirements.txt

Create a file by the name secrets.py and copy the content of secrets_template.py to secrets.py. Add your secret keys, which you got by creating an app on twitter developers account, in secrets.py file.

Run jupyter notebook to run all notebooks inside this repository locally.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
network_analysis.ipynb		network_analysis.ipynb
network_graph.png		network_graph.png
requirements.txt		requirements.txt
secrets_template.py		secrets_template.py
tweet_similarity_ranking.ipynb		tweet_similarity_ranking.ipynb
twitter-data-collection.ipynb		twitter-data-collection.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

network_analysis.ipynb

network_analysis.ipynb

network_graph.png

network_graph.png

requirements.txt

requirements.txt

secrets_template.py

secrets_template.py

tweet_similarity_ranking.ipynb

tweet_similarity_ranking.ipynb

twitter-data-collection.ipynb

twitter-data-collection.ipynb

Repository files navigation

twitter_data_analysis

Run notebooks on local machine:

About

Releases

Packages

Languages

License

soumyaa1804/twitter_data_analysis

Folders and files

Latest commit

History

Repository files navigation

twitter_data_analysis

Run notebooks on local machine:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages