Skip to content
An analysis on my preferences on twitter
Jupyter Notebook Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data
scripts
.gitignore
LICENSE
README.md
cleaning-data.ipynb
clustering-tweets.ipynb
clusters
collecting-data.ipynb
environment.yml
faved_tweets.df
idf_tweets.df

README.md

twitter-faves

An analysis on my preferences on twitter. This is a work in progress. Its purpose is to serve as workshop material to present jupyter notebooks, python and its data science ecosystem. Herein I use mainly nltk, scikit-learn and pandas.

Notebook index

The notebooks present a very simple pipeline in the following order:

  1. collecting-data.ipynb
  2. cleaning-data.ipynb
  3. clustering-tweets.ipynb

Running the jupyter notebooks

All dependencies are listed in environment.yml. To create an environment with the required dependencies (first install conda):

git clone https://github.com/bgalvao/twitter-faves.git
cd twitter-faves
conda install -f environment.yml

This environment will be created with the name twitter-faves. To activate it, run:

source activate twitter-faves

finally, to start a JupyterLab "IDE", run:

jupyter lab
You can’t perform that action at this time.