Skip to content

RaRe-Technologies/movie-plots-by-genre

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 

movie-plots-by-genre

Movie plots by genre tutorial at PyData Berlin 20 May 2016.

See slides for the narrative.

Setting Up

Make sure you have Python 3.

  • Clone this repository
git clone https://github.com/RaRe-Technologies/movie-plots-by-genre.git

if you don't have git you can also download a zip of this repo https://github.com/RaRe-Technologies/movie-plots-by-genre/archive/master.zip

  • Install virtualenv
(sudo) pip install virtualenv
  • Create a virtual env and install all the requirements.
cd movie-plots-by-genre/
virtualenv gensim # if you have both python2 and python3 then use virtualenv -p python3 gensim
source gensim/bin/activate
pip3 install cython gensim sklearn pandas matplotlib nltk pyemd jupyter

NOTE:On OSX you might want to download pyemd from github and install it via python3 setup.py install.

  • Download Google News pre-trained word2vec model (1.5 Gb) from here

  • Download nltk data

python -m nltk.downloader punkt
  • Fire up a jupyter notebook
jupyter notebook

If you are short on bandwidth then you will be able to follow most of the tutorial just with these libraries:

  • Python 3
  • pip3 install cython gensim sklearn pandas matplotlib nltk pyemd jupyter
  • python -m nltk.downloader punkt

About

Movie plots by genre tutorial at PyData Berlin 2016

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published