Skip to content
Python interface to IMDb plain-text data files
Python
Branch: master
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
imdb
.gitignore Modify search result ranking method for better analysis of number of … Aug 28, 2013
LICENSE Initial commit Apr 13, 2013
README.md Update readme to reflect partial database support Jun 6, 2014
TESTS Handle malformed running times; derate movies without known year Sep 18, 2013
example.py Proper Unicode handling in example (closes: #7) Nov 27, 2016
rating_scale.txt Modify search result ranking method for better analysis of number of … Aug 28, 2013
setup.py Version bump May 21, 2015
test.py Add test cases Aug 31, 2013
wsgi.py wsgi: Enable clearing of the cache Apr 20, 2014

README.md

This package implements a Python interface to IMDb plain text data files.

At this time, the API should not be considered stable.

Note that IMDb uses iso-8859-1 encoding (in data files and URLs); this package uses Unicode in most places.

python-imdb supports the following data files (to greater or lesser degree)

  • movies
  • aka-titles
  • ratings
  • plot
  • genres
  • running-times
  • color-info
  • certificates
  • directors
  • writers
  • actors
  • actresses

Download these files into /some/directory and then run python imdb --rebuild-db /some/directory to convert the data files (necessary to support seeking within the data files) and build a search index. This will result in files imdb.zip and imdb.zip.idx.

For search, movies.list is required and aka-titles.list and ratings.list are strongly recommended. However, each file is optional, with associated data and/or features simply being unavailable.

The module includes examples of a simple program (example.py) and a WSGI-based JSON API endpoint (wsgi.py).

You can’t perform that action at this time.