Skip to content

Latest commit

 

History

History
31 lines (23 loc) · 1.05 KB

README.md

File metadata and controls

31 lines (23 loc) · 1.05 KB

This package implements a Python interface to IMDb plain text data files.

At this time, the API should not be considered stable.

Note that IMDb uses iso-8859-1 encoding (in data files and URLs); this package uses Unicode in most places.

python-imdb supports the following data files (to greater or lesser degree)

  • movies
  • aka-titles
  • ratings
  • plot
  • genres
  • running-times
  • color-info
  • certificates
  • directors
  • writers
  • actors
  • actresses

Download these files into /some/directory and then run python imdb --rebuild-db /some/directory to convert the data files (necessary to support seeking within the data files) and build a search index. This will result in files imdb.zip and imdb.zip.idx.

For search, movies.list is required and aka-titles.list and ratings.list are strongly recommended. However, each file is optional, with associated data and/or features simply being unavailable.

The module includes examples of a simple program (example.py) and a WSGI-based JSON API endpoint (wsgi.py).