GitHub - WattSocialBot/imdbpy: IMDbPY is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies

IMDbPY is a Python package for retrieving and managing the data of the IMDb movie database about movies, people and companies.

Revamp notice

Starting on November 2017, many things were improved and simplified:

moved the package to Python 3 (compatible with Python 2.7)
removed dependencies: SQLObject, C compiler, BeautifulSoup
removed the "mobile" and "httpThin" parsers
introduced a testsuite (please help with it!)

The original Python 2 version is available in the imdbpy-legacy branch (mostly unsupported).

Main features

written in Python 3 (compatible with Python 2.7)
platform-independent
can retrieve data from both the IMDb's web server, or a local copy of the database
a simple and complete API
released under the terms of the GPL 2 license

IMDbPY powers many other software and has been used in various research papers. Curious about that?

Installation

Whenever it's possible, please always use the latest version from the repository. To install it using pip:

pip3 install git+https://github.com/alberanid/imdbpy

Code example

from imdb import IMDb

# create an instance of the IMDb class
ia = IMDb()

# get a movie and print its director(s)
the_matrix = ia.get_movie('0133093')
print(the_matrix['director'])

# show all the information sets avaiable for Movie objects
print(ia.get_movie_infoset())

# update a Movie object with more information
ia.update(the_matrix, ['technical'])
# show which keys were added by the information set
print(the_matrix.infoset2keys['technical'])
# print one of the new keys
print(the_matrix.get('cinematographic process'))

# search for a person
for person in ia.search_person('Mel Gibson'):
    print(person.personID, person['name'])

# get the first result of a company search,
# update it to get the basic information
ladd_company = ia.search_company('The Ladd Company')[0]
ia.update(ladd_company)
# show the available information and print some
print(ladd_company.keys())
print(ladd_company.get('production companies'))

# get 5 movies tagged with a keyword
dystopia = ia.get_keyword('dystopia', results=5)

# get top250 and bottom100 movies
top250 = ia.get_top250_movies()
bottom100 = ia.get_bottom100_movies()

S3 database

IMDb distributes some of the data in their s3 database. Using IMDbPY, you can easily import them using the s32imdbpy.py script. Download the files from here, create an empty database in your favorite database server, and then run:

./bin/s32imdbpy.py /path/to/the/tsv.gz/files/ URI

where URI is the identifier used to access a SQL database amongst the ones supported by SQLAlchemy, for example postgres://user:password@localhost/imdb.

You will use the same URI with the "s3" accessSystem to create an instance of the IMDb object that is able to access the database:

ia = IMDb('s3', uri)

For more information, see docs/README.s3.txt

Main objects and methods

Create an instance of the IMDb class, to access information from the web or a SQL database:

ia = imdb.IMDb()

Return an instance of a Movie, Person or Company class. The objects have the basic information:

movie = ia.get_movie(movieID)
person = ia.get_person(personID)
company = ia.get_company(companyID)

Return a list of Movie, Person or Company instances. These objects have only bare information, like title and movieID:

movies = ia.search_movie(title)
persons = ia.search_person(name)
companies = ia.search_company(name)

Update a Movie, Person or Company instance with basic information, or any other specified info set:

ia.update(obj, info=infoset)

Return all info sets available for a movie; similar methods are available for other objects:

ia.get_movie_infoset()

Mapping between the fetched info sets and the keywords they provide; similar methods are available for other objects:

movie.infoset2keys

The ID of the object:

movie.movieID
person.personID
company.companyID

Get a key of an object:

movie['title']
person.get('name')

Search for keywords similar to the one provided, and fetch movies matching a given keyword:

keywords = ia.search_keyword(keyword)
movies = ia.get_keyword(keyword)

Get the top 250 and bottom 100 movies:

ia.get_top250_movies()
ia.get_bottom100_movies()

Check whether a person worked in a given movie or not:

person in movie
movie in person

Check whether a key is present for a movie (or person) or not:

'cast' in movie
'filmography' in person

License

IMDbPY is released under the terms of the GNU GPL v2 (or later) license.

Name		Name	Last commit message	Last commit date
Latest commit History 1,426 Commits
.github		.github
bin		bin
docs		docs
imdb		imdb
tests		tests
.gitignore		.gitignore
.hgignore		.hgignore
.hgtags		.hgtags
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.rst		README.rst
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Main features

Installation

Code example

S3 database

Main objects and methods

License

About

Releases

Packages

Languages

License

WattSocialBot/imdbpy

Folders and files

Latest commit

History

Repository files navigation

Main features

Installation

Code example

S3 database

Main objects and methods

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages