<p>This project will take you through the process of mashing up data from two different APIs to make movie recommendations. The TasteDive API lets you provide a movie (or a band, TV show, etc.) as a query input and returns a set of related items. The OMDB API lets you provide a movie title as a query input and get back data about the movie, including scores from various review sites (Rotten Tomatoes, IMDB, etc.).

You will put those two together. You will use TasteDive to get related movies for a whole list of titles. You’ll combine the resulting lists of related movies and sort them according to their Rotten Tomatoes scores (which will require making calls to the OMDB API).

To avoid problems with rate limits and site accessibility, we have provided a cache file with results for all the queries you need to make to both OMDB and TasteDive. Just use requests_with_caching.get() rather than requests.get(). If you’re having trouble, you may not be formatting your queries properly, or you may not be asking for data that exists in our cache. We will try to provide as much information as we can to help guide you to form queries for which data exists in the cache.

Your first task will be to fetch data from TasteDive. The documentation for the API is at https://tastedive.com/read/api.

Define a function called get_movies_from_tastedive. It should take one input parameter, a string that is the name of a movie or music artist. The function should return the 5 TasteDive results that are associated with that string; be sure to only get movies, not other kinds of media. The return value will be a Python dictionary with just one key, ‘Similar’.

Try invoking your function with the input “Black Panther”.

HINT: Be sure to include only q, type, and limit as parameters in order to extract data from the cache. If any other parameters are included, the lookup will not be able to find the data that you’re attempting to pull from the cache. Remember, you will not need an API key to complete the project, because all data will be found in the cache.</p>
<table style="width:100%">
    <tr>
        <td><b>q</b></td>
        <td><b>type</b></td>
        <td><b>limit</b></td>
    </tr>
    <tr>
        <td><b>Black Panther</b></td>
        <td><b>omitted</b></td>
        <td><b>omitted</b></td>
    </tr>
    <tr>
        <td><b>Black Panther</b></td>
        <td><b>omitted</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Black Panther</b></td>
        <td><b>movies</b></td>
        <td><b>omitted</b></td>
    </tr>
    <tr>
        <td><b>Black Panther</b></td>
        <td><b>movies</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Tony Bennett</b></td>
        <td><b>omitted</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Tony Bennett</b></td>
        <td><b>movies</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Captain Marvel</b></td>
        <td><b>movies</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Bridesmaids</b></td>
        <td><b>movies</b></td>
        <td><b>5</b></td>
    </tr>
    <tr>
        <td><b>Sherlock Holmes</b></td>
        <td><b>movies</b></td>
        <td><b>5</b></td>
    </tr>
</table>
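
The hint about using only q, type, and limit is easier to see once the request is written out as a URL. The following is a minimal sketch, assuming (the source does not confirm this) that the cache matches requests by their full URL. `build_url` and `extra_param` are hypothetical names used only for illustration, and the standard library's `urlencode` stands in for the encoding that a real request would perform.

```python
from urllib.parse import urlencode

BASE_URL = "https://tastedive.com/api/similar"

def build_url(base, params):
    """Hypothetical helper: join a base URL and its query parameters."""
    return base + "?" + urlencode(params)

# Only the three parameters named in the hint.
cached_style = build_url(BASE_URL, {"q": "Black Panther", "type": "movies", "limit": 5})

# Same query plus one made-up extra parameter.
with_extra = build_url(
    BASE_URL,
    {"q": "Black Panther", "type": "movies", "limit": 5, "extra_param": "1"},
)

print(cached_style)                # https://tastedive.com/api/similar?q=Black+Panther&type=movies&limit=5
print(cached_style == with_extra)  # False
```

Under that assumption, the two requests look like different queries to the cache, which is why an extra parameter can lead to a lookup failure even when q is correct.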

In [None]:
import requests_with_caching
import json

def get_movies_from_tastedive(name):
    baseurl = "https://tastedive.com/api/similar" 
    query_params = {}
    query_params["q"] = name
    query_params["type"] = "movies"
    query_params["limit"] = 5
    this_page_cache = requests_with_caching.get(baseurl, params = query_params)
    return json.loads(this_page_cache.text)
    
def extract_movie_titles(movie_dict):
    return [movie['Name'] for movie in movie_dict['Similar']['Results']]
    
def get_related_titles(movie_lst):
    temp = list()
    final = list()
    for movie in movie_lst:
        temp.append(extract_movie_titles(get_movies_from_tastedive(movie)))
    for lst in temp:
        for movie in lst:
            if movie not in final:
                final.append(movie)
    return final
    
def get_movie_data(title):
    baseurl = "http://www.omdbapi.com/"
    query_param ={}
    query_param['t'] = title
    query_param['r'] = "json"
    this_page_cache = requests_with_caching.get(baseurl, params=query_param)
    return json.loads(this_page_cache.text)
    
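def get_movie_rating(movie_data):
    # Return the Rotten Tomatoes score as an int, or 0 if the movie has none.
    # Search by source name instead of assuming it is always the second entry.
    for rating in movie_data.get('Ratings', []):
        if rating['Source'] == 'Rotten Tomatoes':
            return int(rating['Value'].rstrip('%'))
    return 0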


def get_sorted_recommendations(movie_lst):
    alternatives = get_related_titles(movie_lst)
    movie_dict ={}
    for movie in alternatives:
        movie_dict[movie] = 0
    key_list = movie_dict.keys()
    for key in key_list:
        movie_dict[key] = get_movie_rating(get_movie_data(key))      
    return ([i[0] for i in sorted(movie_dict.items(), key=lambda item: (item[1], item[0]), reverse=True)])

Please copy the completed function from above into this active code window. Next, you will need to write a function that extracts just the list of movie titles from a dictionary returned by get_movies_from_tastedive. Call it extract_movie_titles.
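
As a rough illustration of the extraction step, the sketch below runs on a hand-written dictionary rather than a real API response. The titles and the "Type" field are invented placeholders; only the "Similar" → "Results" → "Name" nesting is taken from the code in this notebook.

```python
# Invented sample in the shape the notebook's code expects:
# one top-level key, "Similar", whose "Results" list holds one dict per item.
sample_response = {
    "Similar": {
        "Results": [
            {"Name": "Example Movie A", "Type": "movie"},
            {"Name": "Example Movie B", "Type": "movie"},
        ]
    }
}

def extract_movie_titles(movie_dict):
    """Collect the "Name" of every entry under Similar -> Results."""
    return [movie["Name"] for movie in movie_dict["Similar"]["Results"]]

print(extract_movie_titles(sample_response))  # ['Example Movie A', 'Example Movie B']
```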

In [None]:

Please copy the completed functions from the two code windows above into this active code window. Next, you’ll write a function, called get_related_titles. It takes a list of movie titles as input. It gets five related movies for each from TasteDive, extracts the titles for all of them, and combines them all into a single list. Don’t include the same movie twice.
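
The "don't include the same movie twice" requirement comes down to merging several lists while keeping only the first occurrence of each title. A minimal sketch of that merge follows, using invented placeholder titles in place of TasteDive results; `merge_unique` is a hypothetical helper name, not part of the assignment.

```python
def merge_unique(lists_of_titles):
    """Flatten several title lists into one, keeping first occurrences in order."""
    seen = set()
    merged = []
    for titles in lists_of_titles:
        for title in titles:
            if title not in seen:
                seen.add(title)
                merged.append(title)
    return merged

# Invented placeholder data: "Movie B" is related to both input titles.
related = [["Movie A", "Movie B", "Movie C"], ["Movie B", "Movie D"]]
print(merge_unique(related))  # ['Movie A', 'Movie B', 'Movie C', 'Movie D']
```

Tracking seen titles in a set keeps each membership check cheap; with only five results per input title, a plain `if title not in merged` check works just as well.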

In [None]:

Your next task will be to fetch data from OMDB. The documentation for the API is at https://www.omdbapi.com/

Define a function called get_movie_data. It takes one parameter, a string that is the title of the movie you want to look up. The function should return a dictionary with information about that movie.

Again, use requests_with_caching.get(). For the queries on movies that are already in the cache, you won’t need an API key. You will need to provide the following query parameters: t (the movie title) and r (the response format, here json). As with the TasteDive cache, be sure to include only those two parameters in order to extract existing data from the cache.
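
Once the movie dictionary is in hand, the Rotten Tomatoes score still has to be picked out of it. The sketch below assumes the shape that the code in this notebook relies on: a "Ratings" list of dictionaries with "Source" and "Value" keys, where the Rotten Tomatoes value is a percentage string. The sample values are invented, and `rotten_tomatoes_score` is a hypothetical name.

```python
def rotten_tomatoes_score(movie_data):
    """Return the Rotten Tomatoes percentage as an int, or 0 if it is absent."""
    for rating in movie_data.get("Ratings", []):
        if rating.get("Source") == "Rotten Tomatoes":
            # Strip the trailing '%' so one-, two-, and three-digit scores all parse.
            return int(rating["Value"].rstrip("%"))
    return 0

# Invented sample data in the assumed shape.
sample = {
    "Title": "Example Movie",
    "Ratings": [
        {"Source": "Internet Movie Database", "Value": "7.3/10"},
        {"Source": "Rotten Tomatoes", "Value": "100%"},
    ],
}

print(rotten_tomatoes_score(sample))           # 100
print(rotten_tomatoes_score({"Ratings": []}))  # 0
```

Looking the entry up by its "Source" avoids depending on its position in the list, and stripping the percent sign avoids the problems that fixed-width slicing has with values such as "100%" or "5%".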

In [None]:

Now, you’ll put it all together. Don’t forget to copy all of the functions that you have previously defined into this code window. Define a function called get_sorted_recommendations. It takes a list of movie titles as input. It returns a sorted list of related movie titles as output, up to five related movies for each input movie title. The movies should be sorted in descending order by their Rotten Tomatoes rating, as returned by the get_movie_rating function. Break ties in reverse alphabetical order, so that ‘Yahşi Batı’ comes before ‘Eyyvah Eyvah’.
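
Both ordering rules (rating descending, ties in reverse alphabetical order) can be expressed with a single sort key. The sketch below sorts on a (rating, title) tuple with `reverse=True`. The ratings are invented for illustration and are not real scores.

```python
# Invented ratings: the first two titles are tied on purpose.
ratings = {"Eyyvah Eyvah": 0, "Yahşi Batı": 0, "Example Movie": 90}

# Tuples compare element by element, so titles only matter when ratings tie.
# reverse=True flips both parts: highest rating first, then Z-to-A by title.
ordered = sorted(ratings, key=lambda title: (ratings[title], title), reverse=True)

print(ordered)  # ['Example Movie', 'Yahşi Batı', 'Eyyvah Eyvah']
```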

In [None]:
import requests_with_caching
import json


# An invocation that we use in the automated tests. If you are getting errors and want
# better error messages, copy it below the function definitions and uncomment it:
# get_sorted_recommendations(["Bridesmaids", "Sherlock Holmes"])
def get_movies_from_tastedive(name):
    parameters = {"q": name, "type": "movies", "limit": 5}
    tastedive_response = requests_with_caching.get("https://tastedive.com/api/similar", params=parameters)
    py_data = json.loads(tastedive_response.text)
    return py_data


def extract_movie_titles(dic_from_get_movies):
    movie_title = list()
    movie_info = dic_from_get_movies["Similar"]["Results"]
    for movie in movie_info:
        movie_title.append(movie["Name"])
    return movie_title


def get_related_titles(list_of_movie_title):
    new_list = list()
    for title in list_of_movie_title:
        a = get_movies_from_tastedive(title)
        b = extract_movie_titles(a)
        for movie in b:
            if movie not in new_list:
                new_list.append(movie)
    return new_list


def get_movie_data(movie_name):
    parameters = {'t': movie_name, 'r': 'json'}
    omdbapi_response = requests_with_caching.get('http://www.omdbapi.com/', params=parameters)
    a = json.loads(omdbapi_response.text)
    return a


def get_movie_rating(movie_dict):
    # Return the Rotten Tomatoes score as an int, or 0 if the movie has none.
    # Search by source name so a missing or reordered entry cannot leave the
    # result undefined, and strip '%' so values like "100%" and "5%" parse.
    for rating in movie_dict.get('Ratings', []):
        if rating['Source'] == 'Rotten Tomatoes':
            return int(rating['Value'].rstrip('%'))
    return 0


def get_sorted_recommendations(list_of_movies):
    related_movies = get_related_titles(list_of_movies)
    # Pair each title with its rating so both take part in the sort.
    rated = [(get_movie_rating(get_movie_data(movie)), movie) for movie in related_movies]
    # Descending by rating; ties are broken in reverse alphabetical order.
    return [movie for rating, movie in sorted(rated, reverse=True)]