### Fix Metadata Mismatch

In this notebook, we fix entries that carry the metadata of a different movie. For each affected entry, we look up the movie by its correct IMDb ID and refill its `title` and `synopsis`.
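The refill step can be tried offline before calling IMDb. The sketch below is a minimal illustration under stated assumptions: `FAKE_IMDB`, `refill`, and the toy rows are hypothetical stand-ins, not the real lookup or data used in this notebook.

```python
import pandas as pd

# Hypothetical stand-in for an IMDb lookup, keyed by IMDb ID (illustrative values only).
FAKE_IMDB = {
    "0000001": {"title": "Correct Title A", "plot": ["Plot of A."]},
    "0000002": {"title": "Correct Title B"},  # no plot available
}

def refill(row, lookup):
    """Return a copy of `row` with title and synopsis taken from `lookup`."""
    record = lookup[row["imdb_id"]]
    row = row.copy()
    row["title"] = record["title"]
    plots = record.get("plot") or []
    # Fall back to an empty synopsis when no plot is available.
    row["synopsis"] = plots[0] if plots else ""
    return row

toy = pd.DataFrame({
    "title": ["Wrong A", "Wrong B", "Fine C"],
    "imdb_id": ["0000001", "0000002", "0000003"],
    "file": ["f1", "f2", "f3"],
    "synopsis": ["wrong", "wrong", "ok"],
})
mismatched_files = ["f1", "f2"]

# Only rows whose file is flagged as mismatched are refilled.
mask = toy["file"].isin(mismatched_files)
fixed = toy.loc[mask].apply(lambda r: refill(r, FAKE_IMDB), axis=1)
toy.loc[mask, ["title", "synopsis"]] = fixed[["title", "synopsis"]]
```

Rows outside the mask, such as `f3` here, keep their original `title` and `synopsis`.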

In [14]:
import pandas as pd

In [15]:
df_mismatches = pd.read_csv("manually_checked_title_mismatch.csv", dtype={"imdb_id": str, "year": int})

In [16]:
df_mismatches.head(2)

Unnamed: 0,title,imdb_id,file,dir,synopsis,year,link,clean_title
0,Greenfingers,203540,1981_The_Ultimate_Performer,before2010,Inmate Colin Briggs (Clive Owen) is introduced...,2001,/Mick-Jagger-Ultimate-Performer/dp/B09H9GW28G/...,Greenfingers
1,Dhoom,20113586,1296_Sachertorte,com,"""The tale begins in Mumbai where a gang of rob...",2022,/Sachertorte-Max-Hubacher/dp/B0B8TPMS8N/ref=sr...,Dhoom


In [17]:
end_meta = pd.read_csv("../../data/6_character_metadata/filtered_final_validated_metadata.csv", dtype={"imdb_id": str})

In [18]:
end_meta.loc[end_meta['file'].isin(df_mismatches['file'])].shape

(27, 7)
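
The shape above only counts the matching rows in `end_meta`. It does not show whether every checked file was found, or whether a file occurs more than once. A possible extra check is sketched below on toy data; the file names are hypothetical, and in the notebook the two series would be `df_mismatches['file']` and `end_meta['file']`.

```python
import pandas as pd

# Hypothetical file names, standing in for the two `file` columns.
mismatch_files = pd.Series(["f1", "f2", "f9"], name="file")
meta_files = pd.Series(["f1", "f2", "f3"], name="file")

# Files that were manually checked but do not appear in the metadata.
missing = sorted(set(mismatch_files) - set(meta_files))

# Files that occur more than once in the metadata.
duplicated = meta_files[meta_files.duplicated()].tolist()

print("missing:", missing)
print("duplicated:", duplicated)
```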

In [19]:
from imdb import Cinemagoer

ia = Cinemagoer()

In [21]:
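def update_title_synopsis(row):
    """Refill a row's title and synopsis from the IMDb record for its imdb_id."""
    movie = ia.get_movie(row['imdb_id'])

    row = row.copy()  # work on a copy instead of mutating the row passed in by apply
    row['title'] = movie['title']

    # Use the first plot summary if IMDb returns one, otherwise an empty string,
    # so the synopsis of the wrong movie is never kept.
    plots = movie.data.get("plot") if movie.data else None
    row['synopsis'] = plots[0] if plots else ""

    print(row['title'], movie)
    return row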

matched_titles = end_meta.loc[end_meta['file'].isin(df_mismatches['file'])].apply(update_title_synopsis, axis=1)

Greenfingers Greenfingers
Sachertorte Sachertorte
Perfect Plan Perfect Plan
It's Kind of a Funny Story It's Kind of a Funny Story
13 Cameras 13 Cameras
Come As You Are Come As You Are
Bigfoot, UFOs and Jesus Bigfoot, UFOs and Jesus
Harpoon Harpoon
Deadline Deadline
Northern Skies Over Empty Space Northern Skies Over Empty Space
CIRCLES vol 1 CIRCLES vol 1
The Queen of Versailles The Queen of Versailles
Everybody's Talking About Jamie Everybody's Talking About Jamie
Allies Allies
Akilla's Escape Akilla's Escape
The Conspirator The Conspirator
Brian and Charles Brian and Charles
Flu Flu
Kesari Kesari
The Hollow Child The Hollow Child
Hell Trip Hell Trip
The Peanut Butter Falcon The Peanut Butter Falcon
Dog Years Dog Years
Brittany Runs a Marathon Brittany Runs a Marathon
Close Range Close Range
RoboCop RoboCop
The Magnificent Seven The Magnificent Seven


In [22]:
end_meta.loc[end_meta['file'].isin(df_mismatches['file']), ['title', 'synopsis']] = matched_titles[['title', 'synopsis']].values


In [23]:
end_meta.loc[end_meta['file'].isin(df_mismatches['file'])]

Unnamed: 0,title,imdb_id,file,dir,synopsis,year,link
2379,Greenfingers,203540,1981_The_Ultimate_Performer,before2010,A prison inmate with a green thumb goes on to ...,2001.0,/Mick-Jagger-Ultimate-Performer/dp/B09H9GW28G/...
3049,Sachertorte,20113586,1296_Sachertorte,com,"Karl puts his life in Berlin on hold, only to ...",2022.0,/Sachertorte-Max-Hubacher/dp/B0B8TPMS8N/ref=sr...
3050,Perfect Plan,1664010,1297_Perfect_Plan,com,A Realtor for high-end listings in Chicago fin...,2011.0,/Perfect-Plan-Emily-Rose/dp/B08H78X26S/ref=sr_...
3052,It's Kind of a Funny Story,804497,1321_Its_Kind_of_a_Funny_Story,com,A clinically-depressed teenager gets a new sta...,2010.0,/Kind-Funny-Story-Keir-Gilchrist/dp/B004GBFF5Y...
3053,13 Cameras,4392454,1335_13_Cameras,com,"A newlywed couple, move into a new house acros...",2016.0,/13-Cameras-PJ-McCabe/dp/B01E4NPWXI/ref=sr_1_2...
3064,Come As You Are,6722726,396_Come_As_You_Are,com,Three young men with disabilities hit the road...,2020.0,/Come-You-Are-Grant-Rosenmeyer/dp/B08386TDTD/r...
3065,"Bigfoot, UFOs and Jesus",14831408,3971_Bigfoot_UFOs_and_Jesus,com,After her estranged father dies during a UFO s...,2021.0,/Bigfoot-UFOs-Jesus-Melissa-Anschutz/dp/B095XL...
3082,Harpoon,7831358,4194_Harpoon,com,"Rivalries, dark secrets, and sexual tension em...",2019.0,/Harpoon-Munro-Chambers/dp/B07YGN31F7/ref=sr_1...
3087,Deadline,1242618,1774_Deadline,com,A screenwriter travels to an abandoned house t...,2009.0,/Deadline-Brittany-Murphy/dp/B0B8513PL6/ref=sr...
3104,Northern Skies Over Empty Space,14030846,2001_Northern_Skies_Over_Empty_Space,com,Don Reynaldo is a famous hunter in decline. Wh...,2022.0,/Northern-Skies-Over-Empty-Space/dp/B0B76B6P7M...


In [24]:
end_meta.to_csv("../../data/6_character_metadata/all_metadata_finalized.csv", index=False)
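
When the saved file is read back, `imdb_id` needs the same `dtype` hint that the earlier cells pass to `read_csv`, otherwise pandas parses the column as integers. The round trip below is a small sketch on an in-memory buffer; the toy row and its zero-padded ID are illustrative assumptions, not values from this dataset.

```python
import io

import pandas as pd

# Toy frame with a zero-padded ID (illustrative value only).
toy = pd.DataFrame({"title": ["A"], "imdb_id": ["0203540"]})

buffer = io.StringIO()
toy.to_csv(buffer, index=False)

# Reading with the dtype hint keeps the ID as a string.
buffer.seek(0)
as_str = pd.read_csv(buffer, dtype={"imdb_id": str})

# Reading without it parses the ID as an integer and drops the leading zero.
buffer.seek(0)
as_default = pd.read_csv(buffer)

print(as_str["imdb_id"].tolist(), as_default["imdb_id"].tolist())
```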