# Digital Marketing Strategy
## "What determinants could videogame publishers use to improve their Customer Engagement on the social media platforms of Instagram and Facebook?"
### This notebook only combines the datasets exported from Fanpage Karma.

#### Combining the datasets from Fanpage Karma
For Facebook we have four yearly datasets (2021 to 2024), which we combine into one dataset.<br>
For Instagram we also have four yearly datasets (2021 to 2024), which we likewise combine into one dataset.

In [3]:
import os 
import pandas as pd

In [7]:
# Paths to the folders that hold the datasets
fb_folder = '/Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Facebook'
ig_folder = '/Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Instagram'
DMS_folder = '/Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/DMS'

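def clean_headers(df):
    # Trim whitespace and lowercase the headers so files from different years match
    df.columns = df.columns.str.strip().str.lower()
    # Keep only the first occurrence of any duplicated column name
    df = df.loc[:, ~df.columns.duplicated()]
    return df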

def combine_datasets(folder, platform):
    # Sorting by file name also sorts by year, since the year is part of the name
    all_files = sorted(f for f in os.listdir(folder) if f.endswith(".xlsx"))
    combined_df = None

    for file in all_files:
        file_path = os.path.join(folder, file)
        print(f"Working on: {file_path}")

        df = pd.read_excel(file_path, engine="openpyxl")
        df = clean_headers(df)

        # Use the columns of the first file (2021) as the reference layout
        if combined_df is None:
            combined_df = df
        else:
            # Align later files to the reference columns before appending
            df = df.reindex(columns=combined_df.columns, fill_value=None)
            combined_df = pd.concat([combined_df, df], ignore_index=True, sort=False)

    # Save to a new file in the current working directory
    output_file = f"combined_{platform}_datasets.xlsx"
    combined_df.to_excel(output_file, index=False)
    print(f"Combined dataset saved as: {output_file}")

# Combine the datasets
combine_datasets(fb_folder, "FB")
combine_datasets(ig_folder, "IG")


Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Facebook/GamingStudio_FBdata_2021.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Facebook/GamingStudio_FBdata_2022.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Facebook/GamingStudio_FBdata_2023.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Facebook/GamingStudio_FBdata_2024.xlsx
Combined dataset saved as: combined_FB_datasets.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Instagram/GamingStudio_IGdata_2021.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy/Datasets/Instagram/GamingStudio_IGdata_2022.xlsx
Working on: /Users/jimdoves/Desktop/Master Digital Driven Business/Digital Marketing Strategy
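
To make the header cleaning and column alignment in `combine_datasets` easier to follow, here is a minimal sketch on two small in-memory frames. The frames, their column names and their values are made up for illustration and do not come from the Fanpage Karma exports; only `clean_headers` mirrors the function defined above.

```python
import pandas as pd

def clean_headers(df):
    # Same normalisation as in the notebook: trim, lowercase, drop duplicate columns
    df.columns = df.columns.str.strip().str.lower()
    return df.loc[:, ~df.columns.duplicated()]

# Hypothetical toy frames standing in for two yearly exports
first = pd.DataFrame({" Date ": ["2021-01-01"], "Number of Likes": [10]})
second = pd.DataFrame(
    {"date": ["2022-01-01"], "NUMBER OF LIKES": [20], "extra column": ["x"]}
)

first = clean_headers(first)
second = clean_headers(second)

# Align the second frame to the first frame's columns: columns that only
# exist in the second frame are dropped, columns it lacks would become NaN
second = second.reindex(columns=first.columns)

combined = pd.concat([first, second], ignore_index=True)
print(combined)
```

One consequence of this design is that a column which only appears in a later file is silently dropped, because the first file defines the reference layout. If the yearly exports differ in their columns, it may be worth comparing the headers before combining.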

#### Display the first rows of the combined datasets

In [8]:
# Load the combined datasets and display first few rows
fb_combined_path = f"{DMS_folder}/combined_FB_datasets.xlsx"
ig_combined_path = f"{DMS_folder}/combined_IG_datasets.xlsx"

# Read the files
fb_combined = pd.read_excel(fb_combined_path, engine="openpyxl")
ig_combined = pd.read_excel(ig_combined_path, engine="openpyxl")

# Show the first rows of the dataset
print("First rows of Facebook dataset:")
display(fb_combined.head())

print("\nFirst rows of Instagram dataset:")
display(ig_combined.head())


First rows of Facebook dataset:


Unnamed: 0,date,message,profile,network,number of followers,number of likes,total likes(including positive emoji likes),number of comments,number of comments (primary),shares per post,...,number of thankful,number of wow,post comments positive sentiment share,post comments negative sentiment share,engagement,weighted engagement,message-id,profile-id,link,external links
0,2021-12-31 18:00:03,What's the first game you're playing in 2022?,Insomniac Games,FACEBOOK,271113,728,995,203,190,25,...,0,1,0.305419,0.054187,0.004511,0.005444,91127584852_10160103901729853,91127584852,https://www.facebook.com/insomniacgames/photos...,
1,2021-12-31 16:00:03,Fungi and ferns and moths and Fireflies -- the...,"Naughty Dog, LLC",FACEBOOK,1506484,850,1138,14,14,57,...,0,2,0.5,0.071429,0.000803,0.000888,57327290245_10165832084200246,57327290245,https://www.facebook.com/naughtydog/photos/a.1...,
2,2021-12-31 16:00:02,How did the creators of NEO: The World Ends wi...,Square Enix,FACEBOOK,1792534,237,291,24,15,12,...,0,1,0.458333,0.375,0.000183,0.00021,59916854460_10159833755834461,59916854460,https://www.facebook.com/SquareEnix/photos/a.1...,
3,2021-12-31 14:00:01,Yesterday was a journey. Tomorrow's a new adve...,Square Enix,FACEBOOK,1792534,478,664,50,48,106,...,0,1,0.48,0.36,0.00046,0.000606,59916854460_10159832301434461,59916854460,https://www.facebook.com/867403448085191/video...,
4,2021-12-31 13:00:17,Happy New Year!,CD PROJEKT RED,FACEBOOK,482793,6630,8950,214,164,128,...,0,3,0.565421,0.17757,0.019262,0.020235,283103815139159_4671320582984105,283103815139159,https://www.facebook.com/283103815139159/posts...,



First rows of Instagram dataset:


Unnamed: 0,date,message,profile,network,number of followers,number of likes,number of comments,"reactions, comments & shares",number of posts,picture posts,video-posts,engagement,message-id,profile-id,link,external links
0,2021-12-31 18:00:07,What is the first game you are going to play i...,Insomniac Games,INSTAGRAM,597380,8190,295,8485,1,1,0,0.014204,18159887410207560,302534790,https://www.instagram.com/p/CYKA__hjRC2/,
1,2021-12-31 16:00:11,Fungi and ferns and moths and Fireflies -- the...,Naughty Dog,INSTAGRAM,1208699,19992,35,20027,1,1,0,0.016569,17918878019040680,1593309399,https://www.instagram.com/p/CYJzRfyAaSI/,
2,2021-12-31 14:01:05,Yesterday was a journey. Tomorrow's a new adve...,Square Enix,INSTAGRAM,1059895,3167,79,3246,1,0,1,0.003063,17864704832649000,2117884847,https://www.instagram.com/tv/CYJlmkOhnw0/,
3,2021-12-30 18:01:10,It's always bad luck when a Black Cat... judo ...,Insomniac Games,INSTAGRAM,597380,42010,341,42351,1,1,0,0.070946,17902246271425140,302534790,https://www.instagram.com/p/CYHcUnih3H8/,
4,2021-12-30 15:09:00,Mammoth’s Patriot Mil-Spec is the internationa...,Rockstar Games,INSTAGRAM,21721887,156874,2163,159037,1,1,0,0.007325,18189250981149360,184595688,https://www.instagram.com/p/CYHInoNtmzo/,
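
Since `head()` only shows posts from the end of 2021, a quick way to check that all years made it into a combined file is to count posts per year. The sketch below uses a small made-up frame (`sample`, with hypothetical profiles and dates) in place of `fb_combined` or `ig_combined`; it assumes only that the `date` column uses the timestamp format shown in the output above.

```python
import pandas as pd

# Hypothetical stand-in for fb_combined or ig_combined (values are made up)
sample = pd.DataFrame({
    "date": ["2021-12-31 18:00:03", "2022-03-01 10:00:00", "2022-07-15 09:30:00"],
    "profile": ["Studio A", "Studio B", "Studio A"],
})

# Parse the dates explicitly in case the column was read as text
sample["date"] = pd.to_datetime(sample["date"])

# Count posts per calendar year, ordered by year
posts_per_year = sample["date"].dt.year.value_counts().sort_index()
print(posts_per_year)
```

Applied to the real combined frames, the same two lines should list one row per year from 2021 to 2024 if every file was read and appended.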
