# Descriptive analysis objectives
This notebook derives at least eight descriptive analyses from the mall survey data. The objectives are:
1. **Gender Distribution**: Analyze the distribution of respondents by gender to understand the gender demographics of mall visitors.
2. **Age Range Distribution**: Analyze the distribution of respondents by age range to identify the age groups that frequent the mall.
3. **Occupation Distribution**: Analyze the distribution of respondents by occupation to determine the professional background of mall visitors.
4. **Monthly Income Distribution**: Analyze the distribution of respondents by monthly income to assess the economic status of the mall's customer base.
5. **Comfort Room Location Preference**: Analyze the preference for comfort room locations within the mall to optimize the placement of these facilities.
6. **Green Space Preference**: Analyze the preference for green spaces within the mall to understand the demand for natural and relaxing areas.
7. **Age vs Store Preference Correlation**: Analyze the correlation between age groups and store preferences to tailor marketing strategies.
8. **Income Level vs Store Preference Correlation**: Analyze the correlation between income levels and store preferences to optimize product offerings and promotions.
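
Objectives 7 and 8 compare two categorical variables, so one simple descriptive approach is a cross-tabulation with row percentages rather than a numeric correlation coefficient. The following is a minimal sketch under stated assumptions: the rows in `sample` are hypothetical, and using `order_store_1` as the respondent's first-choice store type is an assumption about what that column means.

```python
import pandas as pd

# Hypothetical sample rows; in the notebook, the loaded survey DataFrame would be used instead.
sample = pd.DataFrame({
    "age_range": ["20-29", "20-29", "20-29", "30-39", "30-39"],
    # Assumption: order_store_1 holds each respondent's first-ranked store type.
    "order_store_1": [
        "Food and Beverages",
        "Food and Beverages",
        "Services",
        "Department Stores",
        "Food and Beverages",
    ],
})

# Count how often each store type appears within each age group.
counts = pd.crosstab(sample["age_range"], sample["order_store_1"])

# Row percentages make age groups of different sizes comparable.
row_pct = (
    pd.crosstab(sample["age_range"], sample["order_store_1"], normalize="index") * 100
).round(2)

print(counts)
print(row_pct)
```

The same pattern would apply to objective 8 by swapping `age_range` for `monthly_income`.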


In [38]:
import altair as alt
import pandas as pd

# Load the CSV file
file_path = '../data/dino_mall_cleaned.csv'
df = pd.read_csv(file_path)  # read_csv already returns a DataFrame
df

Unnamed: 0,timestamp,consent,sex,age_range,occupation,monthly_income,interested_outside,order_store_1,order_store_2,order_store_3,...,interest_service,comfort_room_location,green_space_bool,green_yes_store_area,amenities_beside_customer_service_desks,amenities_beside_comfort_rooms,amenities_beside_common_area,amenities_beside_directory,amenities_beside_elevator,amenities_beside_escalator
0,4/1/2024 14:15,I acknowledge and authorize the use of the gat...,Female,20-29,Student,Prefer to not say,"Electronics and Technology (Apple, Samsung)",Electronics and Gadgets,Apparel and Fashion,Department Stores,...,1,Both sides of the mall,Yes,Entertainment,"Apparel and Fashion, Department Stores, Home F...","Apparel and Fashion, Department Stores, Entert...","Apparel and Fashion, Electronics and Gadgets, ...","Near/Beside the entrance, Center of the mall, ...",Center of the mall,"Near/Beside the entrance, Center of the mall, ..."
1,4/1/2024 14:34,I acknowledge and authorize the use of the gat...,Female,20-29,Student,Prefer to not say,Services (ATM machines),Entertainment,Food and Beverages,Specialty Stores,...,3,Both sides of the mall,Yes,Food and Beverages,Department Stores,Food and Beverages,Entertainment,Near/Beside the entrance,Center of the mall,Center of the mall
2,4/1/2024 14:37,I acknowledge and authorize the use of the gat...,Female,20-29,Student,Prefer to not say,"Food and Beverages (Jollibee, Max, etc.)",Food and Beverages,Department Stores,Services,...,3,Both sides of the mall,Yes,Food and Beverages,"Department Stores, Entertainment","Entertainment, Food and Beverages, Services","Apparel and Fashion, Electronics and Gadgets, ...","Near/Beside the entrance, Center of the mall",Center of the mall,Center of the mall
3,4/1/2024 14:37,I acknowledge and authorize the use of the gat...,Female,20-29,Unemployed,"Below ₱10,000","Food and Beverages (Jollibee, Max, etc.)",Food and Beverages,"Beauty and Personal Care, Health and Wellness",Department Stores,...,4,Both sides of the mall,Yes,Department Stores,Department Stores,Food and Beverages,Entertainment,Near/Beside the entrance,Center of the mall,"Near/Beside the entrance, Center of the mall"
4,4/1/2024 15:44,I acknowledge and authorize the use of the gat...,Female,20-29,Student,Not working,"Food and Beverages (Jollibee, Max, etc.)",Food and Beverages,"Beauty and Personal Care, Health and Wellness",Apparel and Fashion,...,1,Both sides of the mall,Yes,Food and Beverages,"Apparel and Fashion, Food and Beverages, Speci...","Apparel and Fashion, Entertainment, Food and B...","Department Stores, Home Furnishings and Decor",Near/Beside the entrance,"Near/Beside the entrance, Center of the mall, ...","Near/Beside the entrance, Center of the mall, ..."
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
108,4/6/2024 19:18,I acknowledge and authorize the use of the gat...,Male,20-29,Student,Prefer to not say,"Food and Beverages (Jollibee, Max, etc.)",Services,"Beauty and Personal Care, Health and Wellness",Home Furnishings and Decor,...,1,Both sides of the mall,Yes,Electronics and Technology,"Apparel and Fashion, Electronics and Gadgets, ...","Electronics and Gadgets, Entertainment, Food a...","Apparel and Fashion, Electronics and Gadgets, ...",Center of the mall,Near/Beside the entrance,Center of the mall
109,4/6/2024 22:48,I acknowledge and authorize the use of the gat...,Male,20-29,Student,"Below ₱10,000","Food and Beverages (Jollibee, Max, etc.)",Food and Beverages,Home Furnishings and Decor,"Beauty and Personal Care, Health and Wellness",...,3,Both sides of the mall,Yes,Food and Beverages,"Apparel and Fashion, Electronics and Gadgets, ...","Beauty, Personnel, Health and Wellness, Entert...","Apparel and Fashion, Electronics and Gadgets, ...","Near/Beside the entrance, Center of the mall","Center of the mall, End of the mall","Center of the mall, End of the mall"
110,4/7/2024 8:09,I acknowledge and authorize the use of the gat...,Male,20-29,Student,Not working,Services (ATM machines),Services,"Beauty and Personal Care, Health and Wellness",Apparel and Fashion,...,3,"West/East sides of the mall, alternately",Yes,Food and Beverages,Services,"Beauty, Personnel, Health and Wellness",Food and Beverages,Near/Beside the entrance,Center of the mall,"Center of the mall, End of the mall"
111,4/7/2024 18:58,I acknowledge and authorize the use of the gat...,Male,20-29,Student,"Below ₱10,000",Department Stores,Department Stores,Apparel and Fashion,Electronics and Gadgets,...,3,"West/East sides of the mall, alternately",Yes,Apparel and Fashion,Apparel and Fashion,Electronics and Gadgets,Electronics and Gadgets,Near/Beside the entrance,Center of the mall,Near/Beside the entrance


In [39]:
# Extract the comfort room location preference column
comfort_room_location = df['comfort_room_location']

# Count each preference; value_counts() already sorts by descending frequency
location_distribution = comfort_room_location.value_counts()

# Calculate percentages
total = len(comfort_room_location)
location_percentage = (location_distribution / total * 100).round(2)

# Create a DataFrame to display both count and percentage
location_dist_df = pd.DataFrame({
    'Count': location_distribution,
    'Percentage (%)': location_percentage
})

location_dist_df

comfort_room_location,Count,Percentage (%)
Both sides of the mall,77,68.14
"West/East sides of the mall, alternately",24,21.24
Center,10,8.85
North Side side (only),2,1.77
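
The count-and-percentage table above could be reused for the other single-column objectives (for example `sex`, `age_range`, `occupation`, and `monthly_income`). Below is a minimal sketch of such a helper. The function name `distribution_table` and the sample values are hypothetical, and unlike the cell above it divides by the number of non-missing answers, which is an assumption about how missing responses should be treated.

```python
import pandas as pd


def distribution_table(series: pd.Series) -> pd.DataFrame:
    """Return counts and percentages for one categorical column."""
    # value_counts() sorts by descending frequency and skips missing values.
    counts = series.value_counts()
    # Assumption: percentages are relative to non-missing answers only.
    percentages = (counts / counts.sum() * 100).round(2)
    return pd.DataFrame({"Count": counts, "Percentage (%)": percentages})


# Hypothetical sample; in the notebook this might be df["sex"].
sample = pd.Series(["Female", "Female", "Male", "Female"], name="sex")
table = distribution_table(sample)
print(table)
```

If the column has no missing values, the result matches the approach used in the cell above.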


In [52]:
# Reset index to convert location names to a column
plot_df = location_dist_df.reset_index()
plot_df.columns = ['Location', 'Count', 'Percentage (%)']
# Create a radial plot for comfort room locations
radial_chart = alt.Chart(plot_df).encode(
    theta=alt.Theta("Count:Q").stack(True),
    radius=alt.Radius("Count:Q").scale(type="sqrt", zero=True, rangeMin=20),
    color=alt.Color("Location:N", legend=alt.Legend(title="Comfort Room Location")),
    tooltip=['Location', 'Count', 'Percentage (%)']
).properties(
    title='Comfort Room Location Preferences',
    width=400,
    height=400
)
# Create the arc layer
arc_layer = radial_chart.mark_arc(innerRadius=20, stroke="#fff")
# Create the text layer
text_layer = radial_chart.mark_text(radiusOffset=10).encode(text="Count:Q")
# Combine the layers
final_chart = arc_layer + text_layer

final_chart