# Introduction
Anny seriously loves Japanese food, so at the beginning of 2021 he decides to embark on a risky venture and opens a cute little restaurant that sells his 3 favourite foods: **sushi, curry and ramen.**

Anny’s Diner needs your help to stay afloat. The restaurant has captured some very basic data from its first few months of operation but has no idea how to use that data to help run the business.

## Problem Statement
Anny wants to use the data to answer a few simple questions about his customers, especially about their visiting patterns, how much money they’ve spent and also which menu items are their favourite. Having this deeper connection with his customers will help him deliver a better and more personalised experience for his loyal customers.

He plans to use these insights to decide whether he should expand the existing customer loyalty program. He also needs help generating some basic datasets so his team can easily inspect the data without needing to use SQL.

Anny has provided you with only a sample of his overall customer data due to privacy issues, but he hopes that these examples are enough for you to write fully functioning pandas code to help him answer his questions!

Anny has shared with you 3 key datasets for this case study:

- sales
- menu
- members

### 1. Bring in the necessary libraries for your work. Import the tools and resources needed to accomplish your tasks.
### 2. Import the necessary data for analysis. Bring in the information that you need to examine and draw insights from.
### 3. Explore the details of all datasets by checking their information.
### 4. Make sure that each type of information (like numbers or dates) is stored in the correct way. This helps ensure that the data is accurate and ready for analysis, making your work more reliable and meaningful.

### 1. What is the total amount each customer spent at the restaurant?
### 2. On how many days has each customer visited the restaurant?
### 3. What was the first item from the menu purchased by each customer?
### 4. What is the most purchased item on the menu and how many times was it purchased by all customers?
### 5. Which item was the most popular for each customer?
### 6. Which item was purchased first by the customer after they became a member?
### 7. Which item was purchased just before the customer became a member?
### 8. What is the total number of items and the amount spent for each member before they became a member?
### 9. If each $1 spent equates to 10 points and sushi has a 2x points multiplier, how many points would each customer have?
### 10. In the first week after a customer joins the program (including their join date) they earn 2x points on all items, not just sushi. How many points do customers A and B have at the end of January?


In [1]:
import pandas as pd
import numpy as np

sales_df = pd.read_csv('sales.csv')
menu_df = pd.read_csv('menu.csv')
members_df = pd.read_csv('members.csv')

print("Sales Data:")
print(sales_df.head())

print("\nMenu Data:")
print(menu_df.head())

print("\nMembers Data:")
print(members_df.head())

Sales Data:
  customer_id  order_date  product_id
0           A  2021-01-01           1
1           A  2021-01-01           2
2           A  2021-01-07           2
3           A  2021-01-10           3
4           A  2021-01-11           3

Menu Data:
   product_id product_name  price
0           1        sushi     10
1           2        curry     15
2           3        ramen     12

Members Data:
  customer_id   join_date
0           A  2021-01-07
1           B  2021-01-09


In [2]:
print("Sales Data Info:")
print(sales_df.info())

print("\nMenu Data Info:")
print(menu_df.info())

print("\nMembers Data Info:")
print(members_df.info())

Sales Data Info:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 15 entries, 0 to 14
Data columns (total 3 columns):
 #   Column       Non-Null Count  Dtype 
---  ------       --------------  ----- 
 0   customer_id  15 non-null     object
 1   order_date   15 non-null     object
 2   product_id   15 non-null     int64 
dtypes: int64(1), object(2)
memory usage: 492.0+ bytes
None

Menu Data Info:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3 entries, 0 to 2
Data columns (total 3 columns):
 #   Column        Non-Null Count  Dtype 
---  ------        --------------  ----- 
 0   product_id    3 non-null      int64 
 1   product_name  3 non-null      object
 2   price         3 non-null      int64 
dtypes: int64(2), object(1)
memory usage: 204.0+ bytes
None

Members Data Info:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 2 entries, 0 to 1
Data columns (total 2 columns):
 #   Column       Non-Null Count  Dtype 
---  ------       --------------  ----- 
 0   customer_id  2 non-nul

In [3]:
print("Sales Data Types:")
print(sales_df.dtypes)

print("\nMenu Data Types:")
print(menu_df.dtypes)

print("\nMembers Data Types:")
print(members_df.dtypes)

Sales Data Types:
customer_id    object
order_date     object
product_id      int64
dtype: object

Menu Data Types:
product_id       int64
product_name    object
price            int64
dtype: object

Members Data Types:
customer_id    object
join_date      object
dtype: object


In [4]:
sales_df['order_date'] = pd.to_datetime(sales_df['order_date'])
members_df['join_date'] = pd.to_datetime(members_df['join_date'])
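
Passing an explicit format makes the conversion fail loudly on a malformed date instead of guessing. The following is a minimal sketch, assuming the dates are ISO `YYYY-MM-DD` strings as in the printed samples; the small `sample` frame is a hypothetical stand-in for `sales_df`.

```python
import pandas as pd

# Hypothetical stand-in for sales_df, using the date format shown in the output above
sample = pd.DataFrame({
    "customer_id": ["A", "A", "A"],
    "order_date": ["2021-01-01", "2021-01-07", "2021-01-10"],
    "product_id": [1, 2, 3],
})

# With an explicit format, any string that does not match raises a ValueError
sample["order_date"] = pd.to_datetime(sample["order_date"], format="%Y-%m-%d")

# Confirm the column is now a datetime dtype rather than object
is_datetime = pd.api.types.is_datetime64_any_dtype(sample["order_date"])
print(sample.dtypes)
```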

In [5]:
merged_df = pd.merge(sales_df, menu_df, on='product_id', how='left')

merged_df = pd.merge(merged_df, members_df, on='customer_id', how='left')
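
A left join silently duplicates sales rows if a lookup table ever contains a repeated key. The sketch below shows one way to guard against that with the `validate` argument of `merge`, and also derives a member flag of the kind that could feed a dataset for inspection without SQL. It uses only the five sales rows printed under In [1]; the `is_member` column name is an assumption, not something defined in the case study.

```python
import pandas as pd

# Only the five sales rows printed above; the full file has 15 rows
sales = pd.DataFrame({
    "customer_id": ["A", "A", "A", "A", "A"],
    "order_date": pd.to_datetime(
        ["2021-01-01", "2021-01-01", "2021-01-07", "2021-01-10", "2021-01-11"]
    ),
    "product_id": [1, 2, 2, 3, 3],
})
menu = pd.DataFrame({
    "product_id": [1, 2, 3],
    "product_name": ["sushi", "curry", "ramen"],
    "price": [10, 15, 12],
})
members = pd.DataFrame({
    "customer_id": ["A", "B"],
    "join_date": pd.to_datetime(["2021-01-07", "2021-01-09"]),
})

# validate="many_to_one" raises MergeError if the right-hand table has duplicate keys
joined = sales.merge(menu, on="product_id", how="left", validate="many_to_one")
joined = joined.merge(members, on="customer_id", how="left", validate="many_to_one")

# Hypothetical flag: True when the order falls on or after the join date.
# Non-members have a missing join_date (NaT), and the comparison evaluates to False.
joined["is_member"] = joined["order_date"] >= joined["join_date"]
print(joined)
```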

In [6]:
total_spent_by_customer = merged_df.groupby('customer_id')['price'].sum()
print("Total Amount Spent by Each Customer:")
print(total_spent_by_customer)

Total Amount Spent by Each Customer:
customer_id
A    76
B    74
C    36
Name: price, dtype: int64


In [7]:
days_visited_by_customer = merged_df.groupby('customer_id')['order_date'].nunique()
print("\nNumber of Days Each Customer Visited the Restaurant:")
print(days_visited_by_customer)


Number of Days Each Customer Visited the Restaurant:
customer_id
A    4
B    6
C    2
Name: order_date, dtype: int64


In [8]:
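# Sort by date so that first() follows purchase order rather than file order.
# Orders carry no timestamp, so same-day purchases keep their original row order.
first_item_purchased_by_customer = (merged_df.sort_values('order_date', kind='stable')
                                    .groupby('customer_id')['product_name'].first())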
print("\nFirst Item Purchased by Each Customer:")
print(first_item_purchased_by_customer)


First Item Purchased by Each Customer:
customer_id
A    sushi
B    curry
C    ramen
Name: product_name, dtype: object
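
Because `order_date` has no time component, a customer who buys several items on their first day has no single "first" item. The sample rows show this for customer A, who ordered both sushi and curry on 2021-01-01. A possible sketch that reports every item from the earliest date, using only the five rows printed under In [1] with product names filled in from the menu:

```python
import pandas as pd

# Only the five sales rows printed above, with names looked up from the menu table
orders = pd.DataFrame({
    "customer_id": ["A", "A", "A", "A", "A"],
    "order_date": pd.to_datetime(
        ["2021-01-01", "2021-01-01", "2021-01-07", "2021-01-10", "2021-01-11"]
    ),
    "product_name": ["sushi", "curry", "curry", "ramen", "ramen"],
})

# Earliest order date per customer, broadcast back onto every row
first_date = orders.groupby("customer_id")["order_date"].transform("min")

# Keep every distinct item bought on that first date
first_items = (
    orders.loc[orders["order_date"] == first_date, ["customer_id", "product_name"]]
    .drop_duplicates()
    .groupby("customer_id")["product_name"]
    .agg(list)
)
print(first_items)
```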


In [9]:
# Count purchases per item once and reuse the result
item_counts = merged_df['product_name'].value_counts()
most_purchased_item = item_counts.idxmax()
count_most_purchased_item = item_counts.max()
print("\nMost Purchased Item on the Menu:")
print("Product Name:", most_purchased_item)
print("Count:", count_most_purchased_item)


Most Purchased Item on the Menu:
Product Name: ramen
Count: 8


In [10]:
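# Count purchases per (customer, item), then take the index label of each customer's highest count.
# idxmax() returns the full (customer_id, product_name) label and, on a tie, only the first one.
most_popular_item_by_customer = (merged_df.groupby('customer_id')['product_name'].value_counts()
                                 .groupby('customer_id').idxmax())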
print("\nMost Popular Item for Each Customer:")
print(most_popular_item_by_customer)


Most Popular Item for Each Customer:
customer_id
A    (A, ramen)
B    (B, curry)
C    (C, ramen)
Name: count, dtype: object
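
The result above shows index tuples and reports a single item per customer even when two items are tied. One alternative, sketched here on the five rows printed under In [1] only, ranks the per-customer counts so that all tied favourites are kept and the output is a flat table. The column names `times_purchased` and `rank` are illustrative choices.

```python
import pandas as pd

# Only the five sales rows printed above, with names looked up from the menu table
orders = pd.DataFrame({
    "customer_id": ["A", "A", "A", "A", "A"],
    "product_name": ["sushi", "curry", "curry", "ramen", "ramen"],
})

# One row per (customer, item) with the number of purchases
counts = (
    orders.groupby(["customer_id", "product_name"])
    .size()
    .reset_index(name="times_purchased")
)

# rank(method="min") gives every item tied for the highest count a rank of 1
counts["rank"] = counts.groupby("customer_id")["times_purchased"].rank(
    method="min", ascending=False
)
top_items = counts[counts["rank"] == 1].drop(columns="rank").reset_index(drop=True)
print(top_items)
```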


In [11]:
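# Keep orders on or after the join date, sorted by date so first() follows purchase order
first_purchase_after_join = (merged_df[merged_df['order_date'] >= merged_df['join_date']]
                             .sort_values('order_date', kind='stable')
                             .groupby('customer_id')['product_name'].first())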
print("\nItem Purchased First After Joining for Each Customer:")
print(first_purchase_after_join)


Item Purchased First After Joining for Each Customer:
customer_id
A    curry
B    sushi
Name: product_name, dtype: object


In [12]:
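# Keep orders strictly before the join date, sorted by date so last() follows purchase order.
# Same-day purchases have no timestamp, so the last row of that day is reported.
last_purchase_before_join = (merged_df[merged_df['order_date'] < merged_df['join_date']]
                             .sort_values('order_date', kind='stable')
                             .groupby('customer_id')['product_name'].last())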
print("\nItem Purchased Just Before Customer Became a Member:")
print(last_purchase_before_join)


Item Purchased Just Before Customer Became a Member:
customer_id
A    curry
B    sushi
Name: product_name, dtype: object


In [13]:
total_items_amount_before_join = (merged_df[merged_df['order_date'] < merged_df['join_date']]
                                  .groupby('customer_id').agg({'product_name': 'count', 'price': 'sum'}))
print("\nTotal Items and Amount Spent for Each Member Before Joining:")
print(total_items_amount_before_join)


Total Items and Amount Spent for Each Member Before Joining:
             product_name  price
customer_id                     
A                       2     25
B                       3     40


In [14]:
merged_df['points'] = merged_df['price'] * 10
merged_df.loc[merged_df['product_name'] == 'sushi', 'points'] *= 2

total_points_by_customer = merged_df.groupby('customer_id')['points'].sum()
print("\nTotal Points for Each Customer:")
print(total_points_by_customer)


Total Points for Each Customer:
customer_id
A    860
B    940
C    360
Name: points, dtype: int64
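
The points rule can also be expressed as a lookup of multipliers, which avoids modifying a column in place and makes it easy to add further multipliers later. This is a sketch only: the `multipliers` dictionary and the `points_per_item` column are hypothetical names, and the menu values are those printed under In [1].

```python
import pandas as pd

menu = pd.DataFrame({
    "product_id": [1, 2, 3],
    "product_name": ["sushi", "curry", "ramen"],
    "price": [10, 15, 12],
})

POINTS_PER_DOLLAR = 10
multipliers = {"sushi": 2}  # hypothetical lookup; items not listed default to 1x

# Map each item to its multiplier, falling back to 1 where no entry exists
item_multiplier = menu["product_name"].map(multipliers).fillna(1).astype(int)
menu["points_per_item"] = menu["price"] * POINTS_PER_DOLLAR * item_multiplier
print(menu)
```

Merging `points_per_item` onto the sales rows and summing by `customer_id` would then give the per-customer totals.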


In [15]:
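# Orders placed during the first week of membership (join date through join date + 6 days).
# Note: this sums the standard points from In [14] for that week only; it does not apply
# the 2x first-week multiplier and excludes purchases made outside that week.
first_week_mask = ((merged_df['order_date'] >= merged_df['join_date']) &
                   (merged_df['order_date'] <= merged_df['join_date'] + pd.Timedelta(days=6)))
first_week_df = merged_df[first_week_mask]
joined_first_week_points = first_week_df['points']
total_points_customer_A = joined_first_week_points[first_week_df['customer_id'] == 'A'].sum()
total_points_customer_B = joined_first_week_points[first_week_df['customer_id'] == 'B'].sum()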

print("\nPoints for Customer A in the First Week After Joining:", total_points_customer_A)
print("Points for Customer B in the First Week After Joining:", total_points_customer_B)


Points for Customer A in the First Week After Joining: 510
Points for Customer B in the First Week After Joining: 200
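
Question 10 asks for each member's points at the end of January, with every item earning 2x during the first membership week. One way to combine these rules is sketched below. It uses only the five sales rows printed under In [1], so the total is not the answer for the full 15-row file. It also assumes the multipliers do not stack, meaning sushi bought in the first week earns 2x rather than 4x. The `promo_points` column name is illustrative.

```python
import numpy as np
import pandas as pd

# Only the five sales rows printed under In [1]; the full file has 15 rows
sales = pd.DataFrame({
    "customer_id": ["A", "A", "A", "A", "A"],
    "order_date": pd.to_datetime(
        ["2021-01-01", "2021-01-01", "2021-01-07", "2021-01-10", "2021-01-11"]
    ),
    "product_id": [1, 2, 2, 3, 3],
})
menu = pd.DataFrame({
    "product_id": [1, 2, 3],
    "product_name": ["sushi", "curry", "ramen"],
    "price": [10, 15, 12],
})
members = pd.DataFrame({
    "customer_id": ["A", "B"],
    "join_date": pd.to_datetime(["2021-01-07", "2021-01-09"]),
})

df = sales.merge(menu, on="product_id", how="left")
df = df.merge(members, on="customer_id", how="left")

# First membership week: join date through join date + 6 days, inclusive
week_end = df["join_date"] + pd.Timedelta(days=6)
in_first_week = (df["order_date"] >= df["join_date"]) & (df["order_date"] <= week_end)

# Assumption: multipliers do not stack, so any qualifying order earns exactly 2x
multiplier = np.where(in_first_week | (df["product_name"] == "sushi"), 2, 1)
df["promo_points"] = df["price"] * 10 * multiplier

# Keep only orders placed up to the end of January, then total per customer
january = df["order_date"] <= pd.Timestamp("2021-01-31")
points_end_of_january = df[january].groupby("customer_id")["promo_points"].sum()
print(points_end_of_january)
```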
