# Airline Passenger Satisfaction dataset description

## Problem
Airline passenger satisfaction can be complex and multifaceted, influenced by various factors across different stages of the travel experience:
1. Passengers have diverse preferences and expectations regarding airline travel. This diversity makes it challenging for airlines to meet the needs and preferences of all passengers simultaneously.
2. Service quality can vary widely across different airlines, routes, aircraft types, and even individual flights. Factors such as crew behavior, cleanliness, on-time performance, seat comfort, and food quality can all impact passenger satisfaction. Maintaining consistent service quality across diverse operations can be challenging for airlines.
3. Airlines face numerous operational challenges affecting passenger satisfaction, such as flight delays, cancellations, overbooking, lost luggage, and customer service issues, which are sometimes beyond the airline's control.

## Data mining tasks
Our project aims to understand customer satisfaction in the airline industry and to identify the factors that influence it, uncovering patterns, relationships, and areas for improvement. The first task is classification: we relate satisfaction levels to customers' travel patterns, focusing on the differences between personal and business travel and between business and economy class. By analyzing travel purpose and seating class, we will determine whether business travelers are more satisfied and whether the service class affects satisfaction. The second task is clustering: we will analyze the impact of service quality factors, such as departure and arrival delay in minutes, on customer satisfaction, and group customers by their satisfaction levels and service quality ratings to identify segments with similar preferences and experiences. This analysis will provide insight into the key drivers of satisfaction and guide targeted improvements to the overall customer experience.
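
As a minimal sketch of the comparison described above, the satisfaction rate can be computed per travel purpose and per class. The rows below are hypothetical and only illustrate the shape of the data; `satisfaction_rate_by` is an illustrative helper, not part of the project code.

```python
from collections import defaultdict

# Hypothetical sample rows; the values are illustrative, not taken from the dataset.
rows = [
    {"Type of Travel": "Business travel", "Class": "Business", "satisfaction": "satisfied"},
    {"Type of Travel": "Business travel", "Class": "Business", "satisfaction": "satisfied"},
    {"Type of Travel": "Business travel", "Class": "Eco", "satisfaction": "neutral or dissatisfied"},
    {"Type of Travel": "Personal Travel", "Class": "Eco", "satisfaction": "neutral or dissatisfied"},
    {"Type of Travel": "Personal Travel", "Class": "Eco", "satisfaction": "satisfied"},
    {"Type of Travel": "Personal Travel", "Class": "Eco Plus", "satisfaction": "neutral or dissatisfied"},
]

def satisfaction_rate_by(rows, attribute):
    """Return {attribute value: share of rows labelled 'satisfied'}."""
    counts = defaultdict(lambda: [0, 0])  # value -> [satisfied count, total count]
    for row in rows:
        key = row[attribute]
        counts[key][1] += 1
        if row["satisfaction"] == "satisfied":
            counts[key][0] += 1
    return {key: satisfied / total for key, (satisfied, total) in counts.items()}

by_travel = satisfaction_rate_by(rows, "Type of Travel")
by_class = satisfaction_rate_by(rows, "Class")
print(by_travel)
print(by_class)
```

A higher rate for one group in the real sample would only suggest a relationship; a proper comparison would also need the group sizes and a significance check.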

## The goal of collecting this dataset
Analyzing this data to understand and improve passenger satisfaction is crucial for an airline that wants to attract and retain customers, build brand loyalty, and differentiate itself in a competitive market. The intended approach is to list the issues found, brainstorm solutions, and order the issues by their effect on the airline so that the most impactful ones are tackled first. Addressing these challenges would enhance the overall travel experience and drive long-term success.

## Dataset source
The source: [Airline Passenger Satisfaction on Kaggle](https://www.kaggle.com/datasets/teejmahal20/airline-passenger-satisfaction/data)

The sample: [sample file on SharePoint](https://studentksuedu-my.sharepoint.com/:x:/g/personal/443200417_student_ksu_edu_sa/Ec3dYaSbv1RGmES0SjVtsOwBloVBn_6aO17v7ZuLztfLow?e=5tZlSF)

Classification: Predicting customer satisfaction. We will attempt to create a model that predicts whether a customer was satisfied or not (neutral or dissatisfied) with the experience and/or service that an airline provided.
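
Before training any model, it can help to fix a reference point. The sketch below encodes the two class labels as 0 and 1 and measures a majority-class baseline, which any real classifier should beat. The label lists are hypothetical, and the train/test split is assumed rather than taken from the project.

```python
from collections import Counter

# Hypothetical labels; in practice these come from the `satisfaction` column.
train_labels = [
    "satisfied", "neutral or dissatisfied", "neutral or dissatisfied",
    "satisfied", "neutral or dissatisfied",
]
test_labels = [
    "neutral or dissatisfied", "satisfied",
    "neutral or dissatisfied", "neutral or dissatisfied",
]

def encode(label):
    """Map the two class labels to 1 (satisfied) and 0 (neutral or dissatisfied)."""
    return 1 if label == "satisfied" else 0

y_train = [encode(label) for label in train_labels]
y_test = [encode(label) for label in test_labels]

# Majority-class baseline: always predict the most common training label.
majority = Counter(y_train).most_common(1)[0][0]
predictions = [majority] * len(y_test)
accuracy = sum(p == t for p, t in zip(predictions, y_test)) / len(y_test)
print(majority, accuracy)
```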

Clustering:
Clustering customer survey responses involves grouping similar data points based on common characteristics to establish distinct clusters. Different techniques will be employed to identify the optimal number of clusters that align with our dataset. Subsequently, evaluation methods will be applied to gauge the efficacy of these clusters in effectively categorizing survey responses based on their intrinsic features, facilitating insightful predictions and analysis.
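
One common way to choose the number of clusters is the elbow method: run k-means for several values of k and look for the point where the within-cluster sum of squares stops dropping sharply. The sketch below is a plain-Python illustration under stated assumptions: the two-dimensional points stand in for pairs of rating attributes, the initial centroids are picked deterministically for reproducibility, and the source does not say which technique the project actually uses.

```python
import math

def kmeans(points, k, iterations=20):
    """Plain k-means (Lloyd's algorithm) with deterministic initial centroids."""
    # Assumption: take evenly spaced input points as initial centroids so the
    # sketch is reproducible; real runs usually use several random restarts.
    step = len(points) // k
    centroids = [points[i * step] for i in range(k)]
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members
        # (an empty cluster keeps its previous centroid).
        centroids = [
            tuple(sum(axis) / len(members) for axis in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return centroids, clusters

def inertia(centroids, clusters):
    """Within-cluster sum of squared distances (lower means tighter clusters)."""
    return sum(math.dist(p, c) ** 2 for c, members in zip(centroids, clusters) for p in members)

# Hypothetical (rating, rating) pairs forming two visible groups.
points = [(1, 1), (1, 2), (2, 1), (4, 5), (5, 4), (5, 5)]
inertias = {k: inertia(*kmeans(points, k)) for k in (1, 2, 3)}
for k, value in inertias.items():
    print(k, round(value, 3))
```

On this toy data the inertia falls steeply from k=1 to k=2 and only slightly from k=2 to k=3, which is the "elbow" suggesting two clusters.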

Defect Prediction:
Defect prediction in a dataset of customer survey responses involves identifying potential issues or anomalies within the collected data. This process encompasses the detection of inconsistencies or irregularities in survey responses, including the identification of missing or unreliable data points, which can impact the reliability of the dataset. By employing advanced data analysis techniques, defect prediction aims to enhance the overall quality and dependability of the dataset, ensuring it accurately captures the intended information from survey participants. This proactive approach reduces the risk of generating misleading or unreliable outcomes when utilizing the dataset for subsequent tasks and applications, such as predictive modeling or trend analysis based on customer feedback.
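
A simple rule-based pass can catch the most obvious defects before any modeling. The sketch below flags missing values, ratings outside the documented 0 to 5 scale, and negative delays. The rows are hypothetical, only a subset of the rating columns is listed, and treating a negative delay as a defect is an assumption of this example.

```python
# Rating attributes are documented as integers from 0 to 5 (subset for brevity).
RATING_COLUMNS = ["Inflight wifi service", "Seat comfort", "Cleanliness"]

# Hypothetical survey rows; None marks a missing value.
rows = [
    {"id": 1, "Inflight wifi service": 3, "Seat comfort": 4,
     "Cleanliness": 5, "Arrival Delay in Minutes": 0},
    {"id": 2, "Inflight wifi service": 7, "Seat comfort": 2,
     "Cleanliness": 1, "Arrival Delay in Minutes": 12},
    {"id": 3, "Inflight wifi service": 0, "Seat comfort": None,
     "Cleanliness": 3, "Arrival Delay in Minutes": -5},
]

def find_defects(row):
    """Return a list of human-readable problems found in one row."""
    problems = []
    # Missing values in any column.
    for column, value in row.items():
        if value is None:
            problems.append(f"{column}: missing value")
    # Ratings must stay on the 0-5 scale.
    for column in RATING_COLUMNS:
        value = row.get(column)
        if value is not None and not 0 <= value <= 5:
            problems.append(f"{column}: rating {value} outside 0-5")
    # Assumption: a delay cannot be negative.
    delay = row.get("Arrival Delay in Minutes")
    if delay is not None and delay < 0:
        problems.append(f"Arrival Delay in Minutes: negative delay {delay}")
    return problems

report = {row["id"]: find_defects(row) for row in rows}
for row_id, problems in report.items():
    print(row_id, problems)
```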

## General information

Number of attributes: 24. Number of objects: 500. Class label: `satisfaction`, which has two levels: "satisfied" if the passenger is satisfied, and "neutral or dissatisfied" if not.

1. id: Unique identifier for the passenger
2. Gender: Gender of the passenger (Female, Male)
3. Customer Type: The customer type (Loyal customer, disloyal customer)
4. Age: The actual age of the passenger
5. Type of Travel: Purpose of the passenger's flight (Personal Travel, Business Travel)
6. Class: Travel class of the passenger in the plane (Business, Eco, Eco Plus)
7. Flight distance: The flight distance of this journey
8. Inflight wifi service: Satisfaction level with the inflight wifi service (0: Not Applicable; 1-5)
9. Departure/Arrival time convenient: Satisfaction level with the convenience of the departure/arrival time
10. Ease of Online booking: Satisfaction level with online booking
11. Gate location: Satisfaction level with the gate location
12. Food and drink: Satisfaction level with food and drink
13. Online boarding: Satisfaction level with online boarding
14. Seat comfort: Satisfaction level with seat comfort
15. Inflight entertainment: Satisfaction level with inflight entertainment
16. On-board service: Satisfaction level with on-board service
17. Leg room service: Satisfaction level with leg room service
18. Baggage handling: Satisfaction level with baggage handling
19. Check-in service: Satisfaction level with check-in service
20. Inflight service: Satisfaction level with inflight service
21. Cleanliness: Satisfaction level with cleanliness
22. Departure Delay in Minutes: Minutes delayed at departure
23. Arrival Delay in Minutes: Minutes delayed on arrival
24. Satisfaction: Airline satisfaction level (satisfied, neutral or dissatisfied)



| Attribute Name                     | Data Type | Description                                                               | Possible Values                                            |
|------------------------------------|-----------|---------------------------------------------------------------------------|------------------------------------------------------------|
| id                                 | Integer   | Unique identifier for the passenger                                       | Numeric IDs                                                |
| Gender                             | String    | Gender of the passenger                                                   | Male, Female                                               |
| Customer Type                      | String    | Type of the customer                                                      | Loyal Customer, disloyal Customer                          |
| Age                                | Integer   | Age of the passenger                                                      | Numeric ages                                               |
| Type of Travel                     | String    | Purpose of the flight                                                     | Business travel, Personal Travel                           |
| Class                              | String    | Class of the flight                                                       | Eco, Business, Eco Plus                                    |
| Flight Distance                    | Integer   | Distance of the flight                                                    | Numeric distances  31 to 4983                              |
| Inflight wifi service              | Integer   | Satisfaction level with inflight wifi service                             | 0 to 5                                                     |
| Departure/Arrival time convenient  | Integer   | Satisfaction level with departure/arrival time convenience                | 0 to 5                                                     |
| Ease of Online booking             | Integer   | Satisfaction level with online booking ease                               | 0 to 5                                                     |
| Gate location                      | Integer   | Satisfaction level with gate location                                     | 0 to 5                                                     |
| Food and drink                     | Integer   | Satisfaction level with food and drink                                    | 0 to 5                                                     |
| Online boarding                    | Integer   | Satisfaction level with online boarding                                   | 0 to 5                                                     |
| Seat comfort                       | Integer   | Satisfaction level with seat comfort                                      | 0 to 5                                                     |
| Inflight entertainment             | Integer   | Satisfaction level with inflight entertainment                            | 0 to 5                                                     |
| On-board service                   | Integer   | Satisfaction level with on-board service                                  | 0 to 5                                                     |
| Leg room service                   | Integer   | Satisfaction level with leg room service                                  | 0 to 5                                                     |
| Baggage handling                   | Integer   | Satisfaction level with baggage handling                                  | 0 to 5                                                     |
| Checkin service                    | Integer   | Satisfaction level with checkin service                                   | 0 to 5                                                     |
| Inflight service                   | Integer   | Satisfaction level with inflight service                                  | 0 to 5                                                     |
| Cleanliness                        | Integer   | Satisfaction level with cleanliness                                       | 0 to 5                                                     |
| Departure Delay in Minutes         | Integer   | Minutes delayed at departure                                              | Numeric minutes 0-1592                                     |
| Arrival Delay in Minutes           | Integer   | Minutes delayed on arrival                                                | Numeric minutes 0-158000                                   |
| satisfaction                       | String    | Overall satisfaction with the flight                                      | satisfied, neutral or dissatisfied                         |
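
The data types in the table translate directly into a loading step. The following sketch parses CSV text with the standard library and casts every column that the table does not list as a string to an integer. The two sample rows and the `load_rows` helper are hypothetical, only a subset of the columns is shown, and blank cells would need extra handling that this sketch does not include.

```python
import csv
import io

# Columns the attribute table lists as strings; every other column is an integer.
STRING_COLUMNS = {"Gender", "Customer Type", "Type of Travel", "Class", "satisfaction"}

# Hypothetical two-row extract with a subset of the columns.
SAMPLE_CSV = """id,Gender,Age,Class,Flight Distance,Seat comfort,satisfaction
1,Male,34,Business,2100,5,satisfied
2,Female,22,Eco,460,2,neutral or dissatisfied
"""

def load_rows(text):
    """Parse CSV text and cast the non-string columns to int."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        rows.append({
            name: value if name in STRING_COLUMNS else int(value)
            for name, value in record.items()
        })
    return rows

rows = load_rows(SAMPLE_CSV)
for row in rows:
    print(row)
```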
