Skip to content

Data includes airline reviews from 2006 to 2019 for popular airlines around the world. The main objective is to predict whether passengers will refer the airline to their friends.

Notifications You must be signed in to change notification settings

paulsoumyadip/airline_referral

Repository files navigation

Airline_passenger_referal_prediction

Predicting aircraft passenger referal and excavating the main influencing factors can help airlines improve their services and gain.

Airline_passenger_referal_prediction

readme photo

Air transport or aviation plays a very important role in the present transport structure of the world and surely it is considered the gift of the twentieth century to the world. In today’s fast-paced world, air transport has been a blessing to all because of its speed of transportation. This mode of transport is very useful to get the products with short delivery times quickly and safely to those who require it also allows the tourism industry in each country to have stable growth by shortening the distance among all the people who inhabit the world. Here, I have a dataset regarding the ratings of services provided by different airlines to customers. The main objective of this project is to understand how likely the passengers will recommend the airlines to others. The dataset here is quite large which initially had 131895 rows and 17 columns. On checking the data information, it was derived that there were basically two different types of data in the dataset there are 7 columns of floats64, data types 10 columns with object types. Coming to the null values and missing values in the dataset, it was observed that there was a mismatch in the non-null counts which clearly stated that a large number of missing and null values were present in the dataset.

Data is scrapped in the Spring of 2019 from the Skytrax website. Data includes airline reviews from 2006 to 2019 for popular airlines around the world with multiple-choice and free-text questions. The main objective is to predict whether passengers will refer the airline to them or not.

Objective: The main objective is to predict whether passengers will refer the airline to their friends.

Description of features in the dataset:

  • airline: Name of the airline.
  • overall: Overall point is given to the trip between 1 to 10.
  • author: Author of the trip
  • review date: Date of the Review customer review: Review of the customers in free text format
  • aircraft: Type of the aircraft
  • traveller type: Type of traveler (e.g. business, leisure)
  • cabin: Cabin at the flight date flown: Flight date
  • seat comfort: Rated between 1-5
  • cabin service: Rated between 1-5
  • foodbev: Rated between 1-5 entertainment: Rated between 1-5
  • groundservice: Rated between 1-5
  • Data Preparation:

  • Dropping rows having entire row as NAN
  • Dropping columns which don't add value for the analysis
  • Dropping duplicates
  • EDA: The primary goal of EDA is to support the analysis of data prior to making any conclusions. It may aid in the detection of apparent errors, as well as a deeper understanding of data patterns, the detection of outliers or anomalous events, and the discovery of interesting relationships between variables.

    Feature Engineering: Engineered new features based on-

  • review text such as polarity using VADER.
  • overall rating column such as positive or negative review
  • onehot encoding on cabin_type ,Airlines and travller_type to create new features.
  • Model Building: Built different classifier models such as

  • Logistic Regression
  • DecisionTree
  • XG Boost
  • SVM
  • Random Forest
  • K-Nearest Neighbour
  • The performance is exceptionally good but we saw a scope of improvement where we can detect anomalies and replace the recommended column with the correct one. All models are working great on this dataset and getting a good range of accuracies around 95%, which is pretty good.

    About

    Data includes airline reviews from 2006 to 2019 for popular airlines around the world. The main objective is to predict whether passengers will refer the airline to their friends.

    Topics

    Resources

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published