Explore my Kaggle competition solution repository for the year 2912. Join in to help rescue passengers trapped in an alternate dimension!

Tikhon-Radkevich/SpaceshipTitanic

Kaggle Competition: Spaceship Titanic Challenge

Description

Welcome to the year 2912, where your data science skills are needed to solve a cosmic mystery. We've received a transmission from four lightyears away and things aren't looking good.

The Spaceship Titanic was an interstellar passenger liner launched a month ago. With almost 13,000 passengers on board, the vessel set out on its maiden voyage transporting emigrants from our solar system to three newly habitable exoplanets orbiting nearby stars.

While rounding Alpha Centauri en route to its first destination, the torrid 55 Cancri E, the unwary Spaceship Titanic collided with a spacetime anomaly hidden within a dust cloud. Sadly, it met a fate similar to that of its namesake from 1000 years before. Though the ship stayed intact, almost half of the passengers were transported to an alternate dimension!

To help rescue crews and retrieve the lost passengers, you are challenged to predict which passengers were transported by the anomaly using records recovered from the spaceship’s damaged computer system.

Help save them and change history!

Evaluation

Metric: Submissions are evaluated based on their classification accuracy, the percentage of predicted labels that are correct.
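
As a rough illustration of the metric, accuracy can be computed as the share of predictions that equal the true labels. The sketch below uses only the Python standard library; the function name `accuracy` and the label lists are made up for this example and are not taken from the repository.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy of an empty list")
    # Booleans sum as 0/1, so this counts the matching positions.
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Made-up labels for illustration only (not competition data).
y_true = [True, False, True, True]
y_pred = [True, False, False, True]
print(accuracy(y_true, y_pred))  # 3 of 4 correct -> 0.75
```

With scikit-learn installed, `sklearn.metrics.accuracy_score(y_true, y_pred)` should give the same value.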

Submission Format

A submission is a CSV file with a header row and two columns, PassengerId and Transported:

PassengerId Transported
0013_01 False
0018_01 False
0019_01 False
0021_01 False
... ...
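
A minimal sketch of writing a file in this format, assuming the predictions are already available as (PassengerId, Transported) pairs. The helper name `write_submission` is hypothetical, and the predictions are placeholders copied from the sample rows above. The standard-library `csv` module is used here so the example has no dependencies.

```python
import csv
import io


def write_submission(rows, out):
    """Write (passenger_id, transported) pairs as a two-column CSV."""
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["PassengerId", "Transported"])
    for passenger_id, transported in rows:
        # bool() normalizes the label so it is written as True or False.
        writer.writerow([passenger_id, bool(transported)])


# Placeholder predictions, not real model output.
rows = [("0013_01", False), ("0018_01", False)]
buffer = io.StringIO()
write_submission(rows, buffer)
print(buffer.getvalue())
```

To write to disk instead, pass a file opened with `open("submission.csv", "w", newline="")`. With pandas, `DataFrame.to_csv("submission.csv", index=False)` on a frame holding the same two columns is the usual equivalent.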

Libraries Used:

  • pandas
  • numpy
  • scikit-learn (imported as sklearn)
  • catboost
  • plotly

Dataset Description

In this competition, your task is to predict whether a passenger was transported to an alternate dimension during the Spaceship Titanic's collision with the spacetime anomaly. To help you make these predictions, you're given a set of personal records recovered from the ship's damaged computer system.

File and Data Field Descriptions

train.csv

  • PassengerId: A unique Id for each passenger. Each Id takes the form gggg_pp where gggg indicates a group the passenger is traveling with and pp is their number within the group. People in a group are often family members, but not always.
  • HomePlanet: The planet the passenger departed from, typically their planet of permanent residence.
  • CryoSleep: Indicates whether the passenger elected to be put into suspended animation for the duration of the voyage. Passengers in cryosleep are confined to their cabins.
  • Cabin: The cabin number where the passenger is staying. Takes the form deck/num/side, where side can be either P for Port or S for Starboard.
  • Destination: The planet the passenger will be debarking to.
  • Age: The age of the passenger.
  • VIP: Whether the passenger has paid for special VIP service during the voyage.
  • RoomService, FoodCourt, ShoppingMall, Spa, VRDeck: Amount the passenger has billed at each of the Spaceship Titanic's many luxury amenities.
  • Name: The first and last names of the passenger.
  • Transported: Whether the passenger was transported to another dimension. This is the target, the column you are trying to predict.
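
The gggg_pp and deck/num/side layouts described above can be split into separate parts, which is one common way to derive extra features. The sketch below is an assumption about how that might be done, not the repository's actual preprocessing. The function names and the cabin value `B/0/P` are illustrative.

```python
def split_passenger_id(passenger_id):
    """Split 'gggg_pp' into (group, number_within_group)."""
    group, number = passenger_id.split("_")
    # Keep the group as a string so leading zeros are preserved.
    return group, int(number)


def split_cabin(cabin):
    """Split 'deck/num/side' into (deck, num, side).

    Returns (None, None, None) when the cabin is None or empty.
    """
    if not cabin:
        return None, None, None
    deck, num, side = cabin.split("/")
    return deck, int(num), side


print(split_passenger_id("0013_01"))  # ('0013', 1)
print(split_cabin("B/0/P"))           # ('B', 0, 'P')
print(split_cabin(None))              # (None, None, None)
```

Passengers who share a group value are traveling together, so the group size is one feature that can be derived from this split.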

test.csv

  • PassengerId: Id for each passenger in the test set.
  • All other fields are the same as in train.csv, except that Transported is absent. It is the target: for each passenger in this file, predict either True or False.

sample_submission.csv

  • PassengerId: Id for each passenger in the test set.
  • Transported: The target. For each passenger, predict either True or False.
