This Project is about predicting the successful landing of SpaceX Falcon 9 first stage.
SpaceX advertises Falcon 9 rocket launches on its website with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore if we can determine if the first stage will land, we can determine the cost of a launch. This information can be used if an alternate company wants to bid against SpaceX for a rocket launch.
The following is an example of a successful and launch.
The repository contains all the codes and Jupiter notebooks for the final machine learning specialist training project, "Python Basics for Data Science". You can find all the solutions in notebook format here, including all the necessary steps like data collection (scraping), EDA, data analysis, visualizations, model training, and prediction for a typical Machine learning project.
All datasets can be downloaded or scraped by running the notebooks or scripts provided in the repository.
The repository uses Python3 to run the code. You can find all the necessary packages
inside the requirements.txt
file.
It is recommended to use virtualenv
to install and run the codes. After activating the virtualenv
you can run the following
command in your terminal to install all the required packages:
pip3 install -r requirements.txt
Use the following commands to run the interactive notebooks:
jupyter notebook
Steps conducted in the projects are listed in the following sections.
Collecting the following fields, using the API provided by SpaceX:
- FlightNumber
- Date
- BoosterVersion
- PayloadMass
- Orbit
- LaunchSite
- Outcome
- Flights
- GridFins
- Reused
- Legs
- LandingPad
- Block
- ReusedCount
- Serial
- Longitude
- Latitude
Scraping the dataset from the HTML page.
- FlightNumber
- Date
- BoosterVersion
- PayloadMass
- Orbit
- LaunchSite
- Outcome
- Flights
- GridFins
- Reused
- Legs
- LandingPad
- Block
- ReusedCount
- Serial
- Longitude
- Latitude
- Class
The purpose of this step is to perform exploratory Data Analysis and determine Training Labels for the next steps.
More exploratory Data Analysis and Feature Engineering using Pandas
and Matplotlib
.
EDA using SQL to understand the Spacex DataSet.
Analyzing and visualizing data and geographical patterns about launch sites.
This step marks the success/failed launches for each site on the map and performs additional steps for analyzing the launch data.
In this step, we perform the actual training and prediction by preprocessing and standardization steps on the data. The steps include spliting the training and test data and training models for the prediction.