# SpaceX Capstone Presentation Guide

This notebook is designed to help you assemble all code, charts, and explanations for your PowerPoint and PDF, organized by your outline. For each section, you will find:
- Instructions for what to screenshot from your Jupyter notebooks
- Markdown and explanations to copy into your slides
- References to the relevant notebook/code cells
- Space to add links to your GitHub or local notebooks for peer review

You can use this notebook as a master guide while building your presentation.

# Executive Summary
## Instructions:
- Summarize the methodologies used (API, web scraping, wrangling, EDA, SQL, Folium, ML)
- Summarize key results (charts, model accuracy, findings)

## Markdown Example:
### Executive Summary
- Data collected from SpaceX API and Wikipedia using web scraping
- Data wrangled and cleaned for analysis
- Exploratory Data Analysis performed with visualization and SQL
- Interactive analytics with Folium and Plotly Dash
- Predictive analysis using classification models (Logistic Regression, SVM, Decision Tree, KNN)
- Key findings: [Insert main results here]

## Screenshot:
- Take a screenshot of your summary table or main results chart (from any notebook)

## Reference:
- [Link to main notebook(s) on GitHub or local path]

# Introduction
## Instructions:
- Provide project background and context
- State the problems/questions you want to answer

## Markdown Example:
### Introduction
SpaceX aims to reduce launch costs by reusing Falcon 9 first stages. Because launch cost depends largely on whether the first stage can be reused, predicting landing success can help a competitor estimate costs when bidding against SpaceX. This project analyzes SpaceX launch data to answer:
- What factors influence landing success?
- How can we predict landing outcomes?

## Screenshot:
- Optional: Screenshot of project overview or context slide

## Reference:
- [Link to project background notebook or markdown cell]

# Methodology
## Instructions:
- Describe data collection, wrangling, EDA, SQL, Folium, ML steps
- Use flowcharts or bullet points
- For each bullet, reference the notebook and code cell to run for screenshots

## Markdown Example:
### Methodology
- Data collected via SpaceX API ([jupyter-labs-spacex-data-collection-api.ipynb], see cell: API request)
- Wikipedia web scraping ([jupyter-labs-webscraping.ipynb], see cell: requests.get and BeautifulSoup)
- Data wrangled and cleaned ([labs-jupyter-spacex-Data wrangling.ipynb], see cell: missing value analysis)
- EDA performed ([edadataviz.ipynb], see cells: scatter plots, bar charts)
- SQL queries ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], see cells: SQL queries)
- Interactive maps ([lab_jupyter_launch_site_location.ipynb], see cells: Folium map creation)
- Predictive models ([SpaceX_Machine Learning Prediction_Part_5.ipynb], see cells: model training and evaluation)

## Screenshot:
- Screenshot of your flowchart or methodology diagram (draw in PowerPoint or use a tool like draw.io)
- Screenshot of code cell outputs for each step

## Reference:
- [GitHub URLs or local paths to each notebook]

# Data Collection
## Instructions:
- Describe how datasets were collected (API, web scraping)
- Present flowcharts for each process
- Reference the code cells for API and web scraping

## Markdown Example:
### Data Collection
- SpaceX API used to collect launch data ([jupyter-labs-spacex-data-collection-api.ipynb], cell: requests.get)
- Wikipedia web scraping for historical launches ([jupyter-labs-webscraping.ipynb], cell: requests.get and BeautifulSoup)

## Screenshot:
- Screenshot of API code cell and output
- Screenshot of web scraping code cell and output

## Reference:
- [GitHub URLs or local paths to API and web scraping notebooks]

# Data Collection – SpaceX API
## Instructions:
- Present your data collection with SpaceX REST calls using key phrases and flowcharts
- Add the GitHub URL of the completed SpaceX API calls notebook (must include the completed code cells and their output cells)
- Place your flowchart of SpaceX API calls here

## Markdown Example:
### Data Collection – SpaceX API
- Used requests.get to call SpaceX REST API ([jupyter-labs-spacex-data-collection-api.ipynb], cell: spacex_url)
- Parsed JSON response and normalized into DataFrame
- [Insert flowchart image here]
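
The parse-and-normalize step above can be sketched as follows. This is a minimal illustration, not the lab's exact code: `sample_response` is a hypothetical stand-in for the parsed JSON that `requests.get(spacex_url).json()` would return, and its field names and values are assumptions.

```python
import pandas as pd

# Hypothetical stand-in for requests.get(spacex_url).json();
# the field names and values below are assumptions for illustration only.
sample_response = [
    {"flight_number": 1, "date_utc": "2010-06-04T18:45:00.000Z",
     "rocket": {"name": "Falcon 9"}, "success": True},
    {"flight_number": 2, "date_utc": "2010-12-08T15:43:00.000Z",
     "rocket": {"name": "Falcon 9"}, "success": True},
]

# Flatten the nested JSON records into a tabular DataFrame.
# Nested keys become dotted column names such as "rocket.name".
launches = pd.json_normalize(sample_response)

print(launches.columns.tolist())
print(launches.head())
```

A screenshot of the resulting `head()` output is usually enough to show that the API response was converted to a table.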

## Screenshot:
- Screenshot of API request code and output

## Reference:
- [GitHub URL to API notebook]

# Data Collection – Scraping
## Instructions:
- Present your web scraping process using key phrases and flowcharts
- Add the GitHub URL of the completed web scraping notebook
- Place your flowchart of web scraping here

## Markdown Example:
### Data Collection – Scraping
- Used requests.get and BeautifulSoup to scrape Wikipedia ([jupyter-labs-webscraping.ipynb], cell: static_url)
- Parsed HTML table and extracted launch records
- [Insert flowchart image here]
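
The table-parsing step above can be sketched as follows. This is a minimal illustration under stated assumptions: `sample_html` is a hypothetical fragment standing in for the page text that `requests.get(static_url).text` would return, and the table layout and values are invented for the example.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML fragment standing in for requests.get(static_url).text;
# the table layout and values are assumptions for illustration only.
sample_html = """
<table class="wikitable">
  <tr><th>Flight No.</th><th>Launch site</th><th>Payload</th></tr>
  <tr><td>1</td><td>Site A</td><td>Demo payload 1</td></tr>
  <tr><td>2</td><td>Site B</td><td>Demo payload 2</td></tr>
</table>
"""

soup = BeautifulSoup(sample_html, "html.parser")
table = soup.find("table", class_="wikitable")

# The header row gives the column names; the remaining rows are launch records.
headers = [th.get_text(strip=True) for th in table.find_all("th")]
records = []
for row in table.find_all("tr")[1:]:
    cells = [td.get_text(strip=True) for td in row.find_all("td")]
    records.append(dict(zip(headers, cells)))

print(records)
```

The list of dictionaries can be passed to `pandas.DataFrame(records)` to get the launch records as a table.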

## Screenshot:
- Screenshot of web scraping code and output

## Reference:
- [GitHub URL to web scraping notebook]

# Data Wrangling
## Instructions:
- Describe how data were processed
- Present your data wrangling process using key phrases and flowcharts
- Add the GitHub URL of your completed data wrangling notebooks

## Markdown Example:
### Data Wrangling
- Removed duplicates and missing values ([labs-jupyter-spacex-Data wrangling.ipynb], cell: missing value analysis)
- Feature engineering (created landing outcome labels, encoded categorical variables)
- Final cleaned dataset ready for analysis
- [Insert flowchart image here]
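
The cleaning and labeling steps above can be sketched as follows. This is one possible approach, not necessarily what your notebook does: the column names (`PayloadMass`, `Outcome`, `Class`), the outcome strings, and the choice to fill missing payloads with the mean rather than drop them are all assumptions.

```python
import pandas as pd

# Hypothetical sample rows; column names and outcome strings are assumptions.
df = pd.DataFrame({
    "PayloadMass": [500.0, None, 3000.0, 4500.0],
    "Outcome": ["True ASDS", "False ASDS", "None None", "True RTLS"],
})

# Drop exact duplicate rows (none in this sample) and renumber the index.
df = df.drop_duplicates().reset_index(drop=True)

# Fill missing payload values with the column mean.
df["PayloadMass"] = df["PayloadMass"].fillna(df["PayloadMass"].mean())

# Outcomes treated as unsuccessful landings (assumed set for this sketch).
bad_outcomes = {"False ASDS", "False Ocean", "False RTLS", "None ASDS", "None None"}

# Class = 1 for a successful landing, 0 otherwise.
df["Class"] = (~df["Outcome"].isin(bad_outcomes)).astype(int)

print(df)
```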

## Screenshot:
- Screenshot of wrangling code and output

## Reference:
- [GitHub URL to data wrangling notebook]

# EDA with Data Visualization
## Instructions:
- Summarize what charts were plotted and why you used those charts
- Add the GitHub URL of your completed EDA with data visualization notebook

## Markdown Example:
### EDA with Data Visualization
- Scatter plots: Flight Number vs. Launch Site, Payload vs. Launch Site, Flight Number vs. Orbit, Payload vs. Orbit ([edadataviz.ipynb], cells: sns.catplot)
- Bar charts: Success rate by orbit ([edadataviz.ipynb], cell: groupby mean)
- Line charts: Yearly success trend ([edadataviz.ipynb], cell: Extract_year)

## Screenshot:
- Screenshot each chart from edadataviz.ipynb

## Reference:
- [GitHub URL to EDA notebook]

# EDA with SQL
## Instructions:
- Using bullet point format, summarize the SQL queries you performed
- Add the GitHub URL of your completed EDA with SQL notebook

## Markdown Example:
### EDA with SQL
- Unique launch sites ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: SELECT DISTINCT)
- Launch sites starting with 'CCA' ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: LIKE 'CCA%')
- Total payload mass by NASA ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: SUM)
- Average payload mass by F9 v1.1 ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: AVG)
- First successful ground landing date ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: MIN)
- Successful drone ship landings with payload 4000-6000 ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: WHERE)
- Total successful/failure outcomes ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: GROUP BY)
- Boosters with max payload ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: MAX)
- 2015 failed drone ship landings ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: substr(Date,0,5)='2015')
- Ranked landing outcomes (2010-2017) ([jupyter-labs-eda-sql-coursera_sqllite.ipynb], cell: ORDER BY)

## Screenshot:
- Screenshot SQL query and output for each bullet point

## Reference:
- [GitHub URL to SQL notebook]

# Build an Interactive Map with Folium
## Instructions:
- Summarize what map objects such as markers, circles, lines, etc. you created and added to a folium map
- Explain why you added those objects
- Add the GitHub URL of your completed interactive Folium map notebook

## Markdown Example:
### Interactive Map with Folium
- Markers for launch sites ([lab_jupyter_launch_site_location.ipynb], cell: folium.Circle, folium.Marker)
- Color-coded markers for launch outcomes ([lab_jupyter_launch_site_location.ipynb], cell: marker_cluster)
- Lines and distance markers to proximities ([lab_jupyter_launch_site_location.ipynb], cell: PolyLine)
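
The distance shown on a proximity marker has to be computed separately from the map drawing. A minimal sketch, assuming a great-circle (haversine) distance is acceptable: `calculate_distance` is a helper name chosen for this example, the Earth radius is an approximation, and the coordinates are hypothetical.

```python
from math import atan2, cos, radians, sin, sqrt


def calculate_distance(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points (haversine)."""
    earth_radius_km = 6371.0  # mean Earth radius; an approximation
    lat1, lon1, lat2, lon2 = map(radians, (lat1, lon1, lat2, lon2))
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    a = sin(dlat / 2) ** 2 + cos(lat1) * cos(lat2) * sin(dlon / 2) ** 2
    return 2 * earth_radius_km * atan2(sqrt(a), sqrt(1 - a))


# Hypothetical coordinates for a launch site and a nearby coastline point.
site = (28.56, -80.57)
coast = (28.56, -80.56)
print(f"{calculate_distance(*site, *coast):.2f} km")
```

The returned value can be used as the label text of the marker at the proximity point, with a `folium.PolyLine` drawn between the two coordinates.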

## Screenshot:
- Screenshot of global map with launch sites
- Screenshot of outcome markers
- Screenshot of proximity lines/distances

## Reference:
- [GitHub URL to Folium notebook]

# Build a Dashboard with Plotly Dash
## Instructions:
- Summarize what plots/graphs and interactions you have added to a dashboard
- Explain why you added those plots and interactions
- Add the GitHub URL of your completed Plotly Dash lab

## Markdown Example:
### Dashboard with Plotly Dash
- Pie chart: Launch success count for all sites
- Pie chart: Site with highest success ratio
- Scatter plot: Payload vs. Launch Outcome (with range slider)
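
Both the pie charts and the scatter plot depend on filtering the launch table by the selected site and payload range. A minimal sketch of that filtering logic, kept independent of Dash so it can be checked on its own: `filter_launches` is a hypothetical helper, the `"ALL"` sentinel is an assumption about the dropdown, and the column names and rows are illustrative.

```python
import pandas as pd


def filter_launches(df, site, payload_range):
    """Return rows for one site (or all sites) within a payload range."""
    low, high = payload_range
    # between() is inclusive on both ends by default.
    mask = df["Payload Mass (kg)"].between(low, high)
    if site != "ALL":
        mask &= df["Launch Site"] == site
    return df[mask]


# Hypothetical sample rows; column names are assumptions for this sketch.
launches = pd.DataFrame({
    "Launch Site": ["Site A", "Site A", "Site B", "Site B"],
    "Payload Mass (kg)": [500, 5000, 3000, 9000],
    "class": [0, 1, 1, 0],
})

selected = filter_launches(launches, "ALL", (1000, 6000))
print(selected)

# Successful launches per site: the values behind the all-sites pie chart.
success_by_site = launches.groupby("Launch Site")["class"].sum()
print(success_by_site)
```

In a dashboard, a callback would call a function like this with the dropdown value and the range slider's `[low, high]` pair, then pass the filtered frame to the plotting function.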

## Screenshot:
- Screenshot of each dashboard plot/interactivity

## Reference:
- [GitHub URL to Plotly Dash notebook]

# Predictive Analysis (Classification)
## Instructions:
- Summarize how you built, evaluated, improved, and found the best performing classification model
- Present your model development process using key phrases and a flowchart
- Add the GitHub URL of your completed predictive analysis lab

## Markdown Example:
### Predictive Analysis
- Built and tuned Logistic Regression, SVM, Decision Tree, KNN models ([SpaceX_Machine Learning Prediction_Part_5.ipynb], cells: model training)
- Compared model accuracy ([SpaceX_Machine Learning Prediction_Part_5.ipynb], cell: accuracy bar chart)
- Selected best performing model ([SpaceX_Machine Learning Prediction_Part_5.ipynb], cell: confusion matrix)
- [Insert flowchart image here]
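
The build, tune, and evaluate loop above can be sketched for a single model as follows. This is an illustration under stated assumptions, not the lab's exact procedure: the data is synthetic, and the parameter grid, the 5-fold cross-validation, and the split settings are choices made for this example.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic data standing in for the launch features and Class labels.
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=2
)

# Fit the scaler on the training split only, then apply it to both splits.
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

# Search a small, assumed grid of regularization strengths with 5-fold CV.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1.0]},
    cv=5,
)
search.fit(X_train, y_train)

test_accuracy = search.score(X_test, y_test)
print("Best parameters:", search.best_params_)
print("Test accuracy:", test_accuracy)
```

Repeating the same pattern with `SVC`, `DecisionTreeClassifier`, and `KNeighborsClassifier` gives one test accuracy per model, which is what the comparison bar chart plots.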

## Screenshot:
- Screenshot of model accuracy bar chart
- Screenshot of confusion matrix for best model

## Reference:
- [GitHub URL to ML notebook]

# Results
## Instructions:
- Present the exploratory data analysis results
- Show the interactive analytics demo in screenshots
- Present the predictive analysis results

## Markdown Example:
### Results
- Key findings from EDA
- Interactive analytics demo screenshots
- Predictive analysis results

## Screenshot:
- Screenshot of main results charts/tables

## Reference:
- [GitHub URLs or local paths to relevant notebooks]

# Section 2: Visualizations and SQL Outputs
## Instructions:
- For each visualization or SQL output, run the specified code cell in the relevant notebook and take a screenshot.
- Add a short explanation for each output.

### Flight Number vs. Launch Site
- Notebook: edadataviz.ipynb
- Code: `sns.catplot(y="LaunchSite", x="FlightNumber", hue="Class", data=df, aspect=5)`
- Screenshot: Scatter plot
- Explanation: Describe the pattern you see.

### Payload vs. Launch Site
- Notebook: edadataviz.ipynb
- Code: `sns.catplot(y="LaunchSite", x="PayloadMass", hue="Class", data=df, aspect=5)`
- Screenshot: Scatter plot
- Explanation: Describe the pattern you see.

### Success Rate vs. Orbit Type
- Notebook: edadataviz.ipynb
- Code: `df.groupby('Orbit')['Class'].mean().plot(kind='bar')`
- Screenshot: Bar chart
- Explanation: Describe which orbits have highest success rates.

### Flight Number vs. Orbit Type
- Notebook: edadataviz.ipynb
- Code: `sns.catplot(y="Orbit", x="FlightNumber", hue="Class", data=df, aspect=5)`
- Screenshot: Scatter plot
- Explanation: Describe the pattern you see.

### Payload vs. Orbit Type
- Notebook: edadataviz.ipynb
- Code: `sns.catplot(y="Orbit", x="PayloadMass", hue="Class", data=df, aspect=5)`
- Screenshot: Scatter plot
- Explanation: Describe the pattern you see.

### Launch Success Yearly Trend
- Notebook: edadataviz.ipynb
- Code: Extract year and plot success rate by year
- Screenshot: Line chart
- Explanation: Describe the yearly trend.
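
The "extract year and plot" step above can be sketched as follows. This is a minimal illustration: the `Date` and `Class` column names and the sample rows are assumptions, and the plotting call is left as a comment so the computation can be checked on its own.

```python
import pandas as pd

# Hypothetical sample rows; the "Date" and "Class" column names are assumptions.
df = pd.DataFrame({
    "Date": ["2013-03-01", "2013-09-29", "2014-04-18", "2014-07-14", "2015-12-22"],
    "Class": [0, 0, 1, 0, 1],
})

# Extract the four-digit year from the ISO date string.
df["Year"] = pd.to_datetime(df["Date"]).dt.year

# The mean of the 0/1 Class column per year is the yearly success rate.
yearly_rate = df.groupby("Year")["Class"].mean()
print(yearly_rate)

# In a notebook, the line chart can then be drawn with:
# yearly_rate.plot(kind="line", marker="o")
```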

### All Launch Site Names
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT DISTINCT "Launch_Site" FROM SPACEXTABLE;`
- Screenshot: SQL output
- Explanation: List unique launch sites.

### Launch Site Names Begin with 'CCA'
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT * FROM SPACEXTABLE WHERE "Launch_Site" LIKE 'CCA%' LIMIT 5;`
- Screenshot: SQL output
- Explanation: Show 5 records.

### Total Payload Mass
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT SUM("Payload_Mass__kg_") FROM SPACEXTABLE WHERE "Customer" LIKE '%NASA (CRS)%';`
- Screenshot: SQL output
- Explanation: Show total payload mass.

### Average Payload Mass by F9 v1.1
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT AVG("Payload_Mass__kg_") FROM SPACEXTABLE WHERE "Booster_Version" = 'F9 v1.1';`
- Screenshot: SQL output
- Explanation: Show average payload mass.

### First Successful Ground Landing Date
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT MIN(Date) FROM SPACEXTABLE WHERE "Landing_Outcome" = 'Success (ground pad)';`
- Screenshot: SQL output
- Explanation: Show first successful ground landing date.

### Successful Drone Ship Landing with Payload between 4000 and 6000
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT "Booster_Version" FROM SPACEXTABLE WHERE "Landing_Outcome" = 'Success (drone ship)' AND "Payload_Mass__kg_" > 4000 AND "Payload_Mass__kg_" < 6000;`
- Screenshot: SQL output
- Explanation: List boosters.

### Total Number of Successful and Failure Mission Outcomes
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT "Mission_Outcome", COUNT(*) FROM SPACEXTABLE GROUP BY "Mission_Outcome";`
- Screenshot: SQL output
- Explanation: Show counts.

### Boosters Carried Maximum Payload
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT "Booster_Version" FROM SPACEXTABLE WHERE "Payload_Mass__kg_" = (SELECT MAX("Payload_Mass__kg_") FROM SPACEXTABLE);`
- Screenshot: SQL output
- Explanation: List boosters.

### 2015 Launch Records
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT substr(Date, 6,2) AS month, "Landing_Outcome", "Booster_Version", "Launch_Site" FROM SPACEXTABLE WHERE "Landing_Outcome" = 'Failure (drone ship)' AND substr(Date,0,5) = '2015';`
- Screenshot: SQL output
- Explanation: List the failed drone ship landings in 2015 with their month, booster version, and launch site.

### Rank Landing Outcomes Between 2010-06-04 and 2017-03-20
- Notebook: jupyter-labs-eda-sql-coursera_sqllite.ipynb
- Code: `%sql SELECT "Landing_Outcome", COUNT(*) as outcome_count FROM SPACEXTABLE WHERE Date BETWEEN '2010-06-04' AND '2017-03-20' GROUP BY "Landing_Outcome" ORDER BY outcome_count DESC;`
- Screenshot: SQL output
- Explanation: Rank outcomes.
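
To check a query outside the notebook's `%sql` magic, it can be run with Python's built-in `sqlite3` module. A minimal sketch using the ranking query above: the in-memory table holds only the columns the query needs, and every row is hypothetical.

```python
import sqlite3

# In-memory database with a few hypothetical rows; only the columns used
# by the query are created, and all values are illustrative.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE SPACEXTABLE ("Date" TEXT, "Launch_Site" TEXT, "Landing_Outcome" TEXT)'
)
conn.executemany(
    "INSERT INTO SPACEXTABLE VALUES (?, ?, ?)",
    [
        ("2012-01-01", "Site A", "Failure (drone ship)"),
        ("2013-01-01", "Site A", "Failure (drone ship)"),
        ("2014-01-01", "Site B", "Failure (drone ship)"),
        ("2015-01-01", "Site A", "Success (ground pad)"),
        ("2016-01-01", "Site B", "Success (ground pad)"),
        ("2017-01-01", "Site B", "No attempt"),
        ("2018-01-01", "Site A", "Success (ground pad)"),  # outside the date range
    ],
)

ranking_sql = """
SELECT "Landing_Outcome", COUNT(*) AS outcome_count
FROM SPACEXTABLE
WHERE Date BETWEEN '2010-06-04' AND '2017-03-20'
GROUP BY "Landing_Outcome"
ORDER BY outcome_count DESC;
"""
ranked = conn.execute(ranking_sql).fetchall()
for outcome, count in ranked:
    print(f"{outcome}: {count}")

conn.close()
```

Because the dates are stored as ISO-formatted text, `BETWEEN` compares them correctly as strings; the 2018 row is excluded from the counts.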


# Section 3: Folium Map Screenshots
## Instructions:
- For each Folium map screenshot, run the relevant cell in lab_jupyter_launch_site_location.ipynb
- Take screenshots as directed

### Folium Map Screenshot 1
- Title: Global Launch Sites Map
- Cell: Folium map with all launch site markers
- Screenshot: Show all launch sites
- Explanation: Describe findings

### Folium Map Screenshot 2
- Title: Launch Outcomes Map
- Cell: Folium map with color-labeled outcomes
- Screenshot: Show outcome markers
- Explanation: Describe findings

### Folium Map Screenshot 3
- Title: Launch Site Proximity Map
- Cell: Folium map with lines/distances to proximities
- Screenshot: Show proximity lines
- Explanation: Describe findings


# Section 4: Dashboard Screenshots
## Instructions:
- For each dashboard screenshot, run the relevant cell in your Plotly Dash notebook
- Take screenshots as directed

### Dashboard Screenshot 1
- Title: Launch Success Count Pie Chart
- Screenshot: Pie chart for all sites
- Explanation: Describe findings

### Dashboard Screenshot 2
- Title: Highest Success Ratio Site Pie Chart
- Screenshot: Pie chart for highest success site
- Explanation: Describe findings

### Dashboard Screenshot 3
- Title: Payload vs. Launch Outcome Scatter Plot
- Screenshot: Scatter plot with range slider
- Explanation: Describe findings


# Section 5: Classification Accuracy and Confusion Matrix
## Instructions:
- Run the relevant cells in SpaceX_Machine Learning Prediction_Part_5.ipynb
- Take screenshots as directed

### Classification Accuracy
- Cell: Bar chart comparing model accuracies
- Screenshot: Bar chart
- Explanation: State which model has the highest accuracy

### Confusion Matrix
- Cell: Confusion matrix for best model
- Screenshot: Confusion matrix
- Explanation: Describe results
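
When describing the confusion matrix, it helps to name its four cells explicitly. A minimal sketch with scikit-learn, using hypothetical labels in which 1 means the first stage landed and 0 means it did not:

```python
from sklearn.metrics import confusion_matrix

# Hypothetical labels: 1 = landed, 0 = did not land.
y_true = [1, 1, 1, 0, 0, 1]
y_pred = [1, 1, 0, 0, 1, 1]

# Rows are true classes, columns are predicted classes, in label order [0, 1].
cm = confusion_matrix(y_true, y_pred, labels=[0, 1])
tn, fp, fn, tp = cm.ravel()

print(cm)
print(f"TN={tn} FP={fp} FN={fn} TP={tp}")
```

False positives (predicted landing, actual failure) and false negatives are the cells worth calling out in the slide explanation.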


# Conclusions
## Instructions:
- Summarize main points and findings

## Markdown Example:
### Conclusions
- Point 1: [Insert]
- Point 2: [Insert]
- Point 3: [Insert]
- Point 4: [Insert]


# Appendix
## Instructions:
- Include any relevant assets like Python code snippets, SQL queries, charts, Notebook outputs, or data sets that you may have created during this project

## Markdown Example:
### Appendix
- Python code snippets
- SQL queries
- Charts and outputs
- Links to datasets and notebooks
