# 9F: Trash in Space

[![YouTube Video Link](https://i.postimg.cc/05xmRp5W/image.png)](https://youtube.com/clip/UgkxDUniIVYUtXLZHZWGzvYuM67A24MMZ_ot?si=MsqC8ibknMTITAlP)

In [None]:
# load the coursekata library
library(coursekata)

# get rid of scientific notation in this notebook
options(scipen = 999)

# load the data
SpaceLaunches <- read.csv("https://docs.google.com/spreadsheets/d/e/2PACX-1vTDl_AY0dCw8I1Ant_jIsfICy7uFdcfIur4aTHbtBvk3kkqWwnl9I49BQtHSFknhW02mxRXiMT4lili/pub?gid=1871387628&single=true&output=csv")
SpaceLaunches3 <- SpaceLaunches %>%
  filter(Year < 2020 & Country %in% c("United States", "Russia", "China"))
SpaceUSA <- read.csv("https://docs.google.com/spreadsheets/d/e/2PACX-1vTfjSDV7_clFIX0AwkBMHSLeIXCGmJGoMOm7F-Yd_jeuaQZCVFpLzQHnv7FLednLARzX0f-Eo1UlVzo/pub?gid=722119930&single=true&output=csv")

## 1.0: What do you notice? What do you wonder? (Data Talk, Optional)

<img src="https://www.visualcapitalist.com/wp-content/uploads/2023/07/Updated-Every-Space-Debris-orbiting-the-Earth-Visual-Capitalist-Preyash-Shah-1.jpg" alt="Visual Capitalist Space Debris graph" width="80%"/>

[Originally posted on Visual Capitalist](https://www.visualcapitalist.com/cp/space-debris-by-country/)

Although there are large-scale efforts to track space debris (that's part of the job of the [US Space Force](https://www.space-track.org/)), let's at least take a look at the stuff we launched up there **on purpose.** Some of these launches are now considered trash or will contribute to trash in the future!

## 2.0: How have space launches changed over time?

<div class="alert alert-block alert-info">
    
Run the code cell below to create a scatterplot of space launches from three countries (US, Russia, China) over time *before 2020*. 

</div>

In [None]:
# run this
gf_point(Launches ~ Year, data = filter(SpaceLaunches3, Country == "United States"), color = "darkgray") %>%
  gf_point(Launches ~ Year, data = filter(SpaceLaunches3, Country == "Russia"), color = "orange") %>%
  gf_point(Launches ~ Year, data = filter(SpaceLaunches3, Country == "China"), color = "red")


Possible questions to review the graph:

- *Based on the code*, which countries are shown in the graph?
- *Based on the code*, which color corresponds to which country?
- Who had the most space launches in 1980? How do you know?
- When did China begin its space launches?
- In the past 20 years, who has been responsible for the most space launches?

## 3.0: The US has a lot of Launches

NASA's space shuttle program ended in 2011. What happened to space launches since then in the US?

To help us investigate this question, we've included a small data set of just the US launches since 2011 called `SpaceUSA`. 


3.1 - Run the code below to see the data, then add code to create a scatterplot of the launches since 2011.

In [None]:
head(SpaceUSA)

# create a scatterplot of Launches by YearSince2011


3.2 - Let's try modeling the pattern we see in `Launches` by `YearSince2011` with a simple linear model. 

Find the best-fitting linear model and place it on the scatterplot.

3.3 - Write the equation for the best-fitting linear model using GLM notation.

3.4 - Run the code below to see what the model would predict for each year after 2011.

What would this model predict for the number of US space launches 13 years after 2011 (the year 2024)? Do you think this is a trustworthy prediction? Why or why not?

(Optional: Add that prediction to the scatterplot by piping `%>%` another `gf_point()` on.)
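
If you are not sure how to get a prediction for a value of X that is not in the data, here is a minimal sketch in base R. The data frame `toy`, its numbers, and the name `toy_model` are made up for illustration (they are not the `SpaceUSA` data). It shows two ways to get the same prediction: plugging into $b_0 + b_1X_i$ by hand, or using `predict()` with a `newdata` argument.

```r
# Made-up toy data for illustration only (NOT the SpaceUSA data)
toy <- data.frame(x = 1:6, y = c(5, 7, 8, 12, 13, 15))

# fit a simple linear model and pull out b0 and b1
toy_model <- lm(y ~ x, data = toy)
b0 <- coef(toy_model)[["(Intercept)"]]
b1 <- coef(toy_model)[["x"]]

# Way 1: plug the new x value into b0 + b1 * x by hand
by_hand <- b0 + b1 * 10

# Way 2: ask predict() for the prediction at a new x value
by_predict <- predict(toy_model, newdata = data.frame(x = 10))

by_hand
by_predict
```

Notice that x = 10 is outside the range of x values in the toy data (1 to 6), which mirrors the situation in the 2024 question.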

In [None]:
# Run this (assumes you saved your model from 3.2 as linear_model)
gf_point(Launches ~ YearSince2011, data = SpaceUSA) %>%
  gf_point(predict(linear_model) ~ YearSince2011, color = "slateblue") 

3.5 - Even though this is the "best-fitting" linear model, a lot of students look at this so-called "best" model and think it sucks. Why?

## 4.0: Can we do better than the best linear function?

You may have learned about exponential functions in the past where the $X_i$ is an exponent:

$$Y_i = e^{X_i}$$

If you haven't taken calculus yet, just think of $e$ as a special number like $\pi$. The value of $e$ (the exponential constant) is approximately 2.718. 

#### The important feature of an exponential function is that it can bend just like this: as x gets larger, the value of the function increases faster and faster.
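
To see this bending in numbers, here is a small sketch in base R. The x values and the linear function $1 + 2x$ are made up just for comparison. It looks at how much each function goes up for every one-unit step in x.

```r
# a few x values (made up for illustration)
x <- 0:5

# a linear function of x and the exponential function e^x
linear_values <- 1 + 2 * x
exp_values <- exp(x)

# diff() gives the change from one value to the next
linear_steps <- diff(linear_values)  # the same size at every step
exp_steps <- diff(exp_values)        # gets bigger at every step

linear_steps
exp_steps

# exp(1) is e itself, approximately 2.718
exp(1)
```

The linear steps stay the same size, while each exponential step is bigger than the one before it. That is what makes the curve bend upward.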

We can modify our typical GLM function to include an exponential element like this:

$$Y_i = b_0 + b_1e^{X_i} + e_i$$

(Note that $e$ is the exponential constant but $e_i$ is residual error!)
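
One way to think about this model: it is still a regular linear model, just with $e^{X_i}$ playing the role of the predictor instead of $X_i$. The sketch below uses made-up data (the data frame `toy` and the model names are hypothetical, not part of this notebook) to show that putting `exp(x)` inside the `lm()` formula gives the same $b_0$ and $b_1$ as first computing a new column of $e^{x}$ values and then fitting an ordinary linear model to that column.

```r
# Made-up toy data for illustration only (NOT the SpaceUSA data)
toy <- data.frame(x = 0:5, y = c(3, 4, 9, 22, 58, 151))

# Option 1: transform the predictor inside the formula
fit_formula <- lm(y ~ exp(x), data = toy)

# Option 2: compute e^x as its own column, then fit a plain linear model
toy$exp_x <- exp(toy$x)
fit_column <- lm(y ~ exp_x, data = toy)

# both options give the same b0 and b1
coef(fit_formula)
coef(fit_column)
```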

4.1 - Run the code below to find the best-fitting $b_0$ and $b_1$ for an exponential (rather than linear) model. Write the resulting equation in GLM notation.

In [None]:
exp_model <- lm(Launches ~ exp(YearSince2011), data = SpaceUSA)  # saved as exp_model for later use
exp_model

4.2 - Modify the graph below to place the predictions of the best-fitting exponential model on it.

In [None]:
# modify this code
gf_point(Launches ~ YearSince2011, data = SpaceUSA) %>%
  gf_point(predict(linear_model) ~ YearSince2011, color = "green4")


4.3 - What would this model predict for the number of US space launches 13 years after 2011 (the year 2024)? Do you think this is a trustworthy prediction? Why or why not?

Note: In R, we would code $e^2$ by writing `exp(2)`. How might you code $e^{13}$?

(Optional: Add that prediction to the scatterplot by piping `%>%` another `gf_point()` on.)

In [None]:
# this computes e to the 2nd power
# how would you modify this for the 13th power?
exp(2)

4.4 - How is the exponential model growing differently than the linear model?

## 5.0: How much error has been reduced?

5.1 - How much error in `Launches` is explained by the linear model? How about the exponential model?
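
As a reminder of what "error explained" means, here is a minimal sketch in base R using made-up data (the data frame `toy` and the name `toy_model` are hypothetical, not part of this notebook). The proportion of error explained (PRE) compares the error left over from the model (SS Error) to the error from the empty model that just predicts the mean (SS Total). This should correspond to the PRE column of a `supernova()` table, and it is the same number base R reports as R-squared.

```r
# Made-up toy data for illustration only (NOT the SpaceUSA data)
toy <- data.frame(x = 1:6, y = c(5, 7, 8, 12, 13, 15))
toy_model <- lm(y ~ x, data = toy)

# SS Total: squared error around the mean (the empty model)
ss_total <- sum((toy$y - mean(toy$y))^2)

# SS Error: squared error left over after using the model
ss_error <- sum(resid(toy_model)^2)

# PRE: the proportion of total error that the model explains
pre <- (ss_total - ss_error) / ss_total
pre

# base R reports the same proportion as R-squared
summary(toy_model)$r.squared
```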

In [None]:
# COMPLETE (assumes linear_model from 3.2 and exp_model from 4.1 have been saved)
supernova(linear_model)
supernova(exp_model)

5.2 - **Concluding thoughts:** As the number of space launches increases, what do you think will happen to the amount of space trash?


## Data Details

The launches counted in the data frames used in this Jupyter notebook include satellites, probes, landers, crewed spacecraft, and space station flight elements launched into Earth orbit or beyond. Original data from United Nations Office for Outer Space Affairs (2024) with major processing by [Our World in Data](https://ourworldindata.org/space-exploration-satellites).