# **BEACO$_{2}$N Notebook 3b: Exploring Wildfires in August 2021**

### Learning Outcomes
Working through this notebook, you will learn about:
1. How CO$_{2}$, NO, and PM$_{2.5}$ concentrations change throughout a wildfire period
1. Importing data
1. Time series analysis



## Table of Contents

1. Wildfires in August 2021  
> 1.1 Loading in Packages  <br>

2. A Guide to Time Series  

3. Pollutant Exploration  
> 3.1 Select and Load Your Pollutant Data  <br>
> 3.2 Data Visualization  <br>

4. Conclusion


*Note: In this notebook, there are some more advanced topics that are "optional". This means you can just read over these sections; don't worry about fully understanding these parts unless you are really interested. They may be useful later in the course, but for now they are not necessary, so feel free to just skim the parts labelled "Optional"!*


<hr style="border: 2px solid #003262">
<hr style="border: 2px solid #C9B676">

## 1. Wildfires in August 2021
Now that we've learned about the Orange Sky wildfire event of September 9th, 2020, let's look at other wildfire events. California experienced *numerous* wildfires in August of 2021.

**Q1: Do a Google search to see what fires may have been affecting the air quality of the Bay Area in August 2021. Write your answer in a few sentences in the text cell below.**



*Double click on this text cell to edit this cell and replace this sentence
with your answer.*

As in the Wildfires *In Class* notebook, you will now get a chance to explore how wildfires affected several different pollutants in **August 2021**. Feel free to have that notebook open to help with this one!


### 1.1 Loading in Packages

Python requires us to load packages before we can use them. We'll be using the same packages we loaded in the *In Class* notebook we went through together; however, since this is a separate notebook from the *In Class* notebook, we need to load them again. **Run the cell below.**

In [None]:
## RUN THIS CELL

# Packages used for visualizations
import matplotlib
import matplotlib.pyplot as plt
import seaborn as sns

# Numerical computation
import numpy as np
from decimal import Decimal

# Manipulating data in the form of Series or DataFrames
import pandas as pd

# Manipulating fields that are date or time
import datetime
from datetime import time

Since we're going to be creating visualizations in this notebook as well, it can be easiest to set some parameters in advance. Consider this like setting the stage for a show, so everything will be shown in the same view. **Run the cell below.** If any warning pops up, press "run anyway".

If you get an error saying "matplotlib is not defined", that means you haven't run the import cell above. Run that cell *first*, then run the one below.

In [None]:
# Using this cell, we will be defining some matplotlib parameters for the plots that we will be creating.
matplotlib.rcParams['axes.linewidth'] = 1
matplotlib.rcParams['lines.linewidth'] = 3
matplotlib.rcParams['ytick.labelsize'] = 20
matplotlib.rcParams['xtick.labelsize'] = 20
matplotlib.rcParams['axes.titlesize'] = 20
matplotlib.rcParams['axes.labelsize'] = 20
plt.rcParams['figure.figsize'] = [20, 14]
plt.rcParams["axes.labelweight"] = "bold"
np.set_printoptions(suppress=True)
plt.rcParams["savefig.facecolor"]= '#FFFFFF'
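
If you are curious how these settings behave, the optional sketch below shows one way to change a setting for a single block of code only. It uses `matplotlib.rc_context`, and the value `1` is just an illustrative choice, not something this notebook requires.

```python
import matplotlib

# Remember the current global line width so we can compare afterwards.
before = matplotlib.rcParams["lines.linewidth"]

# Inside the `with` block, the line width is temporarily set to 1.
with matplotlib.rc_context({"lines.linewidth": 1}):
    inside = matplotlib.rcParams["lines.linewidth"]

# Once the block ends, the earlier value is restored automatically.
after = matplotlib.rcParams["lines.linewidth"]

print(inside, before == after)
```

Any plot drawn inside the `with` block would use the temporary value, while plots drawn afterwards go back to the settings from the cell above.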

***
## 2. A Guide to Time Series

**Time Series Analysis** is a method used to examine data points collected sequentially *over time*. These data points, plotted on a graph with time on the horizontal (x-axis) and the variable of interest on the vertical (y-axis), show how a particular variable evolves over time.<br>Time series analysis can help predict future values using patterns from the past. It's especially useful for understanding trends, cycles, or any recurring patterns in data over a time period.
<br><br>
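
To make this concrete, here is a minimal sketch of a time series in pandas. The timestamps and values are made up for illustration (they are not BEACO$_{2}$N measurements), and `example_value` is a hypothetical name.

```python
import pandas as pd

# Made-up readings taken every 12 hours, for illustration only.
times = pd.to_datetime([
    "2021-08-01 00:00", "2021-08-01 12:00",
    "2021-08-02 00:00", "2021-08-02 12:00",
    "2021-08-03 00:00", "2021-08-03 12:00",
])
readings = pd.Series([10.0, 14.0, 30.0, 50.0, 20.0, 12.0],
                     index=times, name="example_value")

# Average the readings within each calendar day.
daily_mean = readings.resample("D").mean()

# Select only the readings from August 2nd by using the date as a label.
aug_2 = readings.loc["2021-08-02"]

print(daily_mean)
print(aug_2)
```

Because the index holds timestamps rather than plain text, pandas can group values by day and select a date directly, which is what makes spotting a spike during a wildfire period straightforward.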


You can use your Wildfires *In Class* notebook for reference, but for convenience, we've included the code that sets up the `data_2020` DataFrame below.

In [None]:
# We'll start by pulling in the 2020 data.
filename_2020 = "https://github.com/wintera71/BEACO2N-Modules/raw/refs/heads/main/Lesson%203:%20Wildfires/CSVs/filename_2020_in_class.csv"

# The function pd.read_csv helps us read the data into DataFrames.
data_2020  = pd.read_csv(filename_2020, on_bad_lines='skip', index_col = [0], parse_dates=[0])
data_2020[:10]

Now that you've looked at the steps in the example above, let's try to practice this on our own.
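
If you'd like to see what the `index_col` and `parse_dates` arguments in the example do, the optional sketch below reads a tiny made-up CSV from a string instead of a link. The column names `datetime` and `example_value` are hypothetical and may not match the columns in the real files.

```python
import io
import pandas as pd

# A tiny made-up CSV standing in for a downloaded file.
csv_text = """datetime,example_value
2021-08-01 00:00:00,10.5
2021-08-01 01:00:00,11.0
2021-08-01 02:00:00,12.5
"""

# index_col=[0] uses the first column as the row labels (the index).
# parse_dates=[0] converts that column from text into timestamps.
example = pd.read_csv(io.StringIO(csv_text), index_col=[0], parse_dates=[0])

print(type(example.index))  # shows what kind of index pandas created
print(example[:2])          # first two rows, like data_2020[:10] above
```

With a timestamp index in place, `example.index` can be passed straight to a plotting function as the x-axis.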

***
## 3. Pollutant Exploration

Each group will explore a different pollutant. We will be looking at **PM$_{2.5}$, CO$_{2}$, and NO**.
1. With your group, **select which pollutant or greenhouse gas** you'd like to investigate.
2. Locate the appropriate section under 3.1 (for example, if you are investigating PM$_{2.5}$, read the "PM$_{2.5}$ Data:" section). Take a few minutes to load the DataFrame. <br> *You will only need to load the data for the species your group is working with. Skip the other 2 species in section 3.1.*


### 3.1 Select and Load Your Pollutant Data

#### PM$_{2.5}$ Data:
You will need to fill in the **`filename_pm`** variable with this link:
>"https://github.com/wintera71/BEACO2N-Modules/raw/refs/heads/main/Lesson%203:%20Wildfires/CSVs/pm_2021.csv"

Make sure to include the quotation marks.

**You can reference the example above in Section 2: A Guide to Time Series to help you fill in the code.**

In [None]:
filename_pm = ## replace this comment with the filename
data_pm = ## replace this comment to read the data into a DataFrame

data_pm[:10]

#### CO$_{2}$ Data:
You will need to fill in the **`filename_co2`** variable with this link:
>"https://github.com/wintera71/BEACO2N-Modules/raw/refs/heads/main/Lesson%203:%20Wildfires/CSVs/co2_2021.csv"

Make sure to include the quotation marks.

**You can reference the example above in Section 2: A Guide to Time Series to help you fill in the code.**

In [None]:
filename_co2 = ## replace this comment with the filename
data_co2  = ## replace this comment to read the data into a DataFrame

data_co2[:10]

#### NO Data:
You will need to fill in the **`filename_no`** variable with this link:
>"https://github.com/wintera71/BEACO2N-Modules/raw/refs/heads/main/Lesson%203:%20Wildfires/CSVs/no_2021.csv"

Make sure to include the quotation marks.

**You can reference the example above in Section 2: A Guide to Time Series to help you fill in the code.**

In [None]:
filename_no = ## replace this comment with the filename
data_no  = ## replace this comment to read the data into a DataFrame

data_no[:10]

### 3.2 Data Visualization

Great job working with your data and turning it into a very helpful DataFrame! Now, it's your turn to explore the data!

**Q2: Create a time series plot for your species (*species concentration vs. time*)**. You can follow along with the example from the *In Class* notebook we walked through together.
>Follow this syntax: **`plt.plot(<dataframe_name>.index, <dataframe_name>["<column_name>"], color = "<color_name>")`**

It is good practice to **include units** in your axis labels. The units for the different species are listed below:

>PM$_{2.5}$: micrograms per cubic meter ($\mu$g/m$^3$)

>NO: parts per billion (ppb)

>CO$_{2}$: parts per million (ppm)
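
As a reminder of how the pieces fit together, here is a minimal sketch of a labelled time series plot. It uses a small made-up DataFrame, and the column name `example_value`, the color, and the label text are placeholders you would swap for your own species and units.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Made-up values for illustration; "example_value" is a hypothetical column.
example = pd.DataFrame(
    {"example_value": [10.0, 14.0, 30.0, 50.0, 20.0]},
    index=pd.to_datetime(["2021-08-01", "2021-08-02", "2021-08-03",
                          "2021-08-04", "2021-08-05"]),
)

plt.figure(figsize=(30, 8))
# Time (the index) goes on the x-axis, the measured value on the y-axis.
plt.plot(example.index, example["example_value"], color="purple")

# Label both axes (with units) and give the plot a title.
plt.xlabel("Date")
plt.ylabel("Example Value (units)")
plt.title("Example Time Series")
```

The same three labelling calls work for any of the pollutant DataFrames once the DataFrame name and column name are replaced.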



In [None]:
# Replace ... with your own code
plt.figure(figsize = (30, 8))
plt.plot(...)

# Adds an x-axis label: "Date"


# Adds a y-axis label: (Your Species and the units)


# Adds a title to our line plot: "<Your Species> Time Series"


**Q3: Talk to a classmate who investigated a different pollutant and compare plots. What differences or similarities do you see?**<br>Be sure to indicate which two species you're comparing.






*Double click on this text cell to edit this cell and replace this sentence
with your answer.*

**Q4: What did you learn about how wildfires affect concentrations of these species? What do you see in the plots that show you this?**

*Double click on this text cell to edit this cell and replace this sentence
with your answer.*

**Q5: Changes in these species concentrations during wildfire periods can cause health impacts. Summarize a few of these impacts.**

*Double click on this text cell to edit this cell and replace this sentence
with your answer.*


***
## 4. Conclusion
In this notebook, you learned about:
- Wildfire pollutants
- Importing data
- Visualizing data
  - Creating line plots from DataFrames
  - Labelling line plots
  - Understanding line plots and what they represent


#### You've finished the **Wildfires *Student Exploration* notebook**. Good job! Congratulations on finishing the **Wildfires** lesson!

***
***