**Introduction** 
This project examines a live database that contains every fire call in the Seattle area from the early 2000s to date. Each call has the following documented: address, type, datetime, latitude, longitude, report location and incident number. The purpose of this project was to find the predominant type of Seattle fire call during the year of Washington’s worst wildfire season and examine that type in the surrounding years to find trends and potential indicators of a significantly worse wildfire season. 

**Data selected for analysis:** 
I selected this data set: Seattle Real Time Fire 911 Calls
Link: https://data.seattle.gov/Public-Safety/Seattle-Real-Time-Fire-911-Calls/kzjm-xkqj 

**Background on Wildfire - Causes and Washington State History**
Washington state’s worst wildfire season to date was in 2015. The state’s typical fire season runs from July through September. I found little research on Seattle specifically, so my background research does not account for how Seattle’s landscape and infrastructure differ from the rest of Washington state. According to previous research, the main causes of fire in indoor settings are heat-producing appliances, open flame ignition sources, electrical malfunction, smoking and static electricity (https://fire.nv.gov/uploadedfiles/firenvgov/content/bureaus/FST/2-ifipp-FHsm.pdf). Other common causes specific to homes include electric appliance malfunctions, outlets, misused storage space, heating equipment malfunction and kitchen appliance failure (https://www.redcross.org/get-help/how-to-prepare-for-emergencies/types-of-emergencies/fire/is-your-home-a-fire-hazard.html#:~:text=Heating%20equipment%2C%20like%20space%20heaters,from%20flammable%20materials%20or%20items.). Outdoor fires are most commonly rubbish fires and natural vegetation fires (https://www.usfa.fema.gov/downloads/pdf/statistics/v9i2.pdf). 

**Research questions and Methodology**
Because Washington state’s worst wildfire season to date was in 2015, I wanted to examine the types of Seattle fire calls in the 5 years leading up to 2015 and the 5 years after. My main research questions were:

How many people were affected by the worst wildfire season in the Seattle area? 

What was the leading cause of fires in the Seattle area in the year 2015?

Do we see the same leading causes of fires in the Seattle area in the surrounding years as in the year 2015?

Were there upward trends in the most prevalent type of fire call leading up to the year 2015?

Once I had my research questions narrowed down, I found the top 5 leading causes of fire calls in the Seattle area in the year 2015:


In [16]:
import pandas as pd
import numpy as np

In [17]:
df = pd.read_csv('2015_calls.csv')

In [18]:
df_types = df.groupby("Type")

In [19]:
df_types_counted = df_types.size()

In [20]:
df_types_final = df_types_counted.to_frame()

In [21]:
df_types_final = df_types_final.rename(columns= {0: 'Frequency'})

In [22]:
df_types_final.sort_values(by='Frequency', ascending=False)

Type,Frequency
Aid Response,50622
Medic Response,17448
Trans to AMR,5175
Auto Fire Alarm,4617
MVI - Motor Vehicle Incident,2614
...,...
"Mutual Aid, Tech Res",1
"Mutual Aid, Hazmat",1
ANTIB - Antibiotic Delivery,1
"Mutual Aid, Adv. Life",1

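The grouping, renaming and sorting steps above can also be written more compactly. The following is only a sketch: it assumes the call type lives in a column named `Type` (as in the cells above), the helper name `top_call_types` is hypothetical, and the rows are made-up toy data rather than the 2015 file.

```python
import pandas as pd


def top_call_types(calls: pd.DataFrame, n: int = 5) -> pd.DataFrame:
    """Return the n most frequent values of the 'Type' column with their counts."""
    # value_counts() counts each distinct type and sorts in descending order.
    counts = calls["Type"].value_counts()
    # Name the index and the count column explicitly, then keep the top n rows.
    return counts.head(n).rename_axis("Type").reset_index(name="Frequency")


# Toy rows for illustration only; these are NOT rows from the Seattle dataset.
toy = pd.DataFrame({
    "Type": [
        "Aid Response", "Medic Response", "Aid Response",
        "Auto Fire Alarm", "Aid Response", "Medic Response",
    ]
})

top = top_call_types(toy, n=2)
print(top)
```

On the real file, the same helper would be called on the DataFrame returned by `pd.read_csv('2015_calls.csv')`.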

Aid Response was the most frequent type of fire call in 2015. I interpreted this type to mean that firefighters were providing direct non-medical aid to people already affected by a fire in the area, so its frequency gives a rough sense of how many people may have been affected by the increase in fires that year. Guided by my research questions, I looked at Aid Response calls across the years 2010-2020 to compare them to 2015 and check for trends. A progressive upward trend in the 5 years leading up to 2015 could indicate that a steady increase in Aid Response calls precedes a worse wildfire season. A downward trend would suggest the opposite, and no trend would suggest no relationship. My hypothesis was that the number of Aid Response calls rose progressively in the years leading up to the worst wildfire season. 

To find the trend, I loaded the data into Tableau, cleaned it, and counted the number of Aid Response calls for each year. I then entered those counts into an Excel sheet to display the data and create visualizations. ![Data visualization 1.PNG](attachment:b891fd8f-99f3-4ed9-b5b8-0f0a23d82f31.PNG) 
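The yearly counts were produced in Tableau, but the same aggregation could be sketched in pandas. This is not the workflow actually used for the report: the column names `Type` and `Datetime` are assumptions based on the fields listed in the introduction, the helper `yearly_type_counts` is hypothetical, and the rows below are toy data.

```python
import pandas as pd


def yearly_type_counts(calls: pd.DataFrame, call_type: str = "Aid Response",
                       start: int = 2010, end: int = 2020) -> pd.Series:
    """Count calls of one type per calendar year, filling empty years with 0."""
    # Unparseable or missing timestamps become NaT instead of raising an error.
    parsed = pd.to_datetime(calls["Datetime"], errors="coerce")
    mask = calls["Type"].eq(call_type) & parsed.notna()
    years = parsed[mask].dt.year
    # Reindex so that every year in the window appears, even with zero calls.
    counts = years.value_counts().reindex(range(start, end + 1), fill_value=0)
    counts.index.name = "Year"
    return counts.rename("Frequency")


# Toy rows for illustration only; these are NOT rows from the Seattle dataset.
toy = pd.DataFrame({
    "Type": ["Aid Response", "Aid Response", "Medic Response",
             "Aid Response", "Aid Response"],
    "Datetime": ["2014-07-01 12:00:00", "2015-08-02 09:30:00",
                 "2015-08-03 10:00:00", "2015-09-04 18:45:00", None],
})

counts = yearly_type_counts(toy)
print(counts)
```

Rows with a missing timestamp are dropped rather than assigned to a year, which is one possible cleaning choice among several.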


**Findings**
The minimum yearly frequency of Aid Response was 6, and the maximum was 50622. The maximum of the decade did in fact occur in the year 2015. 
The mean yearly frequency of Aid Response over the decade was 21661, which is less than half of the 2015 frequency. The wide gap between the minimum, the mean and the maximum indicates the presence of outliers in the data. 
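As a small worked check of these summary figures, the sketch below shows how the minimum, maximum, mean and peak year could be computed from a Series of yearly counts, and how the reported mean compares to the reported maximum. The helper `summarize_counts` is hypothetical and the toy Series is made up; only the two numbers 21661 and 50622 come from the findings above.

```python
import pandas as pd


def summarize_counts(counts: pd.Series) -> dict:
    """Summarize a Series of yearly counts that is indexed by year."""
    return {
        "min": int(counts.min()),
        "max": int(counts.max()),
        "mean": float(counts.mean()),
        "peak_year": int(counts.idxmax()),
        "mean_to_max": float(counts.mean() / counts.max()),
    }


# Made-up yearly counts for illustration only; NOT the report's data.
toy_counts = pd.Series([10, 40, 100], index=[2014, 2015, 2016])
summary = summarize_counts(toy_counts)
print(summary)

# Ratio of the reported decade mean to the reported 2015 maximum.
reported_ratio = 21661 / 50622
print(round(reported_ratio, 2))
```

The ratio comes out at roughly 0.43, so the decade mean is somewhat under half of the 2015 value.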

The graph shows an upward trend for the 5 years leading up to the wildfire event, and a sharp downward trend right after. These findings suggest a correlation between the frequency of Aid Response calls and worsening wildfire seasons, but this relationship may be spurious and needs further examination before any causal link between the two can be claimed.
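The trend here was read off the graph. One way to put a number on it would be a least-squares slope over the years before and after the peak, sketched below. This is an assumed technique, not the method used in this report; the helper `linear_slope` is hypothetical and the counts are toy values. With only a handful of yearly points, the slope is a descriptive check and not a significance test.

```python
import numpy as np


def linear_slope(years, counts) -> float:
    """Least-squares slope of counts against years (change in calls per year)."""
    x = np.asarray(years, dtype=float)
    y = np.asarray(counts, dtype=float)
    # Center the years so the degree-1 fit is numerically well conditioned.
    slope, _intercept = np.polyfit(x - x.mean(), y, deg=1)
    return float(slope)


# Made-up counts for illustration only; NOT the report's yearly figures.
slope_before = linear_slope([2010, 2011, 2012, 2013, 2014, 2015],
                            [10, 20, 30, 40, 50, 60])
slope_after = linear_slope([2015, 2016, 2017], [60, 30, 0])

print(slope_before)  # positive: counts rise toward the peak year
print(slope_after)   # negative: counts fall after the peak year
```

A positive slope before the peak and a negative slope after it would match the pattern described above, though it would say nothing about why the counts changed.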

**Discussion and Limitations**

Limitation 1 - Washington Landscape 
The urban landscape of Seattle is not representative of the settings in which wildfires occur in Washington, which are mainly forested or arid areas. Therefore, it is not within the capabilities of this report to say that the data in this set can necessarily relate to mass wildfire events in the state. 

Limitation 2 - Discrepancies in Number of Types in Dataset
In the dataset, there is a sudden drop after 2015 in the number of rows with the type ‘Aid Response’. By looking at the distribution of the type values, I found that this is due to the addition of more granular types, so calls that were formerly classified as Aid Response could now fall under another type. This leads to discrepancies when analyzing the data longitudinally. 
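A change in the type taxonomy like this one can be spotted by counting how many distinct types appear in each year. The sketch below assumes columns named `Type` and `Datetime`; the helper `distinct_types_per_year` and the subtype label in the toy rows are hypothetical and do not come from the dataset.

```python
import pandas as pd


def distinct_types_per_year(calls: pd.DataFrame) -> pd.Series:
    """Number of distinct call types observed in each calendar year."""
    parsed = pd.to_datetime(calls["Datetime"], errors="coerce")
    has_date = parsed.notna()
    # Keep only rows with a usable timestamp, then attach the year.
    valid = calls.loc[has_date, ["Type"]].copy()
    valid["Year"] = parsed[has_date].dt.year
    return valid.groupby("Year")["Type"].nunique().rename("DistinctTypes")


# Toy rows for illustration only; "Hypothetical Subtype A" is an invented label.
toy = pd.DataFrame({
    "Type": ["Aid Response", "Aid Response", "Medic Response",
             "Aid Response", "Hypothetical Subtype A", "Medic Response"],
    "Datetime": ["2015-07-01 08:00:00", "2015-07-02 09:00:00",
                 "2015-07-03 10:00:00", "2016-07-01 08:00:00",
                 "2016-07-02 09:00:00", "2016-07-03 10:00:00"],
})

types_per_year = distinct_types_per_year(toy)
print(types_per_year)
```

A jump in the number of distinct types from one year to the next would be a hint that categories were split or added, and that counts for a single type are not directly comparable across that boundary.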

Limitation 3 - Lack of consistency of qualitative codes
Looking further into the type values, it is evident that some of them signify the cause of the fire event, whereas others describe the type of response provided by responders. Since I was trying to relate the fire call types to the incidence of wildfires, it was difficult to tell whether the type with the highest frequency was a cause or an effect of wildfires. 

All things considered, this report is an in-depth look at the frequency of fire calls that were aid responses. It points to a possible relationship between aid response fire calls and the increased severity of wildfire seasons, which the data cannot establish as causal. The limitations that I identified show that this data set is not representative of the landscape of Washington state as a whole, so further investigation with more comprehensive data is needed to properly assess the link between aid response calls and wildfires. 

