Skip to content

In this mini data science tutorial our task is to predict reasons for 911 calls, given a fictitious 911 calls database. We'll build and test a Random Forest model using Python and scikit-learn.

License

Notifications You must be signed in to change notification settings

jcatanza/seattle-911

Repository files navigation

seattle-911

In this Jupyter notebook (seattle_911_calls.ipynb), we'll explore an alternative universe, in which the city of Seattle receives 911 calls for only four reasons: an accident involving beavers, a seal attack, a Marshawn Lynch sighting, or a hot latte spills in someone's lap.

We're given a data set consisting of the 911 call logs. Our task is to learn from this data to predict the reason for any 911 call. Working in Python, we'll learn how to examine the data via exploratory data analysis (EDA). Using the scikit-learn library, we'll build a Random Forest model and train it to predict the type of 911 call based on patterns in the data. We'll test our model by making predictions and evaluating its performance.

Along the way, you'll encounter and absorb a numberr of key concepts that are at the heart of the data science process. Ready? Come along, then. As Chris Cuomo would say: "Let's get after it!"

Material for a Medium post on a simple classification problem

Jupyter notebook seattle_911_calls.ipynb requires the input file 'seattle_911_calls.xlsx'

About

In this mini data science tutorial our task is to predict reasons for 911 calls, given a fictitious 911 calls database. We'll build and test a Random Forest model using Python and scikit-learn.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published