![alt text](http://www.civictechs.com/wp-content/uploads/2019/06/instruct5.jpg)

# How to Analyze Time-Based Data 

---



This guide is for novice users interested in analyzing data that changes over time. Organizations, LA County, and cities collect a variety of **time-based** data. Time-based data has at least one column containing a year, month, or day. In this example, we'll analyze a dataset on when Los Angeles artists created public projects, like the mural above. This gives us a better sense of the history of public art in the city and of the barriers to its preservation.

While programming experience helps for this instructable, it is not required. *(Please see our [first](https://colab.research.google.com/drive/1102rYgCZMWIPa0HdezbiiEx-t5Ikct0s#scrollTo=bu7i1hbHvGzW) and [second](https://colab.research.google.com/drive/1QKoElHpzqC0wf7T4oBFbZ4QQXgXRSXMr#scrollTo=w4D-Jd8tgvBQ) instructables for information on the tools used in this exercise, and the [final](https://colab.research.google.com/drive/1NyiS1KsojrsGxBSf5zxeil-M4R_ffD-2#scrollTo=l6t7XEUgDGZY) instructable for information on APIs.)*

# Step 1: Gather and Understand Ingredients Used in This Notebook


*  A dataset from [LA Counts](https://data.lacounty.gov/api/views/9qv9-dayy/rows.csv) on public art in the City of Los Angeles. We selected this file because it is in CSV format and has a field for year. Your dataset should look similar, although it may store time in a different format, such as month and year or a full date (e.g. 03/01/2019 for March 1, 2019). 
*  A Jupyter Notebook like this one, hosted on [Google Colab](http://colab.research.google.com/). 
*   Free Python libraries ([numpy](http://www.numpy.org/), [pandas](https://pandas.pydata.org/), and [plotly](https://plot.ly/python/)). These are accessible within Jupyter Notebooks, so you don't need to download them. 
*   Your smarts! 
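
If your own dataset stores a full date or a month and year instead of a plain year, one possible way to get back to a year field is sketched below with pandas. The sample rows and the column names `full_date` and `month_year` are made up for illustration and are not part of the LA Counts file.

```python
import pandas as pd

# Hypothetical sample rows; these column names are invented for this sketch.
sample = pd.DataFrame({
    "full_date": ["03/01/2019", "11/15/2005", "07/04/1976"],
    "month_year": ["March 2019", "November 2005", "July 1976"],
})

# Parse each text column with an explicit format so pandas does not have to guess.
full_dates = pd.to_datetime(sample["full_date"], format="%m/%d/%Y")
month_years = pd.to_datetime(sample["month_year"], format="%B %Y")

# Pull out the year so either format ends up looking like a "year" field.
sample["year_from_date"] = full_dates.dt.year
sample["year_from_month"] = month_years.dt.year

print(sample)
```

Once the year is in its own column, the rest of this notebook should work the same way as it does for the public art dataset.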

# Step 2: Load and Show Data as a Table

The first step is to load data into this Jupyter Notebook. A Jupyter Notebook is an open-source application that runs in your web browser. It can contain sections of live code, data visualizations, and text. 

1. Register for a Plot.ly account at https://plot.ly/feed – click on “sign up.” 
2. Generate an API key at https://plot.ly/settings/api – copy and paste the “API Key” and “Username” fields to the Jupyter Notebook in the appropriate place. 
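
If you would rather not paste your API key directly into a shared notebook, one option is to read it from environment variables. This is only a sketch: the helper `load_plotly_credentials` and the variable names `PLOTLY_USERNAME` and `PLOTLY_API_KEY` are hypothetical names chosen for this example, not something Plot.ly defines.

```python
import os


def load_plotly_credentials(env=os.environ):
    """Return Plot.ly credentials read from environment variables.

    PLOTLY_USERNAME and PLOTLY_API_KEY are hypothetical names for this sketch.
    """
    username = env.get("PLOTLY_USERNAME")
    api_key = env.get("PLOTLY_API_KEY")
    if not username or not api_key:
        # Fail early with a readable message instead of sending empty credentials.
        raise KeyError("Set PLOTLY_USERNAME and PLOTLY_API_KEY before running the notebook.")
    return {"username": username, "api_key": api_key}
```

The returned dictionary could then be passed along as `tl.set_credentials_file(**load_plotly_credentials())` in place of typed-in values.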

In [0]:
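# Note: plotly.plotly and plotly.tools.set_credentials_file are part of plotly versions before 4.0.
# Later versions moved them to the separate chart-studio package (chart_studio.plotly, chart_studio.tools).
import plotly.plotly as py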
import plotly.figure_factory as ff
import plotly.tools as tl
import pandas as pd

data = pd.read_csv("https://data.lacounty.gov/api/views/9qv9-dayy/rows.csv")

# Replace the username and api_key below with your own Plot.ly credentials from steps 1 and 2 above.

tl.set_credentials_file(username='aschrock', api_key='CWwy1WgujKrPK4Z1J0Oh')

# Select columns 0 (Artwork Title), 6 (Year Created), and 7 (Artwork Type) from the dataset to show in a table.

newdata = data.iloc[:, [0,6,7]]

table = ff.create_table(newdata)
py.iplot(table, filename='Artwork in LA')
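
Picking columns by position only works if you know which number belongs to which field. A small sketch of how you might check is below. It uses a tiny made-up table rather than the real download; the column names "Artwork Title", "Year Created", and "Artwork Type" follow the comments in the cell above, while "Artist" and all of the rows are invented, and the exact headers in the real CSV may be spelled differently.

```python
import pandas as pd

# Tiny made-up stand-in for the downloaded CSV; the real file has more columns and rows.
data = pd.DataFrame({
    "Artwork Title": ["Mural A", "Sculpture B", "Mural C"],
    "Artist": ["Artist 1", "Artist 2", "Artist 3"],
    "Year Created": [1984, 2006, 2006],
    "Artwork Type": ["Mural", "Sculpture", "Mural"],
})

# List each column with its position to confirm which index holds which field.
for position, name in enumerate(data.columns):
    print(position, name)

# Selecting by position and selecting by name give the same table here.
by_position = data.iloc[:, [0, 2, 3]]
by_name = data[["Artwork Title", "Year Created", "Artwork Type"]]

print(by_name.head())
```

Selecting by name is a little longer to type, but it keeps working if the data provider later adds or reorders columns.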

# Step 3: Visualize a Variable Changing over Time 

Now that we have loaded our CSV file, we can use some simple Python code to chart how the number of public artworks varies by "Year Created". 

In [0]:
import plotly.plotly as py
import plotly.figure_factory as ff
import plotly.tools as tl
import pandas as pd
import plotly.graph_objs as go

# *** Change the URL in quotes below to your CSV file location *** 

data = pd.read_csv("https://data.lacounty.gov/api/views/9qv9-dayy/rows.csv")

# *** Change username and api_key to your own, found at https://plot.ly/settings/api 

tl.set_credentials_file(username='aschrock', api_key='CWwy1WgujKrPK4Z1J0Oh')

# Remember, column 6 is the year the artwork was created.

x = data.iloc[:, 6]

# Create the histogram using Plotly. 
# A new variable name is used so the table stored in `data` is not overwritten.

traces = [go.Histogram(x=x)]

# Display the histogram on the screen. 

py.iplot(traces, filename='Public Artwork in LA by Year')
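
To see the numbers behind a histogram like the one above, you could count the artworks per year directly with pandas. The sketch below uses a handful of made-up years in place of the real "Year Created" column, including one missing value, since real datasets often have blanks.

```python
import pandas as pd

# Made-up years standing in for column 6 ("Year Created") of the real dataset.
years = pd.Series([1984, 2006, 2006, None, 1992, 2006, 1984], name="Year Created")

# Drop missing years, convert to whole numbers, then count artworks per year.
counts = years.dropna().astype(int).value_counts().sort_index()

print(counts)
```

Keep in mind that Plotly may group several years into a single bar, so the bar heights in the chart will not always match these per-year counts one to one.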


# Step 4: What Next?

After graphing the data, you've learned quite a bit about when public artworks in Los Angeles were created. 

*   You could create a map of where these public artworks are! Who has access to the arts in the city of Los Angeles? 
*   You might get curious about what the timeline does not show. Why do the years between 1945 and 1954 have no public artworks entered? 
*   You could overlay this dataset with another one to look at how funding of public artwork has changed over time. Is the spike in artwork starting in 2005 a result of greater funding for the arts? 
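
As a starting point for the question about gaps in the timeline, here is a rough sketch of how you might list the years that have no artworks at all. The years below are made up for illustration; you would swap in the real "Year Created" column after dropping any blank values.

```python
import pandas as pd

# Made-up years for illustration; replace with the real "Year Created" values.
years = pd.Series([1940, 1944, 1955, 1955, 1957])

# Count artworks per year, then reindex over every year in the range,
# filling years that never appear with a count of zero.
counts = years.value_counts().sort_index()
all_years = list(range(int(years.min()), int(years.max()) + 1))
counts = counts.reindex(all_years, fill_value=0)

# Years with a zero count are the gaps in the timeline.
missing_years = counts[counts == 0].index.tolist()
print(missing_years)
```

A list like this tells you where the gaps are, not why they exist. A gap could mean no art was made, or simply that no records were entered for those years.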

Have fun with your data analysis, and come back for the [next instructable](https://colab.research.google.com/drive/1wqICfSgtIzZU5RIbPY8m00LTMab8AL4t#scrollTo=TuYYxrnr7m6J)! 