![Callysto.ca Banner](https://github.com/callysto/curriculum-notebooks/blob/master/callysto-notebook-banner-top.jpg?raw=true)

# Callysto’s Weekly Data Visualization

## Weekly Title

### Recommended grade level: 6-12

### Instructions:
#### Step 1 (your only step): “Run” the cells to see the graphs
Click “Cell” and select “Run All.” This will import the data and run all the code to make this week's data visualizations (scroll to the top after you’ve run the cells). **You don’t need to do any coding**.

![instructions](https://github.com/callysto/data-viz-of-the-week/blob/pie-chart-religion/images/instructions.png?raw=true)

### About The Notebook:

Callysto's Weekly Data Visualization is a learning resource that helps Grades 5-12 teachers and students grow and develop data literacy skills. We do this by providing a data visualization, like a graph, and asking teachers and students to interpret it. This companion resource walks learners through how the data visualization is created and interpreted using the data science process. The steps of this process are listed below and applied to each weekly topic.

1. Question - What are we trying to answer? 
2. Gather - Find the data source(s) you will need. 
3. Organize - Arrange the data so that you can easily explore it. 
4. Explore - Examine the data to look for evidence to answer our question. This includes creating visualizations. 
5. Interpret - Explain how the evidence answers our question. 
6. Communicate - Reflect on the interpretation. 

## 1. Question

**How common is the use of decorative LED lights during the holidays?**

Light-emitting diode (LED) lights have become more common in households as LED technology has improved greatly over the last decade. LEDs have an advantage over the previously common halogen bulbs because they are much more energy efficient. So, how common do you think it is for households to use LED holiday lights?

## 2. Gather

The code below will import the Python programming libraries we need to gather and organize the data to answer our question.

In [1]:
import os
import pandas as pd
import plotly.express as px

This code will read in a comma-separated values (CSV) file containing Statistics Canada (Stats Can) [data](https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3810004801) about the use of holiday LEDs. This data was found as part of the [Christmas...by the numbers](https://www.statcan.gc.ca/eng/dai/smr08/2017/smr08_222_2017) data collection by Stats Can.

In [2]:
light_path = os.path.join('datasets', 'light_data.csv')
df = pd.read_csv(light_path)
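
To see what `pd.read_csv` does without needing the real file, here is a minimal sketch that reads a few rows from an in-memory string instead. The sample text is an assumption: it reuses the Canada values that appear in this notebook's output and keeps only three of the columns, so it is not the actual `light_data.csv`. The names `sample_csv` and `sample_df` are made up for this illustration.

```python
import io
import pandas as pd

# Hypothetical stand-in for light_data.csv: four rows, three columns only
sample_csv = """REF_DATE,GEO,VALUE
2011,Canada,33
2013,Canada,40
2015,Canada,41
2017,Canada,40
"""

# read_csv accepts a file path or a file-like object such as StringIO
sample_df = pd.read_csv(io.StringIO(sample_csv))

# Check the number of rows and columns, and the type pandas inferred for each column
print(sample_df.shape)
print(sample_df.dtypes)
```

Reading from a string like this can be a handy way to experiment with a small piece of a dataset before loading the full file.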

This final code cell in the "gather" section will let us take a final look at our data.

In [3]:
print(df.columns)
print(df.head())

Index(['REF_DATE', 'GEO', 'DGUID', 'Type of energy-saving light', 'UOM',
       'UOM_ID', 'SCALAR_FACTOR', 'SCALAR_ID', 'VECTOR', 'COORDINATE', 'VALUE',
       'STATUS', 'SYMBOL', 'TERMINATED', 'DECIMALS'],
      dtype='object')
   REF_DATE                        GEO           DGUID  \
0      2011                     Canada  2016A000011124   
1      2013                     Canada  2016A000011124   
2      2015                     Canada  2016A000011124   
3      2017                     Canada  2016A000011124   
4      2011  Newfoundland and Labrador     2016A000210   

                 Type of energy-saving light      UOM  UOM_ID SCALAR_FACTOR  \
0  Light emitting diode (LED) holiday lights  Percent     239         units   
1  Light emitting diode (LED) holiday lights  Percent     239         units   
2  Light emitting diode (LED) holiday lights  Percent     239         units   
3  Light emitting diode (LED) holiday lights  Percent     239         units   
4  Light emitting diode (LE

In [4]:
# Keep only the columns we need: the year, the region, and the percentage value
df = df[['REF_DATE', 'GEO', 'VALUE']]
df

|   | REF_DATE | GEO | VALUE |
|---|----------|-----|-------|
| 0 | 2011 | Canada | 33 |
| 1 | 2013 | Canada | 40 |
| 2 | 2015 | Canada | 41 |
| 3 | 2017 | Canada | 40 |
| 4 | 2011 | Newfoundland and Labrador | 40 |
| 5 | 2013 | Newfoundland and Labrador | 46 |
| 6 | 2015 | Newfoundland and Labrador | 48 |
| 7 | 2017 | Newfoundland and Labrador | 36 |
| 8 | 2011 | Prince Edward Island | 32 |
| 9 | 2013 | Prince Edward Island | 46 |


In [5]:
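# Give the columns friendlier names
df.columns = ['year', 'area', '% households using holiday LEDs']
# Reshape from long to wide: one row per area, one column per year
df_small = df.pivot(index='area', columns='year', values='% households using holiday LEDs').reset_index()
# Remove the leftover 'year' label from the column header
df_small.columns.name = ""
df_small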

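|   | area | 2011 | 2013 | 2015 | 2017 |
|---|------|------|------|------|------|
| 0 | Alberta | 29 | 36 | 42 | 42 |
| 1 | British Columbia | 32 | 43 | 41 | 40 |
| 2 | Canada | 33 | 40 | 41 | 40 |
| 3 | Manitoba | 39 | 37 | 44 | 41 |
| 4 | New Brunswick | 34 | 47 | 49 | 38 |
| 5 | Newfoundland and Labrador | 40 | 46 | 48 | 36 |
| 6 | Nova Scotia | 45 | 45 | 46 | 37 |
| 7 | Ontario | 34 | 38 | 41 | 39 |
| 8 | Prince Edward Island | 32 | 46 | 48 | 46 |
| 9 | Quebec | 32 | 41 | 38 | 41 |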
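
The pivot step above turns a "long" table (one row per year and area) into a "wide" one (one row per area, one column per year). Here is a minimal sketch of the same reshaping on four rows, using values that appear in this notebook's output. The names `sample` and `wide` are hypothetical and chosen only for this illustration.

```python
import pandas as pd

# Small sample in long form: one row per (year, area) pair
sample = pd.DataFrame({
    'year': [2011, 2013, 2011, 2013],
    'area': ['Canada', 'Canada',
             'Newfoundland and Labrador', 'Newfoundland and Labrador'],
    '% households using holiday LEDs': [33, 40, 40, 46],
})

# pivot: each unique area becomes a row and each unique year becomes a column
wide = sample.pivot(index='area', columns='year',
                    values='% households using holiday LEDs').reset_index()

# Clear the leftover 'year' label on the column header
wide.columns.name = ""
print(wide)
```

The wide form is often easier to read as a table, while the long form is the shape that plotting functions such as `px.line` usually expect.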


In [8]:
# Draw one line per area from the long-format data
fig = px.line(df, y='% households using holiday LEDs',
              x='year', color='area',
              title='Use of LED Lights During the Holidays')
# Show a marker at each survey year
fig.update_traces(mode='lines+markers')
fig.show()
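
The line chart shows each area's trend over time. As a rough numeric companion, the sketch below computes how many percentage points the share of households changed between 2011 and 2017 for three areas. The values are copied from the wide table earlier in this notebook; the `change` column name and the choice of areas are assumptions made for this illustration only.

```python
import pandas as pd

# Values copied from the wide table in this notebook (three areas only)
wide = pd.DataFrame({
    'area': ['Alberta', 'Canada', 'Nova Scotia'],
    2011: [29, 33, 45],
    2017: [42, 40, 37],
})

# Change in percentage points from 2011 to 2017 ('change' is a hypothetical name)
wide['change'] = wide[2017] - wide[2011]

# List the areas from largest increase to largest decrease
print(wide.sort_values('change', ascending=False))
```

A positive number means more households reported using LED holiday lights in 2017 than in 2011, and a negative number means fewer did.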