# Lumber Prices

* **Data:** `lumber-prices.csv`
* **Description:** Chicago lumber futures per 1000 board feet, in USD, as of some time in 2021. To keep things simple, we're going to say that this is the measurement over a month instead of just one date per month. Basically: each month, how much you'd pay if you bought 125 2x4 boards at the hardware store.
* **Source:** https://tradingeconomics.com/commodity/lumber
* **Columns of interest:**
    * `open` is the price at the start of the month
    * `high` is the highest price over the course of the month
    * `low` is the lowest price over the course of the month
    * `close` is the price at the end of the month

## Read in your data

In [48]:
import pandas as pd
df = pd.read_csv("lumber-prices.csv")
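
The outputs further down show that the price columns arrive as strings like `$1520.60` and that `date` is an ISO timestamp string. One option, sketched here, is to convert everything at load time. The helper name `load_prices` is made up for illustration, and the six rows are copied from this notebook's printed outputs as a stand-in for `lumber-prices.csv`, so treat this as a sketch rather than the assignment's intended approach.

```python
import io
import pandas as pd

# Stand-in for lumber-prices.csv: six rows copied from the outputs in this notebook.
CSV_TEXT = """open,high,low,close,date
$168.70,$171.80,$137.90,$148.10,2009-01-02T00:00:00
$158.10,$175.90,$138.00,$147.00,2009-02-02T00:00:00
$144.60,$179.80,$138.00,$171.30,2009-03-02T00:00:00
$1012.60,$1502.30,$1006.00,$1500.50,2021-04-01T00:00:00
$1520.60,$1711.20,$1201.00,$1309.50,2021-05-03T00:00:00
$1315.00,$1380.00,$705.00,$716.00,2021-06-01T00:00:00
"""

def load_prices(source):
    """Read the CSV, parse `date`, and turn "$123.45" strings into floats."""
    frame = pd.read_csv(source, parse_dates=["date"])
    for col in ["open", "high", "low", "close"]:
        # regex=False treats "$" as a literal character, not a regex anchor.
        frame[col] = frame[col].str.replace("$", "", regex=False).astype(float)
    return frame

prices = load_prices(io.StringIO(CSV_TEXT))
print(prices.dtypes)
```

With the real file, `load_prices("lumber-prices.csv")` should behave the same way, assuming the column names match the ones printed in this notebook.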

## What is the earliest month in this dataset?

We only have one reading per month, so you could also read this as "what is the earliest date in the dataset"

In [49]:
df.date.min()
# It looks like the earliest month is December 1996

'1996-12-09T00:00:00'

## What is the most recent month in this dataset?

We only have one reading per month, so you could also read this as "what is the latest date in the dataset"

In [50]:
df.date.max()

'2021-11-01T00:00:00'

In [51]:
# It looks like the most recent month is November 2021. 

## What 3 months had the highest high lumber price?

In [52]:
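# To be able to sort the high column, I remove the $ in front of every number and convert the column to a float.
# regex=False makes pandas treat '$' as a literal character instead of a regex end-of-string anchor.
df['high'] = df['high'].str.replace('$', '', regex=False)
df['high'] = df['high'].astype(float)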


In [53]:
# And then I sort the column and print the three highest prices. It looks like May, April and June of 2021 had
# the highest lumber prices.
df.sort_values('high', ascending=False).head(3)

Unnamed: 0,open,high,low,close,date
293,$1520.60,1711.2,$1201.00,$1309.50,2021-05-03T00:00:00
292,$1012.60,1502.3,$1006.00,$1500.50,2021-04-01T00:00:00
294,$1315.00,1380.0,$705.00,$716.00,2021-06-01T00:00:00


## What 3 months had the lowest low lumber price?

In [54]:
# Same cleanup for the low column: strip the literal $ and convert to a float.
df['low'] = df['low'].str.replace('$', '', regex=False)
df['low'] = df['low'].astype(float)


In [55]:
# Then I sort the column in descending order and print the last three rows, which are the three lowest prices.
df.sort_values('low', ascending=False).tail(3)
# The lowest lumber prices were in January, February and March of 2009. The financial crisis really hit the
# lumber industry hard, huh?

Unnamed: 0,open,high,low,close,date
147,$144.60,179.8,138.0,$171.30,2009-03-02T00:00:00
146,$158.10,175.9,138.0,$147.00,2009-02-02T00:00:00
145,$168.70,171.8,137.9,$148.10,2009-01-02T00:00:00


In [56]:
# It looks like I get the same three months if I take the three lowest values in the high column, but I wouldn't
# have known that if I hadn't checked both ways.
df.sort_values('high', ascending=False).tail(3)

Unnamed: 0,open,high,low,close,date
147,$144.60,179.8,138.0,$171.30,2009-03-02T00:00:00
146,$158.10,175.9,138.0,$147.00,2009-02-02T00:00:00
145,$168.70,171.8,137.9,$148.10,2009-01-02T00:00:00


## What is the median "high" lumber price in our dataset?

In [58]:
df.high.median()
# The median "high" lumber price is 329.65 dollars

329.65

## What is the median "low" lumber price in our dataset?

In [59]:
df.low.median()
# And the median "low" lumber price is 286.1 dollars

286.1

## Plot the maximum "high" lumber price on an annual basis.
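One way to approach this is to group by the calendar year of `date` and take the maximum of `high`. The sketch below assumes `date` has already been converted with `pd.to_datetime` and `high` is a float; the six sample rows are copied from the outputs above, and the function names are made up for illustration.

```python
import pandas as pd

# Stand-in rows copied from the outputs above (the real frame comes from lumber-prices.csv).
sample = pd.DataFrame({
    "date": pd.to_datetime(["2009-01-02", "2009-02-02", "2009-03-02",
                            "2021-04-01", "2021-05-03", "2021-06-01"]),
    "high": [171.8, 175.9, 179.8, 1502.3, 1711.2, 1380.0],
})

def annual_max_high(frame):
    """Return the highest `high` per calendar year, indexed by year."""
    return frame.groupby(frame["date"].dt.year)["high"].max()

def plot_annual_max_high(frame):
    """Draw the yearly maxima as a line chart (needs matplotlib) and return the Axes."""
    ax = annual_max_high(frame).plot(kind="line", marker="o",
                                     title="Maximum monthly high per year")
    ax.set_xlabel("Year")
    ax.set_ylabel("USD per 1,000 board feet")
    return ax

yearly_high = annual_max_high(sample)
print(yearly_high)
```

In a notebook cell, calling `plot_annual_max_high(df)` on the cleaned full dataset would draw the chart.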

## Plot the lowest "low" lumber price on an annual basis.
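The same grouping idea works here with `min()` on the `low` column. This is a minimal sketch on six sample rows copied from the outputs above, assuming `date` is a datetime and `low` is a float; a bar chart is just one reasonable choice, and `plot_yearly_low` is a hypothetical helper name.

```python
import pandas as pd

# Stand-in rows copied from the outputs above.
sample = pd.DataFrame({
    "date": pd.to_datetime(["2009-01-02", "2009-02-02", "2009-03-02",
                            "2021-04-01", "2021-05-03", "2021-06-01"]),
    "low": [137.9, 138.0, 138.0, 1006.0, 1201.0, 705.0],
})

# Lowest `low` within each calendar year.
yearly_low = sample.groupby(sample["date"].dt.year)["low"].min()
yearly_low.index.name = "year"

def plot_yearly_low(series):
    """Draw one bar per year (needs matplotlib) and return the Axes."""
    ax = series.plot(kind="bar", title="Lowest monthly low per year")
    ax.set_ylabel("USD per 1,000 board feet")
    return ax

print(yearly_low)
```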

## Plot the median "close" lumber price on a quarterly basis.
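For quarters, one option is to convert each date to a quarterly period with `dt.to_period("Q")` and group on that. The sketch assumes `close` has already had its `$` stripped and been converted to a float; the rows are the six sample months printed above, and `plot_quarterly_close` is a made-up helper name.

```python
import pandas as pd

# Stand-in rows copied from the outputs above, with `close` already numeric.
sample = pd.DataFrame({
    "date": pd.to_datetime(["2009-01-02", "2009-02-02", "2009-03-02",
                            "2021-04-01", "2021-05-03", "2021-06-01"]),
    "close": [148.1, 147.0, 171.3, 1500.5, 1309.5, 716.0],
})

# Map every date to its calendar quarter, e.g. 2009-02-02 -> 2009Q1.
quarter = sample["date"].dt.to_period("Q")
quarterly_close = sample.groupby(quarter)["close"].median()

def plot_quarterly_close(series):
    """Draw the quarterly medians as a line chart (needs matplotlib) and return the Axes."""
    ax = series.plot(kind="line", title="Median close per quarter")
    ax.set_ylabel("USD per 1,000 board feet")
    return ax

print(quarterly_close)
```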

## What month in our dataset had the largest swing between high and low?

You'll want to create a new column for this one
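
A minimal sketch of that new column, assuming `high` and `low` are already floats: subtract one from the other, then use `idxmax()` to find the row with the widest gap. The column name `swing` is my own choice, and the result below only describes these six sample rows (copied from the outputs above), not the full dataset.

```python
import pandas as pd

# Stand-in rows copied from the outputs above.
sample = pd.DataFrame({
    "date": pd.to_datetime(["2009-01-02", "2009-02-02", "2009-03-02",
                            "2021-04-01", "2021-05-03", "2021-06-01"]),
    "high": [171.8, 175.9, 179.8, 1502.3, 1711.2, 1380.0],
    "low": [137.9, 138.0, 138.0, 1006.0, 1201.0, 705.0],
})

# New column: the distance between the month's high and low.
sample["swing"] = sample["high"] - sample["low"]

# idxmax() gives the index label of the largest swing; .loc pulls that whole row.
widest = sample.loc[sample["swing"].idxmax()]
print(widest["date"].strftime("%B %Y"), widest["swing"])
```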

## In how many months did lumber prices rise?

If the price was greater at the end of the month than the beginning of the month, we'll say the price rose.
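
Note that `open` and `close` still carry their `$` at this point in the notebook, so they need the same cleanup as `high` and `low` before they can be compared. A sketch on six sample rows copied from the outputs above: comparing the two columns gives a boolean Series, and summing it counts the `True` values.

```python
import pandas as pd

# Stand-in rows copied from the outputs above, still in their raw string form.
sample = pd.DataFrame({
    "open": ["$168.70", "$158.10", "$144.60", "$1012.60", "$1520.60", "$1315.00"],
    "close": ["$148.10", "$147.00", "$171.30", "$1500.50", "$1309.50", "$716.00"],
})

# Strip the literal "$" and convert to floats so the columns can be compared.
for col in ["open", "close"]:
    sample[col] = sample[col].str.replace("$", "", regex=False).astype(float)

# True where the month ended higher than it started.
rose = sample["close"] > sample["open"]
months_rose = int(rose.sum())
print(months_rose)
```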

## In how many months did lumber prices drop?

If the price was lower at the end of the month than the beginning of the month, we'll say the price dropped.
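
A month can also end exactly where it started, so "did not rise" is not quite the same as "dropped". The sketch below labels every month as rose, dropped, or flat and counts the labels; it assumes `open` and `close` are already floats, and the rows are the six sample months printed above.

```python
import pandas as pd

# Stand-in rows copied from the outputs above, with prices already numeric.
sample = pd.DataFrame({
    "open": [168.7, 158.1, 144.6, 1012.6, 1520.6, 1315.0],
    "close": [148.1, 147.0, 171.3, 1500.5, 1309.5, 716.0],
})

# Positive change means the price rose, negative means it dropped, zero means flat.
change = sample["close"] - sample["open"]
direction = change.apply(
    lambda x: "rose" if x > 0 else "dropped" if x < 0 else "flat"
)
counts = direction.value_counts()
print(counts)
```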

## In 2020, how many months saw lumber prices rise vs lumber prices fall?

## In 2000, how many months saw lumber prices rise vs lumber prices fall?
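This question and the 2020 one above differ only in the year, so one small function can cover both. The sketch assumes `date` is a datetime and `open`/`close` are floats; `rise_fall_counts` is a hypothetical helper name, and because the six sample rows (copied from the outputs above) only cover 2009 and 2021, those are the years used here.

```python
import pandas as pd

# Stand-in rows copied from the outputs above.
sample = pd.DataFrame({
    "date": pd.to_datetime(["2009-01-02", "2009-02-02", "2009-03-02",
                            "2021-04-01", "2021-05-03", "2021-06-01"]),
    "open": [168.7, 158.1, 144.6, 1012.6, 1520.6, 1315.0],
    "close": [148.1, 147.0, 171.3, 1500.5, 1309.5, 716.0],
})

def rise_fall_counts(frame, year):
    """Count the months in `year` that closed above or below their open."""
    in_year = frame[frame["date"].dt.year == year]
    return {
        "rose": int((in_year["close"] > in_year["open"]).sum()),
        "fell": int((in_year["close"] < in_year["open"]).sum()),
    }

print(rise_fall_counts(sample, 2021))
print(rise_fall_counts(sample, 2009))
```

On the full cleaned dataset, `rise_fall_counts(df, 2020)` and `rise_fall_counts(df, 2000)` would answer the two questions.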