In [None]:
import warnings
warnings.filterwarnings("ignore")

import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
import importers
import fbprophet
import sklearn
import cpi

import prepare
import explore

# Needed for matplotlib to handle plotting pandas datetime types
pd.plotting.register_matplotlib_converters()


### Acquire 

#### Coffee Data

We initially attempted to use International Coffee Organization (ICO) data to build a model that would predict the global commodity coffee price. We quickly found this question to be impossibly broad and instead chose to focus on one country, Colombia.

ICO's data was insufficiently specific and only went back to 1990. We began searching for other data sources and ultimately found the National Federation of Coffee Growers of Colombia, commonly abbreviated FedCafe.

After some difficulty with the language barrier, we downloaded a few Excel files detailing the export prices for Excelso coffee. Excelso is a size grade of coffee, the second-largest bean size, and it is the most commonly exported Colombian coffee bean.

#### Weather Data

In [None]:
df = prepare.get_data()

### Prep

In [None]:
df2 = prepare.get_prepped()

### Exploration

After acquiring and cleaning the data, we explored how our chosen features changed over time for the top 13 coffee-producing departments of Colombia. Initial exploration showed significant volatility in coffee production and precipitation.

In [None]:
# Look at possible causes of dramatic price changes over time
explore.events_over_time(df2)

In [None]:
explore.export_price_1991_2018(df2)

In [None]:
# Look at area cultivated per region
explore.area_cultivated()

In [None]:
# Production by region 1980-2018
explore.production_graph(df)

In [None]:
# Precipitation by region
explore.precipitation_by_region(df2)

In [None]:
# Does average precipitation by region correlate with the inflation-adjusted price of coffee?
explore.corr_price_and_precip(df2)

In [None]:
# Average temperature by region
explore.avg_temp_by_region(df2)

In [None]:
# Does average temperature by region correlate with the inflation-adjusted price of coffee?
explore.corr_price_and_temp(df2)
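
As a rough illustration of the two correlation questions above, the sketch below computes Pearson correlations between weather features and price on a small toy table. The column names (`avg_precip`, `avg_temp`, `real_price`) and the values are hypothetical and are not taken from `df2`; the `explore` functions may compute this differently.

```python
import pandas as pd

# Hypothetical toy data; names and values are illustrative only.
toy = pd.DataFrame({
    "avg_precip": [100.0, 120.0, 90.0, 150.0, 110.0],
    "avg_temp": [21.0, 20.5, 22.0, 19.5, 21.2],
    "real_price": [1.10, 1.25, 1.00, 1.40, 1.15],
})

# Pearson correlation of each weather feature with price.
# Drop the price row itself, which is trivially 1.0.
corr_with_price = toy.corr(method="pearson")["real_price"].drop("real_price")
print(corr_with_price)
```

A correlation near zero would suggest that the feature carries little linear signal about price, although it would not rule out a nonlinear or lagged relationship.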

#### The Observatory of Economic Complexity Data Exploration
After acquiring and preparing the OEC's data, we wanted to explore Colombia's export numbers to other countries and attempt to uncover trends within the data.
Initially, we compared Colombia's exports to Brazil's to see whether any indicators in Brazil's exports signaled that Colombia's exports would move in a particular direction. As the graph below shows, Brazil's production provides no such leading indicators.

In [None]:
importers.compare_brazil()
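
One way to check for a leading indicator numerically is a lagged correlation: correlate one series in year t with the other series in year t minus some lag. The sketch below is a minimal example under stated assumptions. The helper `lagged_corr` and the synthetic series are hypothetical, and this is not what `importers.compare_brazil()` does.

```python
import pandas as pd


def lagged_corr(leader: pd.Series, follower: pd.Series, max_lag: int = 3) -> pd.Series:
    """Correlate follower at year t with leader at year t - lag, for each lag."""
    return pd.Series(
        {lag: follower.corr(leader.shift(lag)) for lag in range(max_lag + 1)},
        name="corr",
    )


# Synthetic yearly export volumes, purely illustrative.
years = range(2000, 2010)
brazil = pd.Series([5, 7, 6, 9, 8, 11, 10, 13, 12, 15], index=years, dtype=float)
# Follower constructed to trail the leader by exactly one year.
colombia = brazil.shift(1) * 0.5

result = lagged_corr(brazil, colombia)
print(result)
```

If one series really led the other, the correlation would peak at a lag greater than zero, as it does in this constructed example. A flat profile across lags would be consistent with finding no leading indicator.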

Bar chart showing the top 10 importers between 1962 and 2017.


In [None]:
importers.compare_import_change()

#### Correlating Volatility
We wanted to see whether increased volatility in the prior year's production numbers would increase the volatility of the next year's price. TO BE CONTINUED


In [None]:
importers.get_volatility_graph()
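
The volatility question could be sketched as follows, assuming volatility is measured as a rolling standard deviation of year-over-year percentage changes. That definition, the three-year window, and the random synthetic series are all assumptions made for illustration; they are not taken from the `importers` module.

```python
import numpy as np
import pandas as pd

# Synthetic yearly series, purely illustrative.
rng = np.random.default_rng(0)
years = pd.Index(range(1980, 2019), name="year")
production = pd.Series(rng.normal(100, 10, len(years)), index=years)
price = pd.Series(rng.normal(1.5, 0.3, len(years)), index=years)


def rolling_volatility(series: pd.Series, window: int = 3) -> pd.Series:
    """Rolling standard deviation of year-over-year percentage changes."""
    return series.pct_change().rolling(window).std()


prod_vol = rolling_volatility(production)
price_vol = rolling_volatility(price)

# shift(1) moves production volatility forward one year, so each row pairs
# price volatility in year t with production volatility in year t - 1.
lagged_vol_corr = price_vol.corr(prod_vol.shift(1))
print(round(lagged_vol_corr, 3))
```

Because rolling windows overlap, consecutive volatility values are not independent, so a correlation computed this way should be read as a rough indicator rather than a formal test.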

In [None]:
#corr

##### Exploratory models




We discovered a noticeable shift in 

In [None]:
# After reviewing the data distribution, we saw that the pricing was bimodal.
# Splitting the data on this basis greatly improved modeling.
explore.dist_of_price(df2)

In [None]:
# Pre-1991
explore.dist_before(df)

In [None]:
# Post-1991
explore.dist_after(df)
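
The pre- and post-1991 split can be illustrated with a short sketch on synthetic data. The column names, the two price regimes, and the choice to place 1991 itself in the later group are assumptions made for this example, not details taken from `explore.dist_before` or `explore.dist_after`.

```python
import numpy as np
import pandas as pd

# Synthetic prices with two regimes, purely illustrative.
rng = np.random.default_rng(1)
years = np.arange(1980, 2019)
price = np.where(
    years < 1991,
    rng.normal(3.0, 0.2, years.size),
    rng.normal(1.5, 0.2, years.size),
)
toy = pd.DataFrame({"year": years, "price": price})

# Assumption: 1991 belongs to the later period.
before = toy[toy["year"] < 1991]
after = toy[toy["year"] >= 1991]

# Side-by-side summary statistics for the two periods.
summary = pd.DataFrame({
    "before_1991": before["price"].describe(),
    "after_1991": after["price"].describe(),
})
print(summary.loc[["count", "mean", "std"]])
```

When the pooled distribution is bimodal, each subset tends to look closer to unimodal on its own, which is one reason fitting the periods separately can help.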

### Model