# Assessing Market Opportunities for Alpaca Wool Scarves - Data Preparation

In this activity, you’ll prepare Google Trends data for time series forecasting. The goal is to validate market opportunities that could help the Aymara indigenous people in Bolivia export alpaca wool scarves to different countries or regions.

You’ll continue collaborating with the International Co-operative Alliance, but now you’ll use Prophet to validate prospective market opportunities.

## Instructions

1. Open Google Colab and import the provided notebook.

2. Read the Google Trends data into a Pandas DataFrame.

3. Plot the general trends.

4. Prepare the data to fit the Prophet model. Create two new DataFrames: one for Canada and another for Uruguay. Recall that Prophet expects the column names “ds” for the dates and “y” for the trend values.

## References

- [Pandas rename function](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.rename.html)


In [1]:
# Install the required libraries
!pip install hvplot
!pip install holoviews



In [2]:
# Import the required libraries and dependencies
import pandas as pd
import hvplot.pandas
import datetime as dt
import holoviews as hv
import matplotlib.pyplot as plt

%matplotlib inline


## Step 2: Read the Google Trends data into a Pandas DataFrame

In [3]:
# Upload the scarf-google-trends-data.csv file
from google.colab import files
uploaded = files.upload()

Saving scarf-google-trends-data.csv to scarf-google-trends-data.csv


In [11]:
# Read the data from the scarf-google-trends-data.csv file into a Pandas DataFrame
df_alpaca = pd.read_csv(
    'scarf-google-trends-data.csv'
).dropna()

# Review the DataFrame
display(df_alpaca.head())
display(df_alpaca.tail())

|  | week | canada | uruguay |
|---|---|---|---|
| 0 | 11/8/15 | 74 | 17 |
| 1 | 11/15/15 | 81 | 17 |
| 2 | 11/22/15 | 94 | 17 |
| 3 | 11/29/15 | 92 | 0 |
| 4 | 12/6/15 | 87 | 18 |


|  | week | canada | uruguay |
|---|---|---|---|
| 255 | 9/27/20 | 33 | 14 |
| 256 | 10/4/20 | 38 | 15 |
| 257 | 10/11/20 | 43 | 15 |
| 258 | 10/18/20 | 42 | 14 |
| 259 | 10/25/20 | 43 | 0 |
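
The review output shows Uruguay values of 0 in the weeks of 11/29/15 and 10/25/20. Since `dropna()` removes only missing values, zeros like these remain in the data. Whether they are true zeros or gaps in the source is not stated here, so the sketch below only counts them. It uses a small `sample` DataFrame copied from the `head()` output rather than the CSV file, and the names `sample`, `zero_counts`, and `zero_weeks` are illustrative.

```python
import pandas as pd

# Rows copied from the head() output above (not read from the CSV file)
sample = pd.DataFrame({
    "week": ["11/8/15", "11/15/15", "11/22/15", "11/29/15", "12/6/15"],
    "canada": [74, 81, 94, 92, 87],
    "uruguay": [17, 17, 17, 0, 18],
})

# Count how many weekly values are exactly zero in each country column
zero_counts = (sample[["canada", "uruguay"]] == 0).sum()
print(zero_counts)

# List the weeks where the Uruguay value is zero
zero_weeks = sample.loc[sample["uruguay"] == 0, "week"].tolist()
print(zero_weeks)
```

Running the same two expressions against `df_alpaca` would show how many such weeks exist in the full dataset before fitting a model.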


## Step 3: Plot the general trends

In [12]:
# Set `bokeh` to render hvPlot charts
hv.extension('bokeh')

# Plot the general trends using hvPlot
df_alpaca.hvplot()
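
Called without arguments, the plot uses the DataFrame's integer index for the x-axis, so the weeks appear as row numbers. A possible alternative, sketched here under the assumption that the `week` strings follow a month/day/two-digit-year pattern, is to parse the dates and make them the index first. The `sample` and `trends` names are illustrative, and the rows are copied from the earlier output.

```python
import pandas as pd

# Rows copied from the head() output earlier in the notebook
sample = pd.DataFrame({
    "week": ["11/8/15", "11/15/15", "11/22/15"],
    "canada": [74, 81, 94],
    "uruguay": [17, 17, 17],
})

# Assumption: dates are formatted as month/day/two-digit year
sample["week"] = pd.to_datetime(sample["week"], format="%m/%d/%y")

# Use the parsed dates as the index so a plot can place them on the x-axis
trends = sample.set_index("week")
print(trends.index.dtype)

# In the notebook, with hvplot.pandas imported, the plot call would be:
# trends.hvplot()
```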


## Step 4: Prepare the data to fit the Prophet model. Create two new DataFrames: one for Canada and another for Uruguay. Recall that Prophet expects the column names “ds” for the dates and “y” for the trend values.


In [13]:
# Create a DataFrame for Canada to include the week and canada columns
df_canada = df_alpaca[['week', 'canada']]

# Rename the columns to the Prophet model syntax
df_canada = df_canada.rename(columns={'week':'ds', 'canada':'y'})

# Review the Canada DataFrame
display(df_canada.head())
display(df_canada.tail())

|  | ds | y |
|---|---|---|
| 0 | 11/8/15 | 74 |
| 1 | 11/15/15 | 81 |
| 2 | 11/22/15 | 94 |
| 3 | 11/29/15 | 92 |
| 4 | 12/6/15 | 87 |


|  | ds | y |
|---|---|---|
| 255 | 9/27/20 | 33 |
| 256 | 10/4/20 | 38 |
| 257 | 10/11/20 | 43 |
| 258 | 10/18/20 | 42 |
| 259 | 10/25/20 | 43 |


In [14]:
# Create a DataFrame for Uruguay to include the week and uruguay columns
df_uruguay = df_alpaca[['week', 'uruguay']]

# Rename the columns to the Prophet model syntax 
df_uruguay = df_uruguay.rename(columns={'week':'ds', 'uruguay':'y'})

# Review the Uruguay DataFrame
display(df_uruguay.head())
display(df_uruguay.tail())

|  | ds | y |
|---|---|---|
| 0 | 11/8/15 | 17 |
| 1 | 11/15/15 | 17 |
| 2 | 11/22/15 | 17 |
| 3 | 11/29/15 | 0 |
| 4 | 12/6/15 | 18 |


|  | ds | y |
|---|---|---|
| 255 | 9/27/20 | 14 |
| 256 | 10/4/20 | 15 |
| 257 | 10/11/20 | 15 |
| 258 | 10/18/20 | 14 |
| 259 | 10/25/20 | 0 |
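
The Canada and Uruguay cells repeat the same select-and-rename pattern, and the `ds` values are still strings such as `11/8/15`. As a minimal sketch, the pattern could be wrapped in a helper that also converts `ds` to datetime values. The function name `to_prophet_frame`, its parameters, and the month/day/two-digit-year date format are assumptions made for illustration and are not part of the original notebook.

```python
import pandas as pd


def to_prophet_frame(df, value_column, date_column="week", date_format="%m/%d/%y"):
    """Return a two-column DataFrame named the way Prophet expects: ds and y."""
    # Select the date column and one country column, then rename them
    out = df[[date_column, value_column]].rename(
        columns={date_column: "ds", value_column: "y"}
    )
    # Assumption: dates look like 11/8/15 (month/day/two-digit year)
    out["ds"] = pd.to_datetime(out["ds"], format=date_format)
    return out


# Rows copied from the head() output earlier in the notebook
sample = pd.DataFrame({
    "week": ["11/8/15", "11/15/15", "11/22/15"],
    "canada": [74, 81, 94],
    "uruguay": [17, 17, 17],
})

canada = to_prophet_frame(sample, "canada")
uruguay = to_prophet_frame(sample, "uruguay")
print(canada)
print(uruguay)
```

In the notebook, the equivalent calls would be `to_prophet_frame(df_alpaca, 'canada')` and `to_prophet_frame(df_alpaca, 'uruguay')`. The helper leaves the input DataFrame unchanged.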
