# About the Data

## Description

This data set is from the [New Zealand Government](https://www.stats.govt.nz/), and it details all electronic card transactions through February 2022.

## Where to Find

The data, along with other NZ datasets, can be downloaded from https://www.stats.govt.nz/large-datasets/csv-files-for-download/

## Questions to Answer

Here are some questions from a first glance at this data:

1. How much has been spent on each type of purchase over the lifetime of this data?
2. What has been the increase in purchase amounts for `ECTA.S19A1` over the lifetime of this data?

## Importing the Data

In [None]:
import pandas as pd
df = pd.read_csv('electronic-card-transactions-february-2022-csv-tables.csv')

## Describing the Data with Pandas

In [None]:
df.describe()

## Data Analysis

Let's first get the column labels and the row index of the data frame.

In [None]:
cols = df.columns
indices = df.index
print('Columns:', cols)
print('Indices:', indices)

We will now plot a simple slice of the data: the `Data_value` of the `ECTA.S19A1` series reference over the given time period.

In [None]:
import matplotlib.pyplot as plt
ectas19a1 = df[df['Series_reference'] == 'ECTA.S19A1']
ectas19a1.plot(x='Period', y='Data_value', kind='scatter', figsize=(10,5))
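The scatter plot shows the trend visually. A minimal sketch of how the increase asked about in question 2 could also be put into numbers is to sort the series by `Period` and compare its first and last `Data_value`. The `toy` frame and the `series_increase` helper below are hypothetical, and the values are made up so the cell runs without the CSV; swapping `toy` for `df` should give the real figures.

```python
import pandas as pd

# Hypothetical stand-in for df: these values are made up, not taken from the dataset.
toy = pd.DataFrame({
    "Series_reference": ["ECTA.S19A1"] * 3 + ["OTHER.SERIES"],
    "Period": [2001.03, 2011.03, 2021.03, 2021.03],
    "Data_value": [100.0, 150.0, 250.0, 999.0],
})

def series_increase(frame, series_ref):
    """Return (absolute, percent) change between the first and last period."""
    s = (frame[frame["Series_reference"] == series_ref]
         .dropna(subset=["Data_value"])   # ignore periods with no recorded value
         .sort_values("Period"))          # make sure first/last follow time order
    first = s["Data_value"].iloc[0]
    last = s["Data_value"].iloc[-1]
    return last - first, (last - first) / first * 100

abs_change, pct_change = series_increase(toy, "ECTA.S19A1")
print(abs_change, pct_change)
```

Comparing only the two endpoints is sensitive to a single unusual period, so averaging the first and last few periods may be a more robust variant.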

In [None]:
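# pivot_table aggregates with the mean by default, so each bar is the
# average Data_value for that transaction type, not the total.
piv = df.pivot_table(columns='Series_title_2', values='Data_value')
piv.plot(kind='bar', figsize=(20,10))
plt.title('Mean Amount per Type of Electronic Card Transaction')
plt.xlabel('Type of Electronic Card Transaction')
plt.ylabel('Mean Amount')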

In [None]:
df.groupby('Series_title_2')['Data_value'].sum().plot(kind='bar', figsize=(10,5))
plt.title('Amounts per Type of Electronic Card Transaction')
plt.xlabel('Type of Electronic Card Transaction')
plt.ylabel('Amount')
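The two bar charts above aggregate differently: `pivot_table` uses the mean by default, while the `groupby` cell uses the sum. As a rough sketch of how the two could be compared side by side, the cell below computes both per category and sorts by total. The `toy` frame and its category names are made up for illustration; only the column names come from the data set.

```python
import pandas as pd

# Hypothetical stand-in for df: categories and values are made up.
toy = pd.DataFrame({
    "Series_title_2": ["Credit", "Debit", "Credit", "Fuel", "Debit"],
    "Data_value": [10.0, 20.0, 30.0, 5.0, None],
})

# Mean and sum per transaction type; missing values are skipped by both.
summary = (toy.groupby("Series_title_2")["Data_value"]
           .agg(["mean", "sum"])
           .sort_values("sum", ascending=False))
print(summary)
```

Calling `summary["sum"].plot(kind='bar')` on the real data would give the same chart as above, with the bars ordered from largest to smallest total.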

# Discussing the Results

With regard to each of the questions asked at the beginning:

1. Not every purchase category has an amount large enough to be visible on the graph. Debit and credit stand out as common transaction types, as do core industry transactions; these either involve significantly larger amounts or simply have more logged data than transactions in consumables or fuel.
2. As expected, the transaction amount for `ECTA.S19A1` has increased over the lifetime of this data. Its shape is interesting, however: the most recent few years suggest the growth may no longer be linear and may be levelling off or about to decline.
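One way the "no longer linear" impression in point 2 might be checked is to fit a straight line to the earlier periods and see how far the recent values fall below its extrapolation. The sketch below uses `numpy.polyfit` on made-up yearly numbers; the arrays, the yearly spacing, and the choice of holding out the last three points are all assumptions for illustration, not properties of the real series.

```python
import numpy as np

# Made-up yearly values: steady growth that flattens in the last three years.
years = np.arange(2010, 2022)
values = np.array([10, 12, 14, 16, 18, 20, 22, 24, 26, 26.5, 26.8, 27.0])

# Shift the x axis to start at zero, which keeps the least-squares fit well conditioned.
x = years - years[0]

# Fit a straight line to everything except the last three years.
slope, intercept = np.polyfit(x[:-3], values[:-3], deg=1)

# Extrapolate the line over the held-out years and compare with the observed values.
predicted = slope * x[-3:] + intercept
shortfall = predicted - values[-3:]
print(shortfall)
```

A shortfall that grows with each held-out period would be consistent with the series levelling off, though with so few points it is only a hint, not a test.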