## Statistical Aggregates
In statistics, an aggregate condenses a collection of data down to a single number. This number describes some feature of the data, such as:

* Centrality (mean, median, mode)
* Spread (standard deviation, variance, range)
* Extremities (min, max, first, last)
* Totals (sum, count, size)
 
***
## Pandas Aggregates
All the aggregates above can be computed via their corresponding pandas method (recall that methods are called with the syntax <code>variable_name.method_name(arguments)</code>. The methods work for both series and dataframes and are generally only valid for numeric data types.

| Method |	Description |
| ----------- | ----------- |
| <code>mean</code> |	The average value |
| <code>median</code> |	The middle value after being sorted |
| <code>var</code> |	The average squared distance from the mean |
| <code>std</code> |	The square root of variance |
| <code>min</code> |	Smallest value |
| <code>max</code> |	Largest value |
| <code>sum</code> |	Sum of all the values |
| <code>count</code>	| Count of all the values |
 

Some examples are provided below for the pollution dataset.

In [1]:
import pandas as pd
pollution = pd.read_csv('LSTM-Multivariate_pollution.csv', index_col = 'date', parse_dates = True)

In [2]:
# Average temperature
pollution['temp'].mean()

np.float64(12.45904109589048)

In [3]:
# Maximum pollution level
pollution['pollution'].max()

np.int64(994)

In [4]:
# Total amount of rainfall
pollution['rain'].sum()

np.int64(8542)

In [5]:
# Total snowfall in March 2010
pollution.loc['1/03/2010 0:00': '1/04/2010 0:00', 'snow'].sum()

np.int64(178)