# Pandas 1.1

When wrangling data with Python, we begin by importing packages such as pandas so that we can access their tools (functions and methods) for data manipulation.

**pandas** is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.

*Description from https://pandas.pydata.org/*

<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/e/ed/Pandas_logo.svg/1200px-Pandas_logo.svg.png" width=500/>


In [1]:
import pandas as pd

Working with pandas for data wrangling can generally be divided into four stages:
- Loading the data
- Exploring the data
- Cleaning the data
- Preparing data for visualization


For this discussion, focus will be on **loading & exploring datasets**.

--------------------------------
## Loading the data

*Data from [DHS Program](https://dhsprogram.com/methodology/survey/survey-display-510.cfm)*

The most common source of simple data is a CSV (comma-separated values) file, which we load with `read_csv`. Pandas can read other file types as well, using functions such as:
- read_excel
- read_sql
- read_json

`read_csv` accepts many optional arguments that control how the file is read. In a Jupyter notebook, press Shift+Tab with the cursor inside the parentheses to see them.
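To get a feel for these arguments, here is a minimal sketch using two of them, `usecols` and `nrows`. The CSV text and its column names are made up for illustration and are held in memory so the example runs without the survey file.

```python
import io

import pandas as pd

# Hypothetical three-column CSV, kept in memory for the example.
csv_text = "id,region,members\n1,Ilocos,4\n2,Bicol,6\n3,Davao,3\n"

# usecols keeps only the listed columns; nrows stops after two data rows.
sample = pd.read_csv(io.StringIO(csv_text), usecols=["id", "members"], nrows=2)
print(sample)
```

Reading only the columns and rows you need can be handy with a wide file such as the one used here.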

In [2]:
# Read the data
df = pd.read_csv('data/PHHR71FL-labeled.csv')



Pandas tries to determine what dtype to set by analyzing the data in each column. A ```DtypeWarning``` is raised when a column in the file appears to contain values of more than one type. Recall that there are several dtypes.

We have access to numpy dtypes: `float`, `int`, `bool`, `timedelta64[ns]` and `datetime64[ns]`. Note that the numpy date/time dtypes are not time zone aware. Pandas extends this set of dtypes with its own: `datetime64[ns, <tz>]`, `category`, `Int64`, `string`, `boolean`, etc.
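As a small illustration of the difference between the numpy dtypes and the pandas extension dtypes, the sketch below builds a toy frame (the names `toy` and `converted` are ours, not part of the dataset) and converts two columns with `astype`.

```python
import pandas as pd

# Toy data: one numeric column with a missing value, one text column.
toy = pd.DataFrame({
    "members": [4, 6, None],
    "region": ["Ilocos", "Bicol", "Ilocos"],
})

# The missing value forces the numeric column to float64.
print(toy.dtypes)

# Int64 (capital I) is the pandas nullable integer dtype;
# category stores repeated labels compactly.
converted = toy.astype({"members": "Int64", "region": "category"})
print(converted.dtypes)
```

This is also why so many integer-looking columns in our dataset show up as `float64`: a plain numpy integer column cannot hold missing values.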

Read the complete reference here: [Pandas dtype reference](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.dtypes.html)

How do we address the ```DtypeWarning```?

In [3]:
pd.read_csv('data/PHHR71FL-labeled.csv', dtype = 'unicode')

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
0,00010001,PH7,1.0,1.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
1,00010002,PH7,1.0,2.0,4.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
2,00010004,PH7,1.0,4.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
3,00010007,PH7,1.0,7.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
4,00010008,PH7,1.0,8.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
27491,12500022,PH7,1250.0,22.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,
27492,12500023,PH7,1250.0,23.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,Yes,Yes,,,,No,No,,,
27493,12500024,PH7,1250.0,24.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,
27494,12500025,PH7,1250.0,25.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,


**Note.** Setting `dtype='unicode'` silences the warning above. Pandas stores these strings with the `object` dtype, so in the sample above every column was read as an object (string) column.
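Reading everything as text is a blunt fix. One alternative, sketched here under the assumption that only a few columns are problematic, is to pass a dictionary to `dtype` so that only the named columns are read as strings. The CSV text and the column name `case_id` are hypothetical.

```python
import io

import pandas as pd

# Hypothetical file: an identifier with leading zeros and a numeric column.
csv_text = "case_id,members\n00010001,8\n00010002,5\n"

# Default inference treats the identifier as an integer and drops the zeros.
inferred = pd.read_csv(io.StringIO(csv_text))

# A per-column dtype keeps the identifier as text and leaves the rest inferred.
explicit = pd.read_csv(io.StringIO(csv_text), dtype={"case_id": str})

print(inferred["case_id"].tolist())
print(explicit["case_id"].tolist())
```

The numeric column stays numeric, so it can still be used in calculations.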

---------------------------------------
## Exploring the data

We explore the data in three ways: 1) **by checking its characteristics and whether it has errors**; 2) **by inspecting its subsets**; and 3) **by looking at its descriptive statistics**.

### 1. Checking the data

#### 1.a. Standard checks
How big is our dataset?

In [4]:
# Check size of dataframe
df.shape

(27496, 2472)

What does it look like?

In [5]:
# Check first 5 rows
df.head()

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
0,10001,PH7,1.0,1.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
1,10002,PH7,1.0,2.0,4.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
2,10004,PH7,1.0,4.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
3,10007,PH7,1.0,7.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
4,10008,PH7,1.0,8.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,


Is it clean until the end?

In [6]:
# Check last 5 rows
df.tail()

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
27491,12500022,PH7,1250.0,22.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,
27492,12500023,PH7,1250.0,23.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,Yes,Yes,,,,No,No,,,
27493,12500024,PH7,1250.0,24.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,
27494,12500025,PH7,1250.0,25.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,
27495,12500026,PH7,1250.0,26.0,2.0,1250.0,1245167.0,8.0,2017.0,1412.0,...,,,,,,,,,,


What other information is available?

In [7]:
# Check additional info
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 27496 entries, 0 to 27495
Columns: 2472 entries, HHID to SH233Z$5
dtypes: float64(1055), int64(1), object(1416)
memory usage: 518.6+ MB


Because there are so many columns, the output above is summarized. Let's look at the information available for the first 20 columns in the dataset...

For now, take the first part of the code (slicing with `.iloc`) as is; we will cover slicing later in this discussion.

In [8]:
# Check additional info of the first 20 columns
df.iloc[:,0:20].info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 27496 entries, 0 to 27495
Data columns (total 20 columns):
 #   Column  Non-Null Count  Dtype  
---  ------  --------------  -----  
 0   HHID    27496 non-null  int64  
 1   HV000   27496 non-null  object 
 2   HV001   27496 non-null  float64
 3   HV002   27496 non-null  float64
 4   HV003   27496 non-null  float64
 5   HV004   27496 non-null  float64
 6   HV005   27496 non-null  float64
 7   HV006   27496 non-null  float64
 8   HV007   27496 non-null  float64
 9   HV008   27496 non-null  float64
 10  HV008A  27496 non-null  float64
 11  HV009   27496 non-null  float64
 12  HV010   27496 non-null  float64
 13  HV011   0 non-null      float64
 14  HV012   27496 non-null  float64
 15  HV013   27496 non-null  float64
 16  HV014   27496 non-null  float64
 17  HV015   27496 non-null  object 
 18  HV016   27496 non-null  float64
 19  HV017   27496 non-null  float64
dtypes: float64(17), int64(1), object(2)
memory usage: 4.2+ MB


What are the data types?

In [9]:
# Check data types
df.dtypes

HHID          int64
HV000        object
HV001       float64
HV002       float64
HV003       float64
             ...   
SH233Z$1     object
SH233Z$2     object
SH233Z$3     object
SH233Z$4     object
SH233Z$5     object
Length: 2472, dtype: object

**Self-check:** Try loading the data with `dtype = 'unicode'` as a `read_csv` argument and see how the dtypes will all be set as object.

In [10]:
# Check column names
df.columns

Index(['HHID', 'HV000', 'HV001', 'HV002', 'HV003', 'HV004', 'HV005', 'HV006',
       'HV007', 'HV008',
       ...
       'SH233X$1', 'SH233X$2', 'SH233X$3', 'SH233X$4', 'SH233X$5', 'SH233Z$1',
       'SH233Z$2', 'SH233Z$3', 'SH233Z$4', 'SH233Z$5'],
      dtype='object', length=2472)

Because the column headers are codes, it's hard to make sense of the data. Let's load the JSON data dictionary to see what each column represents.

In [11]:
#Load json dictionary
import json

with open('data/PHHR71FL-data-dictionary.json', 'r') as file:
    col_dict = json.load(file)
    
col_dict

{'HHID': 'Case Identification',
 'HV000': 'Country code and phase',
 'HV001': 'Cluster number',
 'HV002': 'Household number',
 'HV003': "Respondent's line number (answering Household questionnaire)",
 'HV004': 'Ultimate area unit',
 'HV005': 'Household sample weight (6 decimals)',
 'HV006': 'Month of interview',
 'HV007': 'Year of interview',
 'HV008': 'Date of interview (CMC)',
 'HV008A': 'Date of interview Century Day Code (CDC)',
 'HV009': 'Number of household members',
 'HV010': 'Number of eligible women in household',
 'HV011': 'NA - Number of eligible men in household',
 'HV012': 'Number of de jure members',
 'HV013': 'Number of de facto members',
 'HV014': 'Number of children 5 and under (de jure)',
 'HV015': 'Result of household interview',
 'HV016': 'Day of interview',
 'HV017': 'Number of visits',
 'HV018': 'Interviewer identification',
 'HV019': 'NA - Keyer identification',
 'HV020': 'Ever-married sample',
 'HV021': 'Primary sampling unit',
 'HV022': 'Sample strata for sampl
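
A dictionary like this can also be used to look up or apply readable labels. The sketch below assumes a two-entry dictionary copied from the output above and a small toy frame; `labels`, `toy`, and `renamed` are illustrative names only.

```python
import pandas as pd

# Two entries copied from the data dictionary output above.
labels = {
    "HV009": "Number of household members",
    "HV010": "Number of eligible women in household",
}

# Toy frame standing in for a subset of the survey columns.
toy = pd.DataFrame({"HV009": [8.0, 5.0], "HV010": [2.0, 2.0]})

# Look up a single code, with a fallback for codes not in the dictionary.
print(labels.get("HV009", "unknown code"))

# rename returns a new frame; columns missing from the mapping are left as is.
renamed = toy.rename(columns=labels)
print(renamed.columns.tolist())
```

Keeping the original codes in `df` and renaming only for display avoids breaking code that refers to the codes.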

#### 1.b. Checking for errors

Are there missing values?

In [12]:
#Check how many missing values there are in each column
df.isnull().sum()

HHID            0
HV000           0
HV001           0
HV002           0
HV003           0
            ...  
SH233Z$1    22685
SH233Z$2    27076
SH233Z$3    27442
SH233Z$4    27490
SH233Z$5    27494
Length: 2472, dtype: int64
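
With thousands of columns, raw counts are hard to scan. A possible follow-up, shown on a made-up frame, is to compute the share of missing values per column and sort it; the 0.5 threshold is an arbitrary choice for the example.

```python
import numpy as np
import pandas as pd

# Toy frame with different amounts of missing data per column.
toy = pd.DataFrame({
    "a": [1.0, np.nan, 3.0, 4.0],
    "b": [np.nan, np.nan, np.nan, "x"],
    "c": [1, 2, 3, 4],
})

# The mean of a boolean mask is the fraction of True values.
missing_share = toy.isnull().mean().sort_values(ascending=False)
print(missing_share)

# Columns where more than half of the values are missing.
mostly_empty = missing_share[missing_share > 0.5].index.tolist()
print(mostly_empty)
```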

Are there duplicates?

In [13]:
#How many unique values are there in a column - Case ID
df["HHID"].nunique()

27496

In [14]:
# How many duplicated rows are there in a column - Case ID
df["HHID"].duplicated(keep=False).sum()

0

In [15]:
# Another example - Region
df["HV024"].nunique()

17

In [21]:
df["HV024"].duplicated(keep=False).sum()

27496
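
The last result equals the number of rows because every region appears in more than one row, and `keep=False` flags every row whose value occurs elsewhere. A minimal sketch on a toy Series (the values are illustrative) shows how `nunique` and the two forms of `duplicated` differ.

```python
import pandas as pd

regions = pd.Series(["Ilocos", "Bicol", "Ilocos", "Davao", "Bicol"])

# Number of distinct values.
n_unique = regions.nunique()

# Default keep="first": only the repeats after the first occurrence are flagged.
n_repeats = regions.duplicated().sum()

# keep=False: every row whose value appears more than once is flagged.
n_shared = regions.duplicated(keep=False).sum()

print(n_unique, n_repeats, n_shared)
```

For an identifier such as `HHID` we expect zero in both duplicate counts, whereas for a grouping column such as region, duplicates are normal.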

---------------------------------
### 2. Inspecting subsets of a dataset

#### 2.a. Slicing the dataset by column name
We can select a column of a dataframe using the syntax `df["column_name"]` or `df.column_name`.

In [22]:
# Returns a Series - Case Identification (HHID)
df['HHID']

0           10001
1           10002
2           10004
3           10007
4           10008
           ...   
27491    12500022
27492    12500023
27493    12500024
27494    12500025
27495    12500026
Name: HHID, Length: 27496, dtype: int64

In [23]:
# Another way of selecting column
df.HHID

0           10001
1           10002
2           10004
3           10007
4           10008
           ...   
27491    12500022
27492    12500023
27493    12500024
27494    12500025
27495    12500026
Name: HHID, Length: 27496, dtype: int64
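
The two forms are not fully interchangeable. Attribute access only works when the column name is a valid Python identifier, which many columns in this dataset (for example `SH233Z$1`) are not. The toy frame below is an assumption for illustration.

```python
import pandas as pd

toy = pd.DataFrame({"HHID": [10001, 10002], "SH233Z$1": ["No", None]})

# Both forms return the same Series for a simple column name.
same = toy["HHID"].equals(toy.HHID)
print(same)

# toy.SH233Z$1 would be a SyntaxError, so bracket notation is required here.
flagged = toy["SH233Z$1"]
print(flagged)
```

Bracket notation is the safer default, since it works for every column name.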

In [24]:
# Returns dataframe
df[['HHID']]

Unnamed: 0,HHID
0,10001
1,10002
2,10004
3,10007
4,10008
...,...
27491,12500022
27492,12500023
27493,12500024
27494,12500025


To select more than one column, pass a list of column names.

In [25]:
# Try selecting any interesting columns to look at from the data dictionary
# HHID, HV009 - Number of household members, HV010 - Number of eligible women in household
# HV011 - NA - Number of eligible men in household
# HV014 - Number of children 5 and under

df[['HHID', 'HV009', 'HV010', 'HV011', 'HV014']]

Unnamed: 0,HHID,HV009,HV010,HV011,HV014
0,10001,8.0,2.0,,1.0
1,10002,5.0,2.0,,1.0
2,10004,4.0,1.0,,0.0
3,10007,5.0,1.0,,2.0
4,10008,11.0,0.0,,0.0
...,...,...,...,...,...
27491,12500022,5.0,1.0,,3.0
27492,12500023,5.0,1.0,,0.0
27493,12500024,4.0,1.0,,2.0
27494,12500025,7.0,0.0,,2.0


#### 2.b. Slicing the dataset by position

To obtain a subset of the dataframe by the position of its rows or columns, use `.iloc`.

In [26]:
df.iloc[:10]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
0,10001,PH7,1.0,1.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
1,10002,PH7,1.0,2.0,4.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
2,10004,PH7,1.0,4.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
3,10007,PH7,1.0,7.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
4,10008,PH7,1.0,8.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
5,10009,PH7,1.0,9.0,1.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
6,10010,PH7,1.0,10.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
7,10011,PH7,1.0,11.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
8,10012,PH7,1.0,12.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
9,10013,PH7,1.0,13.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,


In [27]:
df.iloc[:, :3]

Unnamed: 0,HHID,HV000,HV001
0,10001,PH7,1.0
1,10002,PH7,1.0
2,10004,PH7,1.0
3,10007,PH7,1.0
4,10008,PH7,1.0
...,...,...,...
27491,12500022,PH7,1250.0
27492,12500023,PH7,1250.0
27493,12500024,PH7,1250.0
27494,12500025,PH7,1250.0


In [28]:
# Get index 1 to 3 of the dataframe (start inclusive, end exclusive)
df.iloc[1:4]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
1,10002,PH7,1.0,2.0,4.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
2,10004,PH7,1.0,4.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,
3,10007,PH7,1.0,7.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,,,,,,,,,,


#### <font color='pink'>Practice:</font>  Get the rows at positions 5 through 15 for the columns named in the dtype warning earlier

In [29]:
# Insert code
df.iloc[5:16, [71,87,103,115,153,161]]

Unnamed: 0,HV220,HV235,HV238,HV245,SH109C,SH123B
5,77.0,,,,,
6,39.0,,2.0,,,2011.0
7,30.0,,,,,
8,41.0,,,40.0,,
9,20.0,,,,,
10,40.0,,,,,
11,46.0,,,,,
12,44.0,,,,,2013.0
13,59.0,,,30.0,,2011.0
14,52.0,,2.0,,,


#### 2.c. Slicing by index name

If you don't know the position but know the index label you are looking for, you can use `.loc`.

In [30]:
#Sample selection of row using .loc
df.loc[10]

HHID        10014
HV000         PH7
HV001         1.0
HV002        14.0
HV003         1.0
            ...  
SH233Z$1      NaN
SH233Z$2      NaN
SH233Z$3      NaN
SH233Z$4      NaN
SH233Z$5      NaN
Name: 10, Length: 2472, dtype: object

In [31]:
#Sample selection of rows & columns using .loc
df.loc[:10,"HV005"]

0     364301.0
1     364301.0
2     364301.0
3     364301.0
4     364301.0
5     364301.0
6     364301.0
7     364301.0
8     364301.0
9     364301.0
10    364301.0
Name: HV005, dtype: float64
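
Notice that `df.loc[:10]` returned eleven rows, while `df.iloc[:10]` earlier returned ten. A `.loc` slice includes its end label, whereas an `.iloc` slice excludes its end position. The sketch below uses a toy frame whose index labels deliberately differ from the positions, to keep the two apart.

```python
import pandas as pd

# Index labels start at 10, so labels and positions do not coincide.
toy = pd.DataFrame({"value": range(100, 106)}, index=[10, 11, 12, 13, 14, 15])

by_position = toy.iloc[1:4]  # positions 1, 2, 3 (end excluded)
by_label = toy.loc[11:14]    # labels 11, 12, 13, 14 (end included)

print(by_position.index.tolist())
print(by_label.index.tolist())
```

In our dataset the default index runs from 0, so labels and positions happen to match, which makes this difference easy to miss.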

#### 2.d. Filtering data based on column values

Filtering allows you to select or inspect parts of the dataframe based on cell values.

In [32]:
# Look at unique values of region column (HV024)
df["HV024"].unique()

array(['Autonomous Region in Muslim Mindanao', 'Zamboanga Peninsula',
       'Soccsksargen', 'Cordillera', 'Caraga', 'National Capital',
       'Western Visayas', 'Central Visayas', 'Ilocos', 'Cagayan Valley',
       'Central Luzon', 'Calabarzon', 'Mimaropa', 'Bicol',
       'Eastern Visayas', 'Northern Mindanao', 'Davao'], dtype=object)

In [33]:
df["HV024"] == "National Capital"

0        False
1        False
2        False
3        False
4        False
         ...  
27491    False
27492    False
27493    False
27494    False
27495    False
Name: HV024, Length: 27496, dtype: bool

In [34]:
# Returns all rows where the condition is True
df[df["HV024"] == "National Capital"]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
5138,2200002,PH7,220.0,2.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
5139,2200005,PH7,220.0,5.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5140,2200006,PH7,220.0,6.0,3.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5141,2200007,PH7,220.0,7.0,5.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5142,2200008,PH7,220.0,8.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
7516,3450020,PH7,345.0,20.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
7517,3450021,PH7,345.0,21.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7518,3450022,PH7,345.0,22.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7519,3450023,PH7,345.0,23.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,


You can use `&` (and) and `|` (or) to combine conditions; wrap each condition in parentheses.
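
The parentheses matter because `&` and `|` bind more tightly than comparisons such as `==` and `>=`. As a minimal sketch on a toy frame (column names are illustrative), here are `|`, the negation operator `~`, and `isin` for matching against a list of values.

```python
import pandas as pd

toy = pd.DataFrame({
    "region": ["National Capital", "Bicol", "Davao", "National Capital"],
    "members": [6, 3, 7, 2],
})

in_capital = toy["region"] == "National Capital"
large = toy["members"] >= 5

# Rows that satisfy at least one condition.
either = toy[in_capital | large]

# ~ negates a boolean mask: rows that satisfy neither condition.
neither = toy[~(in_capital | large)]

# isin replaces a chain of == comparisons joined by |.
listed = toy[toy["region"].isin(["Bicol", "Davao"])]

print(either.index.tolist(), neither.index.tolist(), listed.index.tolist())
```

Storing each condition in a named variable, as done here, is one way to keep longer filters readable.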

In [35]:
# HV024 - Region, HV009 - Number of household members
df[(df["HV024"] == "National Capital") & (df["HV009"] >= 5)]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
5138,2200002,PH7,220.0,2.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
5141,2200007,PH7,220.0,7.0,5.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5144,2200010,PH7,220.0,10.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5145,2200011,PH7,220.0,11.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
5149,2200016,PH7,220.0,16.0,2.0,220.0,3066799.0,9.0,2017.0,1413.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
7506,3450010,PH7,345.0,10.0,2.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7507,3450011,PH7,345.0,11.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
7512,3450016,PH7,345.0,16.0,2.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7513,3450017,PH7,345.0,17.0,1.0,345.0,2261838.0,9.0,2017.0,1413.0,...,,,,,,,,,,


#### <font color='pink'>Practice:</font>  Get all households outside Central Visayas with more than 3 eligible women to interview in the household

In [36]:
# Insert code
df[(df["HV024"] != "Central Visayas") & (df["HV010"] > 3)]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
11,10015,PH7,1.0,15.0,2.0,1.0,364301.0,8.0,2017.0,1412.0,...,No,,,,,No,,,,
94,50006,PH7,5.0,6.0,2.0,5.0,192659.0,10.0,2017.0,1414.0,...,,,,,,,,,,
128,60018,PH7,6.0,18.0,2.0,6.0,337154.0,9.0,2017.0,1413.0,...,,,,,,,,,,
137,70001,PH7,7.0,1.0,1.0,7.0,96330.0,9.0,2017.0,1413.0,...,,,,,,,,,,
260,110025,PH7,11.0,25.0,11.0,11.0,108590.0,10.0,2017.0,1414.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
27180,12370011,PH7,1237.0,11.0,8.0,1237.0,1116325.0,9.0,2017.0,1413.0,...,,,,,,,,,,
27262,12410010,PH7,1241.0,10.0,2.0,1241.0,1115116.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
27331,12440004,PH7,1244.0,4.0,2.0,1244.0,1945227.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
27435,12480013,PH7,1248.0,13.0,13.0,1248.0,276704.0,9.0,2017.0,1413.0,...,,,,,,,,,,


You can also use string methods such as `str.contains` to find items.

In [37]:
# Keep rows whose region name contains "Visayas"
df[df["HV024"].str.contains("Visayas")]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
7521,3460001,PH7,346.0,1.0,3.0,346.0,1553942.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
7522,3460002,PH7,346.0,2.0,1.0,346.0,1553942.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7523,3460003,PH7,346.0,3.0,2.0,346.0,1553942.0,9.0,2017.0,1413.0,...,,,,,,,,,,
7524,3460004,PH7,346.0,4.0,2.0,346.0,1553942.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
7525,3460005,PH7,346.0,5.0,1.0,346.0,1553942.0,9.0,2017.0,1413.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
22017,10060021,PH7,1006.0,21.0,2.0,1006.0,560029.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22018,10060022,PH7,1006.0,22.0,2.0,1006.0,560029.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22019,10060023,PH7,1006.0,23.0,4.0,1006.0,560029.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22020,10060024,PH7,1006.0,24.0,2.0,1006.0,560029.0,9.0,2017.0,1413.0,...,,,,,,,,,,


#### <font color='pink'>Practice:</font>  Get all rows with province code containing 'zamboanga'

In [38]:
# Insert code
df[df["SHPROV"].str.contains("zamboanga")]

Unnamed: 0,HHID,HV000,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,...,SH233X$1,SH233X$2,SH233X$3,SH233X$4,SH233X$5,SH233Z$1,SH233Z$2,SH233Z$3,SH233Z$4,SH233Z$5
22022,10070002,PH7,1007.0,2.0,1.0,1007.0,1044976.0,8.0,2017.0,1412.0,...,,,,,,,,,,
22023,10070003,PH7,1007.0,3.0,2.0,1007.0,1044976.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22024,10070004,PH7,1007.0,4.0,1.0,1007.0,1044976.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22025,10070005,PH7,1007.0,5.0,1.0,1007.0,1044976.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
22026,10070006,PH7,1007.0,6.0,1.0,1007.0,1044976.0,9.0,2017.0,1413.0,...,,,,,,,,,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
22921,10500019,PH7,1050.0,19.0,1.0,1050.0,455445.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22922,10500020,PH7,1050.0,20.0,2.0,1050.0,455445.0,9.0,2017.0,1413.0,...,No,,,,,No,,,,
22923,10500027,PH7,1050.0,27.0,2.0,1050.0,455445.0,9.0,2017.0,1413.0,...,,,,,,,,,,
22924,10500028,PH7,1050.0,28.0,2.0,1050.0,455445.0,9.0,2017.0,1413.0,...,,,,,,,,,,


------------------------------------
### 3. Inspecting summary statistics

When exploring a dataset, it's important to look at statistical measures that can help us estimate characteristics of a group, to observe patterns, and to summarize the data concisely.

Statistical measures that summarize a dataset are called **summary statistics**. Some common ones are listed below.
<br>
<br>
**Measures of Central Tendency**
- **Mean** - Sum of all values in a set divided by total number of values
- **Median** - Middle value when the data is in order. Interpolated when the number of values is even
- **Mode** - Most common data point in a data set. There could be more than one mode in a dataset

**Measures of Variability**
- **Variance and Standard Deviation** - Describe how spread out the values are around the mean. The smaller the variance and standard deviation, the better the mean represents the whole data set
- **Range** - Difference between the largest and smallest values in the data set
- **Interquartile Range (IQR)** - Describes the range of the middle 50% of values
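
Each of the measures above can be computed directly on a pandas Series. The sketch below uses a small made-up list of household sizes; the variable names are ours.

```python
import pandas as pd

members = pd.Series([2, 3, 3, 4, 5, 7])

mean = members.mean()            # sum of values divided by their count
median = members.median()        # average of the two middle values here
modes = members.mode().tolist()  # a list, since there can be several modes
variance = members.var()         # sample variance (ddof=1 by default)
std = members.std()              # square root of the sample variance
value_range = members.max() - members.min()
iqr = members.quantile(0.75) - members.quantile(0.25)

print(mean, median, modes, variance, std, value_range, iqr)
```

Note that `var` and `std` in pandas default to the sample versions (dividing by n - 1), which differs from the numpy default.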

<br>

Most of these summary statistics are readily available through the pandas `.describe()` method.

In [41]:
#Insert code here
df.describe()

Unnamed: 0,HHID,HV001,HV002,HV003,HV004,HV005,HV006,HV007,HV008,HV008A,HV009,HV010,HV011,...
count,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,27496.0,0.0,...
0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,7380.0,1335.0,279.0,67.0,25.0,6.0,3.0,7380.0,1335.0,279.0,67.0,25.0,6.0,3.0,67.0,25.0,6.0,3.0,4811.0,420.0,54.0,6.0,2.0,4811.0,420.0,54.0,6.0,2.0,4811.0,420.0,54.0,6.0,2.0,2.0,185.0,20.0,2.0,1.0,878.0,44.0,2.0,0.0,0.0,3033.0,196.0,21.0,2.0,1.0,3033.0,196.0,21.0,2.0,1.0,3959.0,332.0,43.0,4.0,1.0,3959.0,332.0,43.0,4.0,1.0
mean,6267132.2159,626.7118,13.726,1.7806,626.7118,999999.9895,8.7148,2017.0,1412.7148,42986.0544,4.3742,0.9343,,4.338,4.1712,0.5197,16.9801,1.156,4593.6165,,626.7118,,0.0,4591.937,,,,,,1229.6819,1262.4647,,,,,,,1.8617,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0008,-0.0004,,,,,129.2125,431.1577,1.5929,1.2324,1.0984,1.9278,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,14.3446,14.4919,14.8307,15.1058,14.8214,10.186,15.7692,17.625,19.0,8.2,0.5,0.0,3.0961,2.7565,3.1343,2.8644,3.1268,3.2895,2.8889,3.3889,2.6154,2.3333,3.0,,,4.5577,4.6964,4.4651,5.1154,6.25,4.75,3.6,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.9562,3.3185,3.1922,2.9524,2.7426,2.566,2.6849,2.4192,2.7027,2.4127,2.4242,2.75,3.0,2.5,2.2857,1.3333,4.0,,,0.0664,3.0605,1.5904,1.6002,1.4567,1.5257,1.3122,1.4038,1.2143,1.6744,1.2308,2.125,1.25,2.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.6281,3.8179,4.6486,5.1019,5.6965,6.4085,7.0588,8.3333,10.0,11.5,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,2.9484,4.3633,5.4086,6.1642,6.88,5.1667,6.6667,13.7761,11.4,15.0,10.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,2.7088,4.3952,5.5741,6.3333,8.0,40.5,3791.1
243,2479.35,1550.0,2500.0,5091.4385,1523.9318,197.5,,,9303.8022,5326.5714,4303.9048,1550.0,2500.0,7692.9941,5493.2908,1285.7143,6625.0,0.0,11515.539,10237.4789,3658.7674,3375.0,2000.0,10816.4307,9420.6175,7300.0,5337.5,0.0
std,3648306.1911,364.8306,50.7118,0.882,364.8306,1286848.5133,0.6122,0.0,0.6122,15.6222,2.2203,0.8444,,2.2091,2.183,0.7905,8.5228,0.4582,2642.5727,,364.8306,,0.0,2642.5589,,,,,,305.1466,300.4554,,,,,,,0.9611,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,99768.6595,99998.1835,,,,,123.8106,1636.477,1.0592,0.5668,0.3492,1.6777,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,16.8575,16.7383,16.9179,16.5488,14.9643,9.215,17.4615,15.6114,22.9222,8.5849,0.7071,,1.8607,2.0888,2.0244,2.132,2.1107,2.1674,1.8257,2.0041,2.256,1.9664,0.0,,,4.9558,5.0486,5.0301,4.9584,5.7562,5.7508,4.9295,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.4083,1.7552,1.8245,1.8558,1.8698,1.9077,1.9448,1.8398,1.9098,1.9397,1.8713,1.9833,2.3534,1.4142,2.6277,2.3094,,,,0.9485,3.9287,2.9071,2.9642,3.0772,3.1255,2.7431,2.9838,2.6057,3.5235,2.597,3.9476,3.5355,4.4721,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.8981,1.9453,1.946,1.8229,1.7543,1.8648,1.6741,2.3094,1.118,1.9149,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0344,2.0647,2.1453,2.1923,2.3685,2.7869,0.5774,10.9015,9.5219,8.3666,8.6603,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.986,2.336,3.1599,1.633,1.4142,51.6188,15454.1309,2550.965
1,1343.5029,,16624.7224,3459.135,144.9569,,,53628.4817,22397.4199,8774.7595,1343.5029,,53466.7703,15356.4321,2887.7821,7601.3979,,41913.8596,44990.7247,7442.3439,4308.422,,36283.7799,13015.3015,5605.0997,4902.954,
min,10001.0,1.0,1.0,1.0,1.0,9113.0,8.0,2017.0,1412.0,42961.0,1.0,0.0,,0.0,0.0,0.0,1.0,1.0,100.0,,1.0,,0.0,100.0,,,,,,48.0,123.0,,,,,,,1.0,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,-235175.0,-302583.0,,,,,1.0,1.0,1.0,1.0,0.0,0.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,3.0,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,4.0,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,9.0,10.0,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,0.0,0.0,1.0,1.0,4.0,1.0,6.0,0.0,0.0,5.0,0.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,0.0,0.0,1.0,4.0,7.0,4.0,0.0,100.0,600.0,2500.0,0.0,0.0,95.0,,,0.0,0.0,0.0,600.0,2500.0,0.0,0.0,0.0,1250.0,0.0,0.0,0.0,0.0,0.0,2000.0,0.0,0.0,0.0,2250.0,0.0
25%,3120009.75,312.0,7.0,1.0,312.0,334158.0,8.0,2017.0,1412.0,42972.0,3.0,0.0,,3.0,3.0,0.0,10.0,1.0,2302.0,,312.0,,0.0,2300.0,,,,,,1003.0,1026.0,,,,,,,1.0,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,-76952.0,-74560.5,,,,,49.0,49.0,1.0,1.0,1.0,0.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,3.0,3.0,3.0,2.75,3.75,1.5,4.0,5.0,4.25,1.0,0.25,0.0,2.0,1.0,1.0,1.0,1.5,1.25,1.5,2.25,0.0,0.75,3.0,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.0,2.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.75,0.0,0.0,4.0,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,1.0,3.0,4.0,5.0,5.0,3.75,6.5,5.0,5.0,11.25,7.5,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,5.25,7.5,22.25,500.0,746.75,1075.0,2500.0,380.0,298.75,146.25,,,500.0,400.0,400.0,1075.0,2500.0,0.0,0.0,0.0,3937.5,0.0,0.0,0.0,0.0,0.0,2000.0,500.0,2000.0,3500.0,2437.5,0.0
50%,6310017.5,631.0,13.0,2.0,631.0,666927.0,9.0,2017.0,1413.0,42985.0,4.0,1.0,,4.0,4.0,0.0,18.0,1.0,4602.0,,631.0,,0.0,4600.0,,,,,,1208.0,1226.0,,,,,,,2.0,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,-7221.0,-5769.0,,,,,104.0,104.0,1.0,1.0,1.0,2.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,8.0,8.0,8.0,9.0,12.0,8.0,13.5,15.5,11.0,4.0,0.5,0.0,3.0,3.0,3.0,3.0,3.0,3.0,3.0,3.0,3.0,3.0,3.0,,,3.0,2.5,2.0,5.5,6.0,2.5,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,3.0,3.0,3.0,3.0,3.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,2.0,1.0,0.0,4.0,,,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.0,4.0,4.0,5.0,5.0,6.0,7.0,8.0,10.0,11.0,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,2.0,4.0,5.0,6.0,6.0,6.0,7.0,10.0,10.0,15.0,15.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,2.0,4.0,5.0,6.5,8.0,40.5,1000.0,1500.0,1550.0,2500.0,1000.0,600.0,197.5,,,1800.0,1250.0,1300.0,1550.0,2500.0,0.0,0.0,0.0,6625.0,0.0,2000.0,794.5,90.0,2250.0,2000.0,6000.0,7000.0,6000.0,3250.0,0.0
75%,9422519.75,942.25,20.0,2.0,942.25,1155955.0,9.0,2017.0,1413.0,42998.0,6.0,1.0,,6.0,5.0,1.0,24.0,1.0,6904.0,,942.25,,0.0,6900.0,,,,,,1441.0,1501.0,,,,,,,2.0,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,72726.25,68296.75,,,,,183.0,184.0,2.0,1.0,1.0,2.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,20.0,21.0,22.0,23.0,21.25,17.0,22.75,25.25,23.25,17.0,0.75,0.0,5.0,5.0,5.0,5.0,5.0,5.0,4.5,4.0,4.0,3.0,3.0,,,9.0,9.25,9.5,9.0,10.5,9.0,9.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,4.0,5.0,5.0,4.0,4.0,4.0,4.0,4.0,4.0,4.0,4.0,3.5,5.0,3.25,4.5,2.0,4.0,,,0.0,6.0,2.0,2.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.5,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,4.0,5.0,5.0,6.0,6.0,7.0,8.0,9.0,11.0,12.5,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,4.0,6.0,6.0,7.0,8.0,6.0,7.0,22.5,15.0,15.0,15.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,4.0,6.0,7.0,7.75,8.5,58.75,2100.0,3000.0,2025.0,2500.0,3000.0,1500.0,248.75,,,5000.0,3000.0,4500.0,2025.0,2500.0,5000.0,5000.0,0.0,9312.5,0.0,9000.0,5000.0,3950.0,5625.0,2000.0,12000.0,11000.0,10500.0,6150.0,0.0
max,12500026.0,1250.0,8002.0,13.0,1250.0,23977595.0,10.0,2017.0,1414.0,43029.0,21.0,7.0,,21.0,21.0,6.0,31.0,5.0,9002.0,,1250.0,,0.0,9000.0,,,,,,2355.0,2356.0,,,,,,,22.0,1.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,297764.0,329347.0,,,,,4748.0,9999.0,11.0,7.0,5.0,16.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,92.0,90.0,82.0,90.0,76.0,29.0,86.0,54.0,70.0,18.0,1.0,0.0,7.0,7.0,7.0,7.0,7.0,7.0,5.0,7.0,6.0,5.0,3.0,,,15.0,15.0,15.0,15.0,15.0,15.0,9.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,6.0,5.0,6.0,4.0,4.0,,,16.0,16.0,15.0,15.0,16.0,15.0,15.0,13.0,12.0,12.0,10.0,11.0,10.0,10.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,12.0,13.0,14.0,15.0,16.0,17.0,18.0,19.0,20.0,21.0,1.0,2.0,3.0,4.0,5.0,6.0,7.0,8.0,9.0,10.0,11.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,15.0,18.0,19.0,16.0,16.0,14.0,11.0,12.0,12.0,14.0,13.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,2.0,3.0,4.0,5.0,6.0,7.0,17.0,13.0,15.0,15.0,16.0,9.0,7.0,30.0,30.0,30.0,15.0,1.0,2.0,3.0,4.0,5.0,1.0,2.0,3.0,4.0,5.0,15.0,16.0,19.0,8.0,9.0,77.0,200000.0,10000.0,2500.0,2500.0,200000.0,22000.0,300.0,,,999998.0,288000.0,40000.0,2500.0,2500.0,999998.0,136652.0,11000.0,12000.0,0.0,999999.0,600000.0,40000.0,9000.0,2000.0,999998.0,100000.0,22000.0,12600.0,0.0


In [40]:
# Show more columns and format float values with thousands separators and 4 decimal places
pd.set_option('display.max_columns', 1056)
pd.set_option('display.float_format', '{:,.4f}'.format)
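
The two options above change pandas display settings for the rest of the session. As a minimal sketch, assuming you only want the formatting for a single output, `pd.option_context` applies an option temporarily and restores the previous value afterwards. The `toy` frame and its column names are hypothetical and are not part of the survey data.

```python
import pandas as pd

# Hypothetical toy frame used only to illustrate the display options
toy = pd.DataFrame({"income": [1234567.891, 0.5], "members": [3, 4]})

# Rendering with whatever float format is currently active
plain = toy.to_string()

# Apply the float format only inside the with-block
with pd.option_context("display.float_format", "{:,.4f}".format):
    formatted = toy.to_string()

print(plain)
print(formatted)

# A global setting made with pd.set_option can be undone with pd.reset_option
pd.set_option("display.max_columns", 1056)
pd.reset_option("display.max_columns")
```

Note that these options only affect how values are displayed; the data stored in the dataframe is unchanged.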

To understand how the summary statistics above (count, mean, standard deviation, minimum, quartiles, and maximum) are computed, refer to this [linked discussion on shape, center, and spread](http://mathcenter.oxford.emory.edu/site/math117/shapeCenterAndSpread/).
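
As a rough illustration of what each row of `describe()` represents, the sketch below recomputes the statistics one by one on a small, made-up Series (the values are hypothetical and not taken from the survey file). It also includes one missing value to show why `count` can be lower than the number of rows, as in many columns of the output above.

```python
import pandas as pd

# Hypothetical toy data with one missing value
s = pd.Series([1.0, 2.0, 3.0, 4.0, 10.0, None])

summary = s.describe()

# Recompute each statistic individually; missing values are skipped
manual = pd.Series({
    "count": s.count(),        # number of non-missing values
    "mean": s.mean(),          # arithmetic mean
    "std": s.std(),            # sample standard deviation (divides by n - 1)
    "min": s.min(),
    "25%": s.quantile(0.25),   # first quartile, linear interpolation
    "50%": s.quantile(0.50),   # median
    "75%": s.quantile(0.75),   # third quartile
    "max": s.max(),
})

# Side-by-side comparison of describe() and the manual computation
print(pd.concat([summary, manual], axis=1, keys=["describe", "manual"]))
```

A column made up entirely of missing values has a `count` of 0, so its other statistics come out as `NaN`, which is why many columns in the output above are blank below the count row.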