# A Whirlwind Tour of pandas

[10 minutes to pandas](https://pandas.pydata.org/pandas-docs/stable/getting_started/10min.html)

[Comparison with other tools (R, SQL, SAS)](https://pandas.pydata.org/pandas-docs/stable/getting_started/comparison/index.html)

## Data

We will use [house price data from Kaggle](https://www.kaggle.com/alphaepsilon/housing-prices-dataset).
There are two files: the data in [`housing.csv`](housing.csv) and a description of the data in [`housing_description.txt`](housing_description.txt).

## Loading the data

Most of the time we will use the [`read_csv`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html) function, but there are [other readers](https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html) as well.

A convenient feature of pandas is that the input can be either a local file or a URL.
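
As a minimal sketch of the same idea, `read_csv` also accepts any file-like object, so the call can be tried without a file on disk. The three-column CSV text below is a hypothetical miniature of `housing.csv`, not the real data; a path or URL string would be passed in exactly the same position.

```python
import io

import pandas as pd

# Hypothetical miniature of housing.csv (the real file has many more columns).
csv_text = "Id,LotArea,SalePrice\n1,8450,208500\n2,9600,181500\n"

# read_csv takes a path, a URL, or a file-like object as its first argument.
sample = pd.read_csv(io.StringIO(csv_text))

print(sample.shape)
print(sample.head())
```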

In [1]:
# Import pandas under the alias 'pd'
import pandas as pd

import numpy as np

In [2]:
houses = pd.read_csv('housing.csv')
# head() shows the first 5 rows; pass a different number to change that
houses.head()

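|  | Id | MSSubClass | MSZoning | LotFrontage | LotArea | Street | Alley | LotShape | LandContour | Utilities | ... | PoolArea | PoolQC | Fence | MiscFeature | MiscVal | MoSold | YrSold | SaleType | SaleCondition | SalePrice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 60 | RL | 65.0 | 8450 | Pave | NaN | Reg | Lvl | AllPub | ... | 0 | NaN | NaN | NaN | 0 | 2 | 2008 | WD | Normal | 208500 |
| 1 | 2 | 20 | RL | 80.0 | 9600 | Pave | NaN | Reg | Lvl | AllPub | ... | 0 | NaN | NaN | NaN | 0 | 5 | 2007 | WD | Normal | 181500 |
| 2 | 3 | 60 | RL | 68.0 | 11250 | Pave | NaN | IR1 | Lvl | AllPub | ... | 0 | NaN | NaN | NaN | 0 | 9 | 2008 | WD | Normal | 223500 |
| 3 | 4 | 70 | RL | 60.0 | 9550 | Pave | NaN | IR1 | Lvl | AllPub | ... | 0 | NaN | NaN | NaN | 0 | 2 | 2006 | WD | Abnorml | 140000 |
| 4 | 5 | 60 | RL | 84.0 | 14260 | Pave | NaN | IR1 | Lvl | AllPub | ... | 0 | NaN | NaN | NaN | 0 | 12 | 2008 | WD | Normal | 250000 |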


## Selecting columns

General approach (works for every column name):

```python
houses['SalePrice']
```

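Attribute access works only for column names that are valid Python identifiers (no spaces, not starting with a digit) and that do not clash with an existing `DataFrame` attribute or method. Its advantage is tab completion of the column name.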

```python
houses.SalePrice
```
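
The difference between the two styles can be illustrated with a small sketch. The toy frame below is hypothetical; the column `Lot Area` (with a space) is invented for the illustration, while `1stFlrSF` mirrors a real column name from the data set that starts with a digit.

```python
import pandas as pd

# Hypothetical toy frame; "Lot Area" is an invented name containing a space.
df = pd.DataFrame({
    "SalePrice": [208500, 181500],
    "1stFlrSF": [856, 1262],
    "Lot Area": [8450, 9600],
})

# Square brackets work for any column name.
by_brackets = df["1stFlrSF"]

# Attribute access works here because SalePrice is a valid identifier.
by_attribute = df.SalePrice

# A list of names selects several columns and returns a DataFrame.
several = df[["SalePrice", "Lot Area"]]
print(several)
```

Writing `df.1stFlrSF` would be a syntax error, which is why the bracket form is the safe default.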

In [3]:
houses['SalePrice'].head()

0    208500
1    181500
2    223500
3    140000
4    250000
Name: SalePrice, dtype: int64

In [4]:
houses.SalePrice.head()

0    208500
1    181500
2    223500
3    140000
4    250000
Name: SalePrice, dtype: int64

## Aside: comparing floating-point numbers

We do not compare floating-point numbers for exact equality. Instead, we check whether the difference between the two numbers is small enough, using `numpy` or `math`.

If you need to compute with decimal numbers at high precision, use the [`decimal`](https://docs.python.org/3/library/decimal.html) module.
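
A short sketch of why exact comparison is unreliable, using the classic `0.1 + 0.2` case. The numbers are illustrative only and are not taken from the housing data.

```python
import math
from decimal import Decimal

import numpy as np

# Binary floating point cannot represent 0.1 or 0.2 exactly.
direct = (0.1 + 0.2 == 0.3)

# Tolerance-based comparison of two scalars.
close = math.isclose(0.1 + 0.2, 0.3)

# Element-wise tolerance-based comparison of two arrays.
elementwise = np.isclose(np.array([0.1 + 0.2, 1.0]), np.array([0.3, 1.1]))

# Decimal built from strings does exact decimal arithmetic.
exact = (Decimal("0.1") + Decimal("0.2") == Decimal("0.3"))

print(direct, close, elementwise, exact)
```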

In [5]:
# Plain Python
import math
math.isclose(0.5, 0.5)

True

In [6]:
# For pandas
np.isclose(houses.LotFrontage, houses.MasVnrArea)

array([False, False, False, ..., False, False, False])

## Basic data types in pandas

- `DataFrame` - a table with indexed rows
- `Series` - a single column with indexed rows

Each `Series` also has a data type (dtype), which can be:

- Number (`int64`, `int32`, `float64`, `float32`)
- True/False (`bool`)
- String (`object`)
- Category (`category`)
- Date and time (`datetime64`)

More in [this article](https://pbpython.com/pandas_dtypes.html).
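
Converting between these types might look like the sketch below. The frame and its `Sold` column are hypothetical and exist only to show `astype` and `pd.to_datetime`.

```python
import pandas as pd

# Hypothetical toy data; the "Sold" column is invented for the illustration.
df = pd.DataFrame({
    "LotShape": ["Reg", "IR1", "Reg"],
    "Sold": ["2008-02-01", "2007-05-01", "2008-09-01"],
    "SalePrice": [208500, 181500, 223500],
})

# Strings with few distinct values are a natural fit for the category dtype.
df["LotShape"] = df["LotShape"].astype("category")

# Date strings are parsed into a datetime64 column.
df["Sold"] = pd.to_datetime(df["Sold"])

# Integers can be widened to floats explicitly.
df["SalePrice"] = df["SalePrice"].astype("float64")

print(df.dtypes)
```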

In [7]:
# type() is the built-in Python function for finding out an object's type
type(houses)

pandas.core.frame.DataFrame

In [8]:
type(houses.SalePrice)

pandas.core.series.Series

In [9]:
# Find out the data types of the individual columns
houses.dtypes.head()

Id               int64
MSSubClass       int64
MSZoning        object
LotFrontage    float64
LotArea          int64
dtype: object

## Display options in pandas

Many options can be configured; see the [documentation](https://pandas.pydata.org/pandas-docs/stable/user_guide/options.html).

`pd.options.display.max_rows` sets the maximum number of rows displayed for a table.

`pd.options.display.max_columns` sets the maximum number of columns displayed for a table.
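
When a setting should apply only temporarily, one possible approach is `pd.option_context`, which restores the previous values on exit. The limits 5 and 10 below are arbitrary values chosen for the sketch.

```python
import pandas as pd

# Remember the current setting so we can compare afterwards.
default_rows = pd.get_option("display.max_rows")

# Inside the block the options are changed; outside they are restored.
with pd.option_context("display.max_rows", 5, "display.max_columns", 10):
    inside = pd.get_option("display.max_rows")

after = pd.get_option("display.max_rows")
print(default_rows, inside, after)
```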

In [10]:
pd.options.display.max_rows = 999
pd.options.display.max_columns = 999

## Attributes versus methods

The name alone does not always tell you which is which. As a rule of thumb, an attribute is something that does not need to be computed (the columns, the index, but also the dimensions).

A method is called with parentheses, as in plain Python (e.g. `houses.head()`), whereas an attribute is accessed without them (e.g. `houses.columns`).

[List of `DataFrame` attributes and methods](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html)

[List of `Series` attributes and methods](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.html)
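
When in doubt, Python's built-in `callable` can tell the two apart. This is a small sketch on a hypothetical one-column frame.

```python
import pandas as pd

# Hypothetical toy frame with a single column.
df = pd.DataFrame({"LotArea": [8450, 9600, 11250]})

# shape is an attribute: a plain tuple, accessed without parentheses.
print(df.shape)

# head is a method: it is callable and must be called to get a result.
print(callable(df.head), callable(df.shape))

first_two = df.head(2)
print(first_two)
```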

In [11]:
houses.columns

Index(['Id', 'MSSubClass', 'MSZoning', 'LotFrontage', 'LotArea', 'Street',
       'Alley', 'LotShape', 'LandContour', 'Utilities', 'LotConfig',
       'LandSlope', 'Neighborhood', 'Condition1', 'Condition2', 'BldgType',
       'HouseStyle', 'OverallQual', 'OverallCond', 'YearBuilt', 'YearRemodAdd',
       'RoofStyle', 'RoofMatl', 'Exterior1st', 'Exterior2nd', 'MasVnrType',
       'MasVnrArea', 'ExterQual', 'ExterCond', 'Foundation', 'BsmtQual',
       'BsmtCond', 'BsmtExposure', 'BsmtFinType1', 'BsmtFinSF1',
       'BsmtFinType2', 'BsmtFinSF2', 'BsmtUnfSF', 'TotalBsmtSF', 'Heating',
       'HeatingQC', 'CentralAir', 'Electrical', '1stFlrSF', '2ndFlrSF',
       'LowQualFinSF', 'GrLivArea', 'BsmtFullBath', 'BsmtHalfBath', 'FullBath',
       'HalfBath', 'BedroomAbvGr', 'KitchenAbvGr', 'KitchenQual',
       'TotRmsAbvGrd', 'Functional', 'Fireplaces', 'FireplaceQu', 'GarageType',
       'GarageYrBlt', 'GarageFinish', 'GarageCars', 'GarageArea', 'GarageQual',
       'GarageCond', 'PavedDrive

In [12]:
houses.shape

(1460, 81)

In [13]:
# Without parentheses we get the method object itself, not its result
houses.head

<bound method NDFrame.head of         Id  MSSubClass MSZoning  LotFrontage  LotArea Street Alley LotShape  \
0        1          60       RL         65.0     8450   Pave   NaN      Reg   
1        2          20       RL         80.0     9600   Pave   NaN      Reg   
2        3          60       RL         68.0    11250   Pave   NaN      IR1   
3        4          70       RL         60.0     9550   Pave   NaN      IR1   
4        5          60       RL         84.0    14260   Pave   NaN      IR1   
5        6          50       RL         85.0    14115   Pave   NaN      IR1   
6        7          20       RL         75.0    10084   Pave   NaN      Reg   
7        8          60       RL          NaN    10382   Pave   NaN      IR1   
8        9          50       RM         51.0     6120   Pave   NaN      Reg   
9       10         190       RL         50.0     7420   Pave   NaN      Reg   
10      11          20       RL         70.0    11200   Pave   NaN      Reg   
11      12          60

In [14]:
houses.head(n=10)

# Equivalent slicing syntax: houses[:10]

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice
0,1,60,RL,65.0,8450,Pave,,Reg,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,1Fam,2Story,7,5,2003,2003,Gable,CompShg,VinylSd,VinylSd,BrkFace,196.0,Gd,TA,PConc,Gd,TA,No,GLQ,706,Unf,0,150,856,GasA,Ex,Y,SBrkr,856,854,0,1710,1,0,2,1,3,1,Gd,8,Typ,0,,Attchd,2003.0,RFn,2,548,TA,TA,Y,0,61,0,0,0,0,,,,0,2,2008,WD,Normal,208500
1,2,20,RL,80.0,9600,Pave,,Reg,Lvl,AllPub,FR2,Gtl,Veenker,Feedr,Norm,1Fam,1Story,6,8,1976,1976,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,CBlock,Gd,TA,Gd,ALQ,978,Unf,0,284,1262,GasA,Ex,Y,SBrkr,1262,0,0,1262,0,1,2,0,3,1,TA,6,Typ,1,TA,Attchd,1976.0,RFn,2,460,TA,TA,Y,298,0,0,0,0,0,,,,0,5,2007,WD,Normal,181500
2,3,60,RL,68.0,11250,Pave,,IR1,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,1Fam,2Story,7,5,2001,2002,Gable,CompShg,VinylSd,VinylSd,BrkFace,162.0,Gd,TA,PConc,Gd,TA,Mn,GLQ,486,Unf,0,434,920,GasA,Ex,Y,SBrkr,920,866,0,1786,1,0,2,1,3,1,Gd,6,Typ,1,TA,Attchd,2001.0,RFn,2,608,TA,TA,Y,0,42,0,0,0,0,,,,0,9,2008,WD,Normal,223500
3,4,70,RL,60.0,9550,Pave,,IR1,Lvl,AllPub,Corner,Gtl,Crawfor,Norm,Norm,1Fam,2Story,7,5,1915,1970,Gable,CompShg,Wd Sdng,Wd Shng,,0.0,TA,TA,BrkTil,TA,Gd,No,ALQ,216,Unf,0,540,756,GasA,Gd,Y,SBrkr,961,756,0,1717,1,0,1,0,3,1,Gd,7,Typ,1,Gd,Detchd,1998.0,Unf,3,642,TA,TA,Y,0,35,272,0,0,0,,,,0,2,2006,WD,Abnorml,140000
4,5,60,RL,84.0,14260,Pave,,IR1,Lvl,AllPub,FR2,Gtl,NoRidge,Norm,Norm,1Fam,2Story,8,5,2000,2000,Gable,CompShg,VinylSd,VinylSd,BrkFace,350.0,Gd,TA,PConc,Gd,TA,Av,GLQ,655,Unf,0,490,1145,GasA,Ex,Y,SBrkr,1145,1053,0,2198,1,0,2,1,4,1,Gd,9,Typ,1,TA,Attchd,2000.0,RFn,3,836,TA,TA,Y,192,84,0,0,0,0,,,,0,12,2008,WD,Normal,250000
5,6,50,RL,85.0,14115,Pave,,IR1,Lvl,AllPub,Inside,Gtl,Mitchel,Norm,Norm,1Fam,1.5Fin,5,5,1993,1995,Gable,CompShg,VinylSd,VinylSd,,0.0,TA,TA,Wood,Gd,TA,No,GLQ,732,Unf,0,64,796,GasA,Ex,Y,SBrkr,796,566,0,1362,1,0,1,1,1,1,TA,5,Typ,0,,Attchd,1993.0,Unf,2,480,TA,TA,Y,40,30,0,320,0,0,,MnPrv,Shed,700,10,2009,WD,Normal,143000
6,7,20,RL,75.0,10084,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Somerst,Norm,Norm,1Fam,1Story,8,5,2004,2005,Gable,CompShg,VinylSd,VinylSd,Stone,186.0,Gd,TA,PConc,Ex,TA,Av,GLQ,1369,Unf,0,317,1686,GasA,Ex,Y,SBrkr,1694,0,0,1694,1,0,2,0,3,1,Gd,7,Typ,1,Gd,Attchd,2004.0,RFn,2,636,TA,TA,Y,255,57,0,0,0,0,,,,0,8,2007,WD,Normal,307000
7,8,60,RL,,10382,Pave,,IR1,Lvl,AllPub,Corner,Gtl,NWAmes,PosN,Norm,1Fam,2Story,7,6,1973,1973,Gable,CompShg,HdBoard,HdBoard,Stone,240.0,TA,TA,CBlock,Gd,TA,Mn,ALQ,859,BLQ,32,216,1107,GasA,Ex,Y,SBrkr,1107,983,0,2090,1,0,2,1,3,1,TA,7,Typ,2,TA,Attchd,1973.0,RFn,2,484,TA,TA,Y,235,204,228,0,0,0,,,Shed,350,11,2009,WD,Normal,200000
8,9,50,RM,51.0,6120,Pave,,Reg,Lvl,AllPub,Inside,Gtl,OldTown,Artery,Norm,1Fam,1.5Fin,7,5,1931,1950,Gable,CompShg,BrkFace,Wd Shng,,0.0,TA,TA,BrkTil,TA,TA,No,Unf,0,Unf,0,952,952,GasA,Gd,Y,FuseF,1022,752,0,1774,0,0,2,0,2,2,TA,8,Min1,2,TA,Detchd,1931.0,Unf,2,468,Fa,TA,Y,90,0,205,0,0,0,,,,0,4,2008,WD,Abnorml,129900
9,10,190,RL,50.0,7420,Pave,,Reg,Lvl,AllPub,Corner,Gtl,BrkSide,Artery,Artery,2fmCon,1.5Unf,5,6,1939,1950,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,BrkTil,TA,TA,No,GLQ,851,Unf,0,140,991,GasA,Ex,Y,SBrkr,1077,0,0,1077,1,0,1,0,2,2,TA,5,Typ,2,TA,Attchd,1939.0,RFn,1,205,Gd,TA,Y,0,4,0,0,0,0,,,,0,1,2008,WD,Normal,118000


In [15]:
# Descriptive statistics; include='all' also summarizes non-numeric columns
houses.describe(include='all')

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice
count,1460.0,1460.0,1460,1201.0,1460.0,1460,91,1460,1460,1460,1460,1460,1460,1460,1460,1460,1460,1460.0,1460.0,1460.0,1460.0,1460,1460,1460,1460,1452.0,1452.0,1460,1460,1460,1423,1423,1422,1423,1460.0,1422,1460.0,1460.0,1460.0,1460,1460,1460,1459,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,1460,1460.0,1460,1460.0,770,1379,1379.0,1379,1460.0,1460.0,1379,1379,1460,1460.0,1460.0,1460.0,1460.0,1460.0,1460.0,7,281,54,1460.0,1460.0,1460.0,1460,1460,1460.0
unique,,,5,,,2,2,4,4,2,5,3,25,9,8,5,8,,,,,6,8,15,16,4.0,,4,5,6,4,4,4,6,,6,,,,6,5,2,5,,,,,,,,,,,4,,7,,5,6,,3,,,5,5,3,,,,,,,3,4,4,,,,9,6,
top,,,RL,,,Pave,Grvl,Reg,Lvl,AllPub,Inside,Gtl,NAmes,Norm,Norm,1Fam,1Story,,,,,Gable,CompShg,VinylSd,VinylSd,,,TA,TA,PConc,TA,TA,No,Unf,,Unf,,,,GasA,Ex,Y,SBrkr,,,,,,,,,,,TA,,Typ,,Gd,Attchd,,Unf,,,TA,TA,Y,,,,,,,Gd,MnPrv,Shed,,,,WD,Normal,
freq,,,1151,,,1454,50,925,1311,1459,1052,1382,225,1260,1445,1220,726,,,,,1141,1434,515,504,864.0,,906,1282,647,649,1311,953,430,,1256,,,,1428,741,1365,1334,,,,,,,,,,,735,,1360,,380,870,,605,,,1311,1326,1340,,,,,,,3,157,49,,,,1267,1198,
mean,730.5,56.89726,,70.049958,10516.828082,,,,,,,,,,,,,6.099315,5.575342,1971.267808,1984.865753,,,,,,103.685262,,,,,,,,443.639726,,46.549315,567.240411,1057.429452,,,,,1162.626712,346.992466,5.844521,1515.463699,0.425342,0.057534,1.565068,0.382877,2.866438,1.046575,,6.517808,,0.613014,,,1978.506164,,1.767123,472.980137,,,,94.244521,46.660274,21.95411,3.409589,15.060959,2.758904,,,,43.489041,6.321918,2007.815753,,,180921.19589
std,421.610009,42.300571,,24.284752,9981.264932,,,,,,,,,,,,,1.382997,1.112799,30.202904,20.645407,,,,,,181.066207,,,,,,,,456.098091,,161.319273,441.866955,438.705324,,,,,386.587738,436.528436,48.623081,525.480383,0.518911,0.238753,0.550916,0.502885,0.815778,0.220338,,1.625393,,0.644666,,,24.689725,,0.747315,213.804841,,,,125.338794,66.256028,61.119149,29.317331,55.757415,40.177307,,,,496.123024,2.703626,1.328095,,,79442.502883
min,1.0,20.0,,21.0,1300.0,,,,,,,,,,,,,1.0,1.0,1872.0,1950.0,,,,,,0.0,,,,,,,,0.0,,0.0,0.0,0.0,,,,,334.0,0.0,0.0,334.0,0.0,0.0,0.0,0.0,0.0,0.0,,2.0,,0.0,,,1900.0,,0.0,0.0,,,,0.0,0.0,0.0,0.0,0.0,0.0,,,,0.0,1.0,2006.0,,,34900.0
25%,365.75,20.0,,59.0,7553.5,,,,,,,,,,,,,5.0,5.0,1954.0,1967.0,,,,,,0.0,,,,,,,,0.0,,0.0,223.0,795.75,,,,,882.0,0.0,0.0,1129.5,0.0,0.0,1.0,0.0,2.0,1.0,,5.0,,0.0,,,1961.0,,1.0,334.5,,,,0.0,0.0,0.0,0.0,0.0,0.0,,,,0.0,5.0,2007.0,,,129975.0
50%,730.5,50.0,,69.0,9478.5,,,,,,,,,,,,,6.0,5.0,1973.0,1994.0,,,,,,0.0,,,,,,,,383.5,,0.0,477.5,991.5,,,,,1087.0,0.0,0.0,1464.0,0.0,0.0,2.0,0.0,3.0,1.0,,6.0,,1.0,,,1980.0,,2.0,480.0,,,,0.0,25.0,0.0,0.0,0.0,0.0,,,,0.0,6.0,2008.0,,,163000.0
75%,1095.25,70.0,,80.0,11601.5,,,,,,,,,,,,,7.0,6.0,2000.0,2004.0,,,,,,166.0,,,,,,,,712.25,,0.0,808.0,1298.25,,,,,1391.25,728.0,0.0,1776.75,1.0,0.0,2.0,1.0,3.0,1.0,,7.0,,1.0,,,2002.0,,2.0,576.0,,,,168.0,68.0,0.0,0.0,0.0,0.0,,,,0.0,8.0,2009.0,,,214000.0


In [16]:
houses.count()

Id               1460
MSSubClass       1460
MSZoning         1460
LotFrontage      1201
LotArea          1460
Street           1460
Alley              91
LotShape         1460
LandContour      1460
Utilities        1460
LotConfig        1460
LandSlope        1460
Neighborhood     1460
Condition1       1460
Condition2       1460
BldgType         1460
HouseStyle       1460
OverallQual      1460
OverallCond      1460
YearBuilt        1460
YearRemodAdd     1460
RoofStyle        1460
RoofMatl         1460
Exterior1st      1460
Exterior2nd      1460
MasVnrType       1452
MasVnrArea       1452
ExterQual        1460
ExterCond        1460
Foundation       1460
BsmtQual         1423
BsmtCond         1423
BsmtExposure     1422
BsmtFinType1     1423
BsmtFinSF1       1460
BsmtFinType2     1422
BsmtFinSF2       1460
BsmtUnfSF        1460
TotalBsmtSF      1460
Heating          1460
HeatingQC        1460
CentralAir       1460
Electrical       1459
1stFlrSF         1460
2ndFlrSF         1460
LowQualFin

In [17]:
# Find out the data type
houses.SalePrice.dtype

dtype('int64')

In [18]:
# Check whether the Series contains at least one missing value
houses.SalePrice.isnull().any()

False
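
The same check can be extended to a whole table. The sketch below counts missing values per column on a hypothetical toy frame whose column names are borrowed from the data set.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with some gaps.
df = pd.DataFrame({
    "LotFrontage": [65.0, np.nan, 68.0, np.nan],
    "Alley": [None, "Grvl", None, None],
    "SalePrice": [208500, 181500, 223500, 140000],
})

# isnull() gives a boolean table; summing counts the True values per column.
missing_per_column = df.isnull().sum()

# Keep only the columns that have at least one missing value.
columns_with_gaps = missing_per_column[missing_per_column > 0].index.tolist()

print(missing_per_column)
print(columns_with_gaps)
```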

## Column operations

- Vectorized operations on columns are fast
    - Columns can be added, multiplied, divided, and so on
    - Example: `houses.LotArea*0.09290304`
- For almost anything you want to do with columns in pandas, there is already a method 😉
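
A minimal sketch of vectorized arithmetic on a hypothetical two-row frame. The derived column names `LotAreaSqM` and `PricePerSqM` are invented for the illustration.

```python
import pandas as pd

SQFT_TO_SQM = 0.09290304  # one square foot in square meters

# Hypothetical toy data using two column names from the data set.
df = pd.DataFrame({"LotArea": [8450, 9600], "SalePrice": [208500, 181500]})

# A column times a scalar: applied to every row at once.
df["LotAreaSqM"] = df["LotArea"] * SQFT_TO_SQM

# A column divided by another column: applied row by row.
df["PricePerSqM"] = df["SalePrice"] / df["LotAreaSqM"]

# Many common operations exist as methods, for example rounding.
rounded = df["LotAreaSqM"].round(1)
print(df)
```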

In [19]:
# Convert square feet to square meters
houses.LotArea*0.09290304

0         785.030688
1         891.869184
2        1045.159200
3         887.224032
4        1324.797350
5        1311.326410
6         936.834255
7         964.519361
8         568.566605
9         689.340557
10       1040.514048
11       1107.775849
12       1204.766623
13        989.603182
14       1014.501197
15        568.566605
16       1044.323073
17       1002.516705
18       1272.307133
19        702.346982
20       1320.616714
21        692.034745
22        905.061416
23        392.422441
24        766.078468
25       1322.010259
26        668.901888
27       1066.341093
28       1516.270516
29        587.518825
30        789.675840
31        793.763574
32       1026.485689
33        980.312878
34        679.399932
35       1246.572991
36       1008.834111
37        792.648737
38        735.977883
39        561.134362
40        804.354520
41       1570.525891
42        852.849907
43        854.707968
44        738.114653
45        711.451480
46       1191.202779
47       1030

In [20]:
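# Create a new column with the square-bracket syntax (attribute assignment would not create a column)
# Caution: this cell divides by the factor, whereas the conversion in the cell above multiplies;
# the LotAreaMeters values in the outputs that follow come from the division and are not true square meters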
houses['LotAreaMeters'] = houses.LotArea/0.09290304

In [21]:
houses.head()

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice,LotAreaMeters
0,1,60,RL,65.0,8450,Pave,,Reg,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,1Fam,2Story,7,5,2003,2003,Gable,CompShg,VinylSd,VinylSd,BrkFace,196.0,Gd,TA,PConc,Gd,TA,No,GLQ,706,Unf,0,150,856,GasA,Ex,Y,SBrkr,856,854,0,1710,1,0,2,1,3,1,Gd,8,Typ,0,,Attchd,2003.0,RFn,2,548,TA,TA,Y,0,61,0,0,0,0,,,,0,2,2008,WD,Normal,208500,90955.043021
1,2,20,RL,80.0,9600,Pave,,Reg,Lvl,AllPub,FR2,Gtl,Veenker,Feedr,Norm,1Fam,1Story,6,8,1976,1976,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,CBlock,Gd,TA,Gd,ALQ,978,Unf,0,284,1262,GasA,Ex,Y,SBrkr,1262,0,0,1262,0,1,2,0,3,1,TA,6,Typ,1,TA,Attchd,1976.0,RFn,2,460,TA,TA,Y,298,0,0,0,0,0,,,,0,5,2007,WD,Normal,181500,103333.54
2,3,60,RL,68.0,11250,Pave,,IR1,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,1Fam,2Story,7,5,2001,2002,Gable,CompShg,VinylSd,VinylSd,BrkFace,162.0,Gd,TA,PConc,Gd,TA,Mn,GLQ,486,Unf,0,434,920,GasA,Ex,Y,SBrkr,920,866,0,1786,1,0,2,1,3,1,Gd,6,Typ,1,TA,Attchd,2001.0,RFn,2,608,TA,TA,Y,0,42,0,0,0,0,,,,0,9,2008,WD,Normal,223500,121093.992188
3,4,70,RL,60.0,9550,Pave,,IR1,Lvl,AllPub,Corner,Gtl,Crawfor,Norm,Norm,1Fam,2Story,7,5,1915,1970,Gable,CompShg,Wd Sdng,Wd Shng,,0.0,TA,TA,BrkTil,TA,Gd,No,ALQ,216,Unf,0,540,756,GasA,Gd,Y,SBrkr,961,756,0,1717,1,0,1,0,3,1,Gd,7,Typ,1,Gd,Detchd,1998.0,Unf,3,642,TA,TA,Y,0,35,272,0,0,0,,,,0,2,2006,WD,Abnorml,140000,102795.34448
4,5,60,RL,84.0,14260,Pave,,IR1,Lvl,AllPub,FR2,Gtl,NoRidge,Norm,Norm,1Fam,2Story,8,5,2000,2000,Gable,CompShg,VinylSd,VinylSd,BrkFace,350.0,Gd,TA,PConc,Gd,TA,Av,GLQ,655,Unf,0,490,1145,GasA,Ex,Y,SBrkr,1145,1053,0,2198,1,0,2,1,4,1,Gd,9,Typ,1,TA,Attchd,2000.0,RFn,3,836,TA,TA,Y,192,84,0,0,0,0,,,,0,12,2008,WD,Normal,250000,153493.362542


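## Filtering

```
data[condition]

data[(condition1) & (condition2)]
```
Each condition must be wrapped in parentheses, because `&` and `|` bind more tightly than comparison operators.

Supported operators

- `&` logical AND
- `|` logical OR
- `~` logical NOT

Examples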
```python
houses[houses.SalePrice < 100000]

houses[(houses.SalePrice < 100000) & (houses.OverallCond >= 8)]
```
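
The remaining operators follow the same pattern. The sketch below uses a hypothetical four-row frame to show `|`, `~`, and the `isin` method as a compact alternative to several OR-ed equality tests.

```python
import pandas as pd

# Hypothetical toy data using column names from the data set.
df = pd.DataFrame({
    "SalePrice": [90000, 250000, 84500, 140000],
    "OverallCond": [5, 5, 8, 9],
    "LotShape": ["Reg", "IR1", "Reg", "IR2"],
})

# OR: cheap houses or houses in very good condition.
cheap_or_great = df[(df.SalePrice < 100000) | (df.OverallCond >= 8)]

# NOT: everything except regular lots.
not_regular = df[~(df.LotShape == "Reg")]

# isin: membership in a list of allowed values.
irregular = df[df.LotShape.isin(["IR1", "IR2"])]

print(cheap_or_great)
```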

In [22]:
# Houses that sold for less than 100,000 USD
houses[houses.SalePrice < 100000]

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice,LotAreaMeters
17,18,90,RL,72.0,10791,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Sawyer,Norm,Norm,Duplex,1Story,4,5,1967,1967,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,Slab,,,,,0,,0,0,0,GasA,TA,Y,SBrkr,1296,0,0,1296,0,0,2,0,2,2,TA,6,Typ,0,,CarPort,1967.0,Unf,2,516,TA,TA,Y,0,0,0,0,0,0,,,Shed,500,10,2006,WD,Normal,90000,116153.357307
29,30,30,RM,60.0,6324,Pave,,IR1,Lvl,AllPub,Inside,Gtl,BrkSide,Feedr,RRNn,1Fam,1Story,4,6,1927,1950,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,BrkTil,TA,TA,No,Unf,0,Unf,0,520,520,GasA,Fa,N,SBrkr,520,0,0,520,0,0,1,0,1,1,Fa,4,Typ,0,,Detchd,1920.0,Unf,1,240,Fa,TA,Y,49,0,87,0,0,0,,,,0,5,2008,WD,Normal,68500,68070.969475
30,31,70,C (all),50.0,8500,Pave,Pave,Reg,Lvl,AllPub,Inside,Gtl,IDOTRR,Feedr,Norm,1Fam,2Story,4,4,1920,1950,Gambrel,CompShg,BrkFace,BrkFace,,0.0,TA,Fa,BrkTil,TA,TA,No,Unf,0,Unf,0,649,649,GasA,TA,N,SBrkr,649,668,0,1317,0,0,1,0,3,1,TA,6,Typ,0,,Detchd,1920.0,Unf,1,250,TA,Fa,N,0,54,172,0,0,0,,MnPrv,,0,7,2008,WD,Normal,40000,91493.238542
39,40,90,RL,65.0,6040,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Edwards,Norm,Norm,Duplex,1Story,4,5,1955,1955,Gable,CompShg,AsbShng,Plywood,,0.0,TA,TA,PConc,,,,,0,,0,0,0,GasA,TA,N,FuseP,1152,0,0,1152,0,0,2,0,2,2,Fa,6,Typ,0,,,,,0,0,,,N,0,0,0,0,0,0,,,,0,6,2008,WD,AdjLand,82000,65014.018917
68,69,30,RM,47.0,4608,Pave,,Reg,Lvl,AllPub,Corner,Gtl,OldTown,Artery,Norm,1Fam,1Story,4,6,1945,1950,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,Gd,CBlock,TA,TA,No,Unf,0,Unf,0,747,747,GasA,TA,Y,SBrkr,747,0,0,747,0,0,1,0,2,1,TA,4,Typ,0,,Attchd,1945.0,Unf,1,220,TA,TA,Y,0,0,0,0,0,0,,,,0,6,2010,WD,Normal,80000,49600.0992
75,76,180,RM,21.0,1596,Pave,,Reg,Lvl,AllPub,Inside,Gtl,MeadowV,Norm,Norm,Twnhs,SLvl,4,5,1973,1973,Gable,CompShg,CemntBd,CmentBd,,0.0,TA,TA,CBlock,Gd,TA,Gd,GLQ,462,Unf,0,0,462,GasA,TA,Y,SBrkr,526,462,0,988,1,0,1,0,2,1,TA,5,Typ,0,,BuiltIn,1973.0,Unf,1,297,TA,TA,Y,120,101,0,0,0,0,,GdWo,,0,11,2009,WD,Normal,91000,17179.201025
88,89,50,C (all),105.0,8470,Pave,,IR1,Lvl,AllPub,Corner,Gtl,IDOTRR,Feedr,Feedr,1Fam,1.5Fin,3,2,1915,1982,Hip,CompShg,Plywood,Plywood,,0.0,Fa,Fa,CBlock,TA,Fa,No,Unf,0,Unf,0,1013,1013,GasA,TA,N,SBrkr,1013,0,513,1526,0,0,1,0,2,1,Fa,6,Typ,0,,,,,0,0,,,N,0,0,156,0,0,0,,MnPrv,,0,10,2009,ConLD,Abnorml,85000,91170.32123
91,92,20,RL,85.0,8500,Pave,,Reg,Lvl,AllPub,Inside,Gtl,NAmes,Norm,Norm,1Fam,1Story,5,3,1961,1961,Hip,CompShg,HdBoard,HdBoard,BrkCmn,203.0,TA,TA,CBlock,TA,TA,No,Rec,600,Unf,0,635,1235,GasA,TA,Y,SBrkr,1235,0,0,1235,0,0,1,0,2,1,TA,6,Typ,0,,Attchd,1961.0,Unf,2,480,TA,TA,Y,0,0,0,0,0,0,,GdWo,,0,12,2006,WD,Abnorml,98600,91493.238542
97,98,20,RL,73.0,10921,Pave,,Reg,HLS,AllPub,Inside,Gtl,Edwards,Norm,Norm,1Fam,1Story,4,5,1965,1965,Hip,CompShg,HdBoard,HdBoard,BrkFace,48.0,TA,TA,CBlock,TA,TA,No,Rec,520,Unf,0,440,960,GasA,TA,Y,FuseF,960,0,0,960,1,0,1,0,3,1,TA,6,Typ,0,,Attchd,1965.0,Fin,1,432,TA,TA,P,120,0,0,0,0,0,,,,0,5,2007,WD,Normal,94750,117552.665661
98,99,30,RL,85.0,10625,Pave,,Reg,Lvl,AllPub,Corner,Gtl,Edwards,Norm,Norm,1Fam,1Story,5,5,1920,1950,Gable,CompShg,Wd Sdng,Wd Sdng,,0.0,TA,TA,BrkTil,TA,TA,No,ALQ,108,Unf,0,350,458,GasA,Fa,N,SBrkr,835,0,0,835,0,0,1,0,2,1,TA,5,Typ,0,,Basment,1920.0,Unf,1,366,Fa,TA,Y,0,0,77,0,0,0,,,Shed,400,5,2010,COD,Abnorml,83000,114366.548178


In [23]:
# Houses that sold for less than 100,000 USD and are in very good or excellent condition
houses[(houses.SalePrice < 100000) & (houses.OverallCond >= 8)]

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice,LotAreaMeters
125,126,190,RM,60.0,6780,Pave,,Reg,Lvl,AllPub,Inside,Gtl,IDOTRR,Norm,Norm,2fmCon,1.5Fin,6,8,1935,1982,Gable,CompShg,Wd Sdng,Wd Sdng,,0.0,TA,Fa,CBlock,TA,TA,Av,GLQ,490,Unf,0,30,520,GasA,Gd,N,SBrkr,520,0,234,754,1,0,1,0,2,1,TA,5,Typ,0,,,,,0,0,,,N,53,0,0,0,0,0,,,,0,6,2006,WD,Normal,84500,72979.312625
489,490,180,RM,21.0,1526,Pave,,Reg,Lvl,AllPub,Inside,Gtl,MeadowV,Norm,Norm,Twnhs,SFoyer,4,8,1970,2002,Gable,CompShg,CemntBd,CmentBd,,0.0,TA,Gd,CBlock,Gd,TA,Av,GLQ,515,Unf,0,115,630,GasA,TA,Y,SBrkr,630,0,0,630,1,0,1,0,1,1,Gd,3,Typ,0,,Attchd,1970.0,Unf,1,286,TA,TA,Y,0,0,0,0,0,0,,,,0,5,2009,WD,Normal,86000,16425.727296
494,495,30,RM,50.0,5784,Pave,,Reg,Lvl,AllPub,Inside,Gtl,OldTown,Artery,Norm,1Fam,1Story,5,8,1938,1996,Gable,CompShg,MetalSd,MetalSd,,0.0,TA,TA,BrkTil,Fa,TA,No,Unf,0,Unf,0,190,190,GasA,Gd,Y,FuseA,886,0,0,886,0,0,1,0,2,1,TA,4,Typ,0,,Attchd,1938.0,Unf,1,273,TA,TA,Y,144,20,80,0,0,0,,,,0,12,2009,WD,Normal,91300,62258.45785
1137,1138,50,RL,54.0,6342,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Sawyer,Feedr,Norm,1Fam,1.5Fin,5,8,1875,1996,Gable,CompShg,VinylSd,VinylSd,,0.0,TA,Gd,CBlock,TA,TA,No,Unf,0,Unf,0,780,780,GasA,Gd,N,SBrkr,780,240,0,1020,0,0,1,0,2,1,TA,6,Typ,0,,,,,0,0,,,N,0,0,176,0,0,0,,,,0,5,2010,WD,Normal,94000,68264.719863
1257,1258,30,RL,56.0,4060,Pave,,Reg,Lvl,AllPub,Corner,Gtl,Edwards,Feedr,Norm,1Fam,1Story,5,8,1922,1950,Gable,CompShg,Wd Sdng,Wd Sdng,,0.0,TA,TA,PConc,Fa,TA,No,Unf,0,Unf,0,864,864,GasA,Ex,Y,SBrkr,864,0,0,864,0,0,1,0,2,1,TA,4,Typ,0,,,,,0,0,,,Y,0,96,0,0,0,0,,,,0,7,2009,WD,Normal,99900,43701.476292


## Exercise

Select the houses that satisfy all of the following conditions:
- The lot area is more than 1000 square meters (`LotAreaMeters`)
- The lot has a regular shape (`LotShape`)
- The house was built in 2000 or later (`YearBuilt`)
- The price is at most 150,000 USD (`SalePrice`)

How many houses satisfy all of the conditions?

How many houses satisfy at least one condition?

In [24]:
conditions = (
    (houses.LotAreaMeters > 1000) &
    (houses.LotShape == 'Reg') &
    (houses.YearBuilt >= 2000) &
    (houses.SalePrice <= 150000)
)

In [25]:
# Apply the boolean mask built in the previous cell
houses[conditions]

Unnamed: 0,Id,MSSubClass,MSZoning,LotFrontage,LotArea,Street,Alley,LotShape,LandContour,Utilities,LotConfig,LandSlope,Neighborhood,Condition1,Condition2,BldgType,HouseStyle,OverallQual,OverallCond,YearBuilt,YearRemodAdd,RoofStyle,RoofMatl,Exterior1st,Exterior2nd,MasVnrType,MasVnrArea,ExterQual,ExterCond,Foundation,BsmtQual,BsmtCond,BsmtExposure,BsmtFinType1,BsmtFinSF1,BsmtFinType2,BsmtFinSF2,BsmtUnfSF,TotalBsmtSF,Heating,HeatingQC,CentralAir,Electrical,1stFlrSF,2ndFlrSF,LowQualFinSF,GrLivArea,BsmtFullBath,BsmtHalfBath,FullBath,HalfBath,BedroomAbvGr,KitchenAbvGr,KitchenQual,TotRmsAbvGrd,Functional,Fireplaces,FireplaceQu,GarageType,GarageYrBlt,GarageFinish,GarageCars,GarageArea,GarageQual,GarageCond,PavedDrive,WoodDeckSF,OpenPorchSF,EnclosedPorch,3SsnPorch,ScreenPorch,PoolArea,PoolQC,Fence,MiscFeature,MiscVal,MoSold,YrSold,SaleType,SaleCondition,SalePrice,LotAreaMeters
145,146,160,RM,24.0,2522,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Edwards,Norm,Norm,Twnhs,2Story,6,5,2004,2006,Gable,CompShg,VinylSd,VinylSd,Stone,50.0,Gd,TA,PConc,Gd,TA,No,Unf,0,Unf,0,970,970,GasA,Ex,Y,SBrkr,970,739,0,1709,0,0,2,0,3,1,Gd,7,Maj1,0,,Detchd,2004.0,Unf,2,380,TA,TA,Y,0,40,0,0,0,0,,,,0,4,2006,WD,Normal,130000,27146.582071
148,149,20,RL,63.0,7500,Pave,,Reg,Lvl,AllPub,Inside,Gtl,SawyerW,Norm,Norm,1Fam,1Story,7,5,2004,2005,Gable,CompShg,VinylSd,VinylSd,BrkFace,120.0,TA,TA,PConc,Gd,TA,No,GLQ,680,Unf,0,400,1080,GasA,Ex,Y,SBrkr,1080,0,0,1080,1,0,1,0,3,1,Gd,6,Typ,0,,,,,0,0,,,Y,0,0,0,0,0,0,,,,0,4,2008,WD,Normal,141000,80729.328125
193,194,160,RM,24.0,2522,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Edwards,Norm,Norm,Twnhs,2Story,7,5,2004,2004,Gable,CompShg,VinylSd,VinylSd,Stone,50.0,Gd,TA,PConc,Gd,TA,No,Unf,0,Unf,0,970,970,GasA,Ex,Y,SBrkr,970,739,0,1709,0,0,2,0,3,1,Gd,7,Maj1,0,,Detchd,2004.0,Unf,2,380,TA,TA,Y,0,40,0,0,0,0,,,,0,5,2006,WD,Normal,130000,27146.582071
200,201,20,RM,80.0,8546,Pave,,Reg,Lvl,AllPub,Corner,Gtl,Edwards,Norm,Norm,1Fam,1Story,4,5,2003,2004,Gable,CompShg,VinylSd,VinylSd,,0.0,TA,TA,PConc,Gd,TA,No,Unf,0,Unf,0,1121,1121,GasA,Ex,Y,SBrkr,1121,0,0,1121,0,0,2,0,2,1,TA,5,Typ,0,,Attchd,2003.0,RFn,2,440,TA,TA,Y,132,64,0,0,0,0,,,,0,3,2010,WD,Normal,140000,91988.378421
203,204,120,RM,,4438,Pave,,Reg,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,TwnhsE,1Story,6,5,2004,2004,Gable,CompShg,VinylSd,VinylSd,BrkFace,205.0,Gd,TA,PConc,Gd,TA,Av,GLQ,662,Unf,0,186,848,GasA,Ex,Y,SBrkr,848,0,0,848,1,0,1,0,1,1,Gd,3,Typ,1,Gd,Attchd,2004.0,RFn,2,420,TA,TA,Y,149,0,0,0,0,0,,,,0,1,2008,WD,Normal,149000,47770.234429
472,473,180,RM,35.0,3675,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Edwards,Norm,Norm,TwnhsE,SLvl,6,5,2005,2005,Gable,CompShg,VinylSd,VinylSd,BrkFace,80.0,TA,TA,PConc,Gd,TA,Gd,GLQ,459,Unf,0,88,547,GasA,Ex,Y,SBrkr,1072,0,0,1072,1,0,1,0,2,1,TA,5,Typ,0,,Basment,2005.0,RFn,2,525,TA,TA,Y,0,28,0,0,0,0,,,,0,6,2008,WD,Normal,148000,39557.370781
578,579,160,FV,34.0,3604,Pave,Pave,Reg,Lvl,AllPub,Corner,Gtl,Somerst,Norm,Norm,TwnhsE,2Story,7,5,2007,2007,Gable,CompShg,VinylSd,VinylSd,,0.0,Gd,TA,PConc,Gd,TA,No,Unf,0,Unf,0,689,689,GasA,Ex,Y,SBrkr,703,689,0,1392,0,0,2,0,2,1,Gd,5,Typ,0,,Detchd,2007.0,Unf,2,540,TA,TA,Y,0,102,0,0,0,0,,,,0,2,2008,WD,Abnorml,146000,38793.133142
593,594,120,RM,,4435,Pave,,Reg,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,TwnhsE,1Story,6,5,2003,2003,Gable,CompShg,VinylSd,VinylSd,BrkFace,170.0,Gd,TA,PConc,Gd,TA,Av,GLQ,685,Unf,0,163,848,GasA,Ex,Y,SBrkr,848,0,0,848,1,0,1,0,1,1,Gd,4,Typ,0,,Attchd,2003.0,Fin,2,420,TA,TA,Y,140,0,0,0,0,0,,,,0,5,2009,WD,Normal,140000,47737.942698
613,614,20,RL,70.0,8402,Pave,,Reg,Lvl,AllPub,Inside,Gtl,Mitchel,Feedr,Norm,1Fam,1Story,5,5,2007,2007,Gable,CompShg,VinylSd,VinylSd,,0.0,TA,TA,PConc,Gd,TA,No,ALQ,206,Unf,0,914,1120,GasA,Ex,Y,SBrkr,1120,0,0,1120,0,0,1,0,3,1,TA,6,Typ,0,,,,,0,0,,,Y,0,30,0,0,0,0,,,,0,12,2007,New,Partial,147000,90438.375321
690,691,120,RM,,4426,Pave,,Reg,Lvl,AllPub,Inside,Gtl,CollgCr,Norm,Norm,TwnhsE,1Story,6,5,2004,2004,Gable,CompShg,VinylSd,VinylSd,BrkFace,147.0,Gd,TA,PConc,Gd,TA,Gd,GLQ,697,Unf,0,151,848,GasA,Ex,Y,SBrkr,848,0,0,848,1,0,1,0,1,1,Gd,3,Typ,1,TA,Attchd,2004.0,RFn,2,420,TA,TA,Y,149,0,0,0,0,0,,,,0,5,2008,WD,Normal,141000,47641.067504
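
One way to answer the two counting questions is to sum the boolean masks, since `True` counts as 1. The sketch below runs on a hypothetical four-row frame, so its counts say nothing about the real data set; the mask names are invented for the illustration.

```python
import pandas as pd

# Hypothetical toy data; the values are invented for the illustration.
df = pd.DataFrame({
    "LotAreaMeters": [1200.0, 800.0, 1500.0, 900.0],
    "LotShape": ["Reg", "Reg", "IR1", "IR1"],
    "YearBuilt": [2004, 1990, 2001, 1950],
    "SalePrice": [140000, 120000, 300000, 200000],
})

# One named mask per condition keeps the logic readable.
big_lot = df.LotAreaMeters > 1000
regular = df.LotShape == "Reg"
modern = df.YearBuilt >= 2000
affordable = df.SalePrice <= 150000

# All conditions at once (AND) versus at least one (OR).
all_conditions = big_lot & regular & modern & affordable
any_condition = big_lot | regular | modern | affordable

n_all = int(all_conditions.sum())
n_any = int(any_condition.sum())
print(n_all, n_any)
```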


## Working with column names

We can work with the column names (`houses.columns`) much like a list of strings.

We can use an ordinary loop or a [list comprehension](https://docs.python.org/3/tutorial/datastructures.html#list-comprehensions).

We can also filter column names with the [`filter`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.filter.html) method, specifying `axis=1`.
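
Two further variations, sketched on a hypothetical one-row frame: `filter` with a regular expression, and the vectorized string methods available on `columns.str`. Renaming all columns with `rename(columns=str.lower)` is included as one more example of treating the names as strings.

```python
import pandas as pd

# Hypothetical toy frame using column names from the data set.
df = pd.DataFrame({
    "LotArea": [8450],
    "LotShape": ["Reg"],
    "GarageArea": [548],
    "SalePrice": [208500],
})

# filter with a regular expression: names that end in "Area".
area_cols = df.filter(regex="Area$", axis=1).columns.tolist()

# String methods on the column index: names that start with "Lot".
lot_cols = df.columns[df.columns.str.startswith("Lot")].tolist()

# A function passed to rename is applied to every column name.
renamed = df.rename(columns=str.lower)

print(area_cols, lot_cols, list(renamed.columns))
```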

In [26]:
lot_col = [col for col in houses.columns if ('Lot' in col)]
lot_col

['LotFrontage', 'LotArea', 'LotShape', 'LotConfig', 'LotAreaMeters']

In [27]:
houses[lot_col]

|  | LotFrontage | LotArea | LotShape | LotConfig | LotAreaMeters |
|---|---|---|---|---|---|
| 0 | 65.0 | 8450 | Reg | Inside | 9.095504e+04 |
| 1 | 80.0 | 9600 | Reg | FR2 | 1.033335e+05 |
| 2 | 68.0 | 11250 | IR1 | Inside | 1.210940e+05 |
| 3 | 60.0 | 9550 | IR1 | Corner | 1.027953e+05 |
| 4 | 84.0 | 14260 | IR1 | FR2 | 1.534934e+05 |
| 5 | 85.0 | 14115 | IR1 | Inside | 1.519326e+05 |
| 6 | 75.0 | 10084 | Reg | Inside | 1.085433e+05 |
| 7 | NaN | 10382 | IR1 | Corner | 1.117509e+05 |
| 8 | 51.0 | 6120 | Reg | Inside | 6.587513e+04 |
| 9 | 50.0 | 7420 | Reg | Corner | 7.986822e+04 |


In [28]:
houses.filter(like='Lot', axis=1)

Unnamed: 0,LotFrontage,LotArea,LotShape,LotConfig,LotAreaMeters
0,65.0,8450,Reg,Inside,9.095504e+04
1,80.0,9600,Reg,FR2,1.033335e+05
2,68.0,11250,IR1,Inside,1.210940e+05
3,60.0,9550,IR1,Corner,1.027953e+05
4,84.0,14260,IR1,FR2,1.534934e+05
5,85.0,14115,IR1,Inside,1.519326e+05
6,75.0,10084,Reg,Inside,1.085433e+05
7,,10382,IR1,Corner,1.117509e+05
8,51.0,6120,Reg,Inside,6.587513e+04
9,50.0,7420,Reg,Corner,7.986822e+04


## Aggregating data

We use the `groupby` method, which groups rows by one or more columns and returns a GroupBy object that can be iterated over.

We then apply a method (`count`, `mean`, `sum`) to this object.
To apply several methods at once, or a custom function, we use the `agg` method.

```python
houses.groupby('LotShape').SalePrice.mean()

houses.groupby(['LotShape', 'OverallCond']).SalePrice.mean()
```

[groupby documentation](https://pandas.pydata.org/pandas-docs/stable/user_guide/groupby.html)
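
As a hedged sketch of what `agg` with a custom function can look like: the `demo` frame, the helper `price_range`, and the result names `mean_price` and `n_houses` are all hypothetical and chosen only for illustration.

```python
import pandas as pd

# Toy frame standing in for `houses`; the values are invented for illustration.
demo = pd.DataFrame({
    'LotShape': ['Reg', 'Reg', 'IR1', 'IR1'],
    'SalePrice': [100000, 140000, 200000, 260000],
})

def price_range(prices):
    """Difference between the highest and the lowest price in a group."""
    return prices.max() - prices.min()

# Built-in aggregations are given by name, custom ones as functions;
# the custom column is labelled with the function's name.
summary = demo.groupby('LotShape').SalePrice.agg(['mean', 'count', price_range])

# Named aggregation: choose the output column names explicitly
named = demo.groupby('LotShape').agg(
    mean_price=('SalePrice', 'mean'),
    n_houses=('SalePrice', 'count'),
)

# Iterating over the grouped object yields (group key, sub-frame) pairs
group_sizes = {shape: len(group) for shape, group in demo.groupby('LotShape')}
```

Named aggregation is convenient when the result is used further, because the columns get readable names instead of a second header level.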

In [29]:
# Average sale price by lot shape
houses.groupby('LotShape').SalePrice.mean()

LotShape
IR1    206101.665289
IR2    239833.365854
IR3    216036.500000
Reg    164754.818378
Name: SalePrice, dtype: float64

In [30]:
# Average sale price by lot shape and overall condition
houses.groupby(['LotShape', 'OverallCond']).SalePrice.mean()

LotShape  OverallCond
IR1       1               61000.000000
          2               80750.000000
          3              136656.000000
          4              116488.888889
          5              226462.943144
          6              174803.602740
          7              176484.271429
          8              178263.157895
          9              245957.142857
IR2       5              234471.310345
          6              190000.000000
          7              295875.000000
          8              212500.000000
          9              475000.000000
IR3       3              163500.000000
          5              221873.888889
Reg       2              182810.666667
          3               91905.550000
          4              121178.979167
          5              186517.900826
          6              144181.942529
          7              144140.656489
          8              144998.529412
          9              182528.571429
Name: SalePrice, dtype: float64

In [31]:
# Average sale price by lot shape and overall condition,
# together with the number of houses in each category
houses.groupby(['LotShape', 'OverallCond']).SalePrice.agg(['mean', 'count'])

Unnamed: 0_level_0,Unnamed: 1_level_0,mean,count
LotShape,OverallCond,Unnamed: 2_level_1,Unnamed: 3_level_1
IR1,1,61000.0,1
IR1,2,80750.0,2
IR1,3,136656.0,4
IR1,4,116488.888889,9
IR1,5,226462.943144,299
IR1,6,174803.60274,73
IR1,7,176484.271429,70
IR1,8,178263.157895,19
IR1,9,245957.142857,7
IR2,5,234471.310345,29


## Exercise

Group the rows by roof style (`RoofStyle`) and exterior condition (`ExterCond`), and show the minimum price, the maximum price, and the number of houses.

In [32]:
houses.groupby(['RoofStyle', 'ExterCond']).SalePrice.agg(
    ['min', 'max', 'count']
)

Unnamed: 0_level_0,Unnamed: 1_level_0,min,max,count
RoofStyle,ExterCond,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1
Flat,Gd,129000,274970,4
Flat,TA,82000,256000,9
Gable,Ex,325000,325000,1
Gable,Fa,39300,169500,22
Gable,Gd,68400,475000,116
Gable,Po,76500,76500,1
Gable,TA,34900,755000,1001
Gambrel,Ex,161000,161000,1
Gambrel,Fa,40000,81000,2
Gambrel,Gd,103600,107400,2


## Iterating over rows

We iterate over a `DataFrame` with the [`iterrows`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.iterrows.html) method, which yields tuples: the first element is the row's index label, the second is the row's contents as a `Series`. This is handy, for example, for building labels for plots.

[Documentation for the `format` method](https://pyformat.info/).
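
A small sketch of what `iterrows` yields, on a made-up two-row frame (the `demo` data and the index labels `'a'` and `'b'` are assumptions for illustration). It also shows `itertuples`, a related pandas method that is usually faster, as an alternative the notebook does not use.

```python
import pandas as pd

# Toy frame standing in for `houses`; the values are invented for illustration.
demo = pd.DataFrame(
    {'SalePrice': [208500, 181500], 'LotAreaMeters': [785.03, 891.87]},
    index=['a', 'b'],
)

# iterrows yields (index label, row) pairs; each row is a pandas Series
labels = []
for idx, row in demo.iterrows():
    labels.append(f'{idx}: {row.SalePrice:.0f} USD, {row.LotAreaMeters:.0f} m^2')

# itertuples yields namedtuples; the index label is available as `Index`
labels_tuples = [
    f'{t.Index}: {t.SalePrice:.0f} USD, {t.LotAreaMeters:.0f} m^2'
    for t in demo.itertuples()
]
```

Because each row from `iterrows` is a single `Series`, values of different types are converted to one common dtype; `itertuples` keeps the original type of every column.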


In [33]:
text = []
for i, row in houses.iterrows():
    # :.0f formats the number with no decimal places
    text.append(
        'Cena (USD): {:.0f}, Plocha (m^2): {:.0f}'.format(
            row.SalePrice, row.LotAreaMeters
    ))
text

['Cena (USD): 208500, Plocha (m^2): 90955',
 'Cena (USD): 181500, Plocha (m^2): 103334',
 'Cena (USD): 223500, Plocha (m^2): 121094',
 'Cena (USD): 140000, Plocha (m^2): 102795',
 'Cena (USD): 250000, Plocha (m^2): 153493',
 'Cena (USD): 143000, Plocha (m^2): 151933',
 'Cena (USD): 307000, Plocha (m^2): 108543',
 'Cena (USD): 200000, Plocha (m^2): 111751',
 'Cena (USD): 129900, Plocha (m^2): 65875',
 'Cena (USD): 118000, Plocha (m^2): 79868',
 'Cena (USD): 129500, Plocha (m^2): 120556',
 'Cena (USD): 345000, Plocha (m^2): 128349',
 'Cena (USD): 144000, Plocha (m^2): 139586',
 'Cena (USD): 279500, Plocha (m^2): 114657',
 'Cena (USD): 157000, Plocha (m^2): 117542',
 'Cena (USD): 132000, Plocha (m^2): 65875',
 'Cena (USD): 149000, Plocha (m^2): 120997',
 'Cena (USD): 90000, Plocha (m^2): 116153',
 'Cena (USD): 159000, Plocha (m^2): 147412',
 'Cena (USD): 139000, Plocha (m^2): 81375',
 'Cena (USD): 325300, Plocha (m^2): 153009',
 'Cena (USD): 139400, Plocha (m^2): 80180',
 'Cena (USD): 230

In [34]:
text = []
for i, row in houses.iterrows():
    # {:} has no format spec, so the float is printed with full precision
    text.append(
        'Cena (USD): {:}, Plocha (m^2): {:}'.format(
            row.SalePrice, row.LotAreaMeters
    ))
text

['Cena (USD): 208500, Plocha (m^2): 90955.04302119715',
 'Cena (USD): 181500, Plocha (m^2): 103333.54000041333',
 'Cena (USD): 223500, Plocha (m^2): 121093.99218798438',
 'Cena (USD): 140000, Plocha (m^2): 102795.34447957783',
 'Cena (USD): 250000, Plocha (m^2): 153493.36254228064',
 'Cena (USD): 143000, Plocha (m^2): 151932.59553185772',
 'Cena (USD): 307000, Plocha (m^2): 108543.27264210083',
 'Cena (USD): 200000, Plocha (m^2): 111750.91794628033',
 'Cena (USD): 129900, Plocha (m^2): 65875.13175026349',
 'Cena (USD): 118000, Plocha (m^2): 79868.21529198614',
 'Cena (USD): 129500, Plocha (m^2): 120555.79666714888',
 'Cena (USD): 345000, Plocha (m^2): 128348.86780884671',
 'Cena (USD): 144000, Plocha (m^2): 139586.39028389167',
 'Cena (USD): 279500, Plocha (m^2): 114657.17375879195',
 'Cena (USD): 157000, Plocha (m^2): 117541.90175047016',
 'Cena (USD): 132000, Plocha (m^2): 65875.13175026349',
 'Cena (USD): 149000, Plocha (m^2): 120997.11699423398',
 'Cena (USD): 90000, Plocha (m^2): 

In [35]:
text = []
for i, row in houses.iterrows():
    # :.0f formats the number with no decimal places
    text.append(
        f'Cena (USD): {row.SalePrice:.0f}, Plocha (m^2): {row.LotAreaMeters:.0f}'
    )
text

['Cena (USD): 208500, Plocha (m^2): 90955',
 'Cena (USD): 181500, Plocha (m^2): 103334',
 'Cena (USD): 223500, Plocha (m^2): 121094',
 'Cena (USD): 140000, Plocha (m^2): 102795',
 'Cena (USD): 250000, Plocha (m^2): 153493',
 'Cena (USD): 143000, Plocha (m^2): 151933',
 'Cena (USD): 307000, Plocha (m^2): 108543',
 'Cena (USD): 200000, Plocha (m^2): 111751',
 'Cena (USD): 129900, Plocha (m^2): 65875',
 'Cena (USD): 118000, Plocha (m^2): 79868',
 'Cena (USD): 129500, Plocha (m^2): 120556',
 'Cena (USD): 345000, Plocha (m^2): 128349',
 'Cena (USD): 144000, Plocha (m^2): 139586',
 'Cena (USD): 279500, Plocha (m^2): 114657',
 'Cena (USD): 157000, Plocha (m^2): 117542',
 'Cena (USD): 132000, Plocha (m^2): 65875',
 'Cena (USD): 149000, Plocha (m^2): 120997',
 'Cena (USD): 90000, Plocha (m^2): 116153',
 'Cena (USD): 159000, Plocha (m^2): 147412',
 'Cena (USD): 139000, Plocha (m^2): 81375',
 'Cena (USD): 325300, Plocha (m^2): 153009',
 'Cena (USD): 139400, Plocha (m^2): 80180',
 'Cena (USD): 230

## Exercise

Create a list in which every element has the format `'Cena (CZK): {:.0f}, Plocha (m^2): {:.0f}'`.

To convert the price to Czech crowns, use an exchange rate of 22.55 CZK per USD.

In [36]:
houses['SalePriceCZK'] = houses.SalePrice * 22.55
text = []
for i, row in houses.iterrows():
    # :.0f formats the number with no decimal places
    text.append(
        'Cena (CZK): {:.0f}, Plocha (m^2): {:.0f}'.format(
            row.SalePriceCZK, row.LotAreaMeters
    ))
text

['Cena (CZK): 4701675, Plocha (m^2): 90955',
 'Cena (CZK): 4092825, Plocha (m^2): 103334',
 'Cena (CZK): 5039925, Plocha (m^2): 121094',
 'Cena (CZK): 3157000, Plocha (m^2): 102795',
 'Cena (CZK): 5637500, Plocha (m^2): 153493',
 'Cena (CZK): 3224650, Plocha (m^2): 151933',
 'Cena (CZK): 6922850, Plocha (m^2): 108543',
 'Cena (CZK): 4510000, Plocha (m^2): 111751',
 'Cena (CZK): 2929245, Plocha (m^2): 65875',
 'Cena (CZK): 2660900, Plocha (m^2): 79868',
 'Cena (CZK): 2920225, Plocha (m^2): 120556',
 'Cena (CZK): 7779750, Plocha (m^2): 128349',
 'Cena (CZK): 3247200, Plocha (m^2): 139586',
 'Cena (CZK): 6302725, Plocha (m^2): 114657',
 'Cena (CZK): 3540350, Plocha (m^2): 117542',
 'Cena (CZK): 2976600, Plocha (m^2): 65875',
 'Cena (CZK): 3359950, Plocha (m^2): 120997',
 'Cena (CZK): 2029500, Plocha (m^2): 116153',
 'Cena (CZK): 3585450, Plocha (m^2): 147412',
 'Cena (CZK): 3134450, Plocha (m^2): 81375',
 'Cena (CZK): 7335515, Plocha (m^2): 153009',
 'Cena (CZK): 3143470, Plocha (m^2): 80

In [37]:
text = []
for i, row in houses.iterrows():
    # :.0f formats the number with no decimal places
    text.append(
        'Cena (CZK): {:.0f}, Plocha (m^2): {:.0f}'.format(
            row.SalePrice*22.55, row.LotAreaMeters
    ))
text

['Cena (CZK): 4701675, Plocha (m^2): 90955',
 'Cena (CZK): 4092825, Plocha (m^2): 103334',
 'Cena (CZK): 5039925, Plocha (m^2): 121094',
 'Cena (CZK): 3157000, Plocha (m^2): 102795',
 'Cena (CZK): 5637500, Plocha (m^2): 153493',
 'Cena (CZK): 3224650, Plocha (m^2): 151933',
 'Cena (CZK): 6922850, Plocha (m^2): 108543',
 'Cena (CZK): 4510000, Plocha (m^2): 111751',
 'Cena (CZK): 2929245, Plocha (m^2): 65875',
 'Cena (CZK): 2660900, Plocha (m^2): 79868',
 'Cena (CZK): 2920225, Plocha (m^2): 120556',
 'Cena (CZK): 7779750, Plocha (m^2): 128349',
 'Cena (CZK): 3247200, Plocha (m^2): 139586',
 'Cena (CZK): 6302725, Plocha (m^2): 114657',
 'Cena (CZK): 3540350, Plocha (m^2): 117542',
 'Cena (CZK): 2976600, Plocha (m^2): 65875',
 'Cena (CZK): 3359950, Plocha (m^2): 120997',
 'Cena (CZK): 2029500, Plocha (m^2): 116153',
 'Cena (CZK): 3585450, Plocha (m^2): 147412',
 'Cena (CZK): 3134450, Plocha (m^2): 81375',
 'Cena (CZK): 7335515, Plocha (m^2): 153009',
 'Cena (CZK): 3143470, Plocha (m^2): 80

## Working with multi-level indexes

In [38]:
filipuv_experiment = houses.groupby(['RoofStyle', 'ExterCond']).agg(
    ['min', 'max', 'count']
)
filipuv_experiment

Unnamed: 0_level_0,Unnamed: 1_level_0,Id,Id,Id,MSSubClass,MSSubClass,MSSubClass,MSZoning,MSZoning,MSZoning,LotFrontage,LotFrontage,LotFrontage,LotArea,LotArea,LotArea,Street,Street,Street,LotShape,LotShape,LotShape,LandContour,LandContour,LandContour,Utilities,Utilities,Utilities,LotConfig,LotConfig,LotConfig,LandSlope,LandSlope,LandSlope,Neighborhood,Neighborhood,Neighborhood,Condition1,Condition1,Condition1,Condition2,Condition2,Condition2,BldgType,BldgType,BldgType,HouseStyle,HouseStyle,HouseStyle,OverallQual,OverallQual,OverallQual,OverallCond,OverallCond,OverallCond,YearBuilt,YearBuilt,YearBuilt,YearRemodAdd,YearRemodAdd,YearRemodAdd,RoofMatl,RoofMatl,RoofMatl,Exterior1st,Exterior1st,Exterior1st,Exterior2nd,Exterior2nd,Exterior2nd,MasVnrArea,MasVnrArea,MasVnrArea,ExterQual,ExterQual,ExterQual,Foundation,Foundation,Foundation,BsmtFinSF1,BsmtFinSF1,BsmtFinSF1,BsmtFinSF2,BsmtFinSF2,BsmtFinSF2,BsmtUnfSF,BsmtUnfSF,BsmtUnfSF,TotalBsmtSF,TotalBsmtSF,TotalBsmtSF,Heating,Heating,Heating,HeatingQC,HeatingQC,HeatingQC,CentralAir,CentralAir,CentralAir,1stFlrSF,1stFlrSF,1stFlrSF,2ndFlrSF,2ndFlrSF,2ndFlrSF,LowQualFinSF,LowQualFinSF,LowQualFinSF,GrLivArea,GrLivArea,GrLivArea,BsmtFullBath,BsmtFullBath,BsmtFullBath,BsmtHalfBath,BsmtHalfBath,BsmtHalfBath,FullBath,FullBath,FullBath,HalfBath,HalfBath,HalfBath,BedroomAbvGr,BedroomAbvGr,BedroomAbvGr,KitchenAbvGr,KitchenAbvGr,KitchenAbvGr,KitchenQual,KitchenQual,KitchenQual,TotRmsAbvGrd,TotRmsAbvGrd,TotRmsAbvGrd,Functional,Functional,Functional,Fireplaces,Fireplaces,Fireplaces,GarageYrBlt,GarageYrBlt,GarageYrBlt,GarageCars,GarageCars,GarageCars,GarageArea,GarageArea,GarageArea,PavedDrive,PavedDrive,PavedDrive,WoodDeckSF,WoodDeckSF,WoodDeckSF,OpenPorchSF,OpenPorchSF,OpenPorchSF,EnclosedPorch,EnclosedPorch,EnclosedPorch,3SsnPorch,3SsnPorch,3SsnPorch,ScreenPorch,ScreenPorch,ScreenPorch,PoolArea,PoolArea,PoolArea,MiscVal,MiscVal,MiscVal,MoSold,MoSold,MoSold,YrSold,YrSold,YrSold,SaleType,SaleType,SaleType,SaleCondition,SaleCondition,SaleCondition,SalePrice,SalePrice,SalePrice,LotAreaMeters,LotAreaMeters,LotAreaMeters,SalePriceCZK,SalePriceCZK,SalePriceCZK
Unnamed: 0_level_1,Unnamed: 1_level_1,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count,min,max,count
RoofStyle,ExterCond
Flat,Gd,467,1424,4,20,80,4,RL,RL,4,85.0,95.0,2,10628,19690,4,Pave,Pave,4,IR1,Reg,4,Lvl,Lvl,4,AllPub,AllPub,4,CulDSac,Inside,4,Gtl,Mod,4,Edwards,NAmes,4,Norm,Norm,4,Norm,Norm,4,1Fam,1Fam,4,1Story,SLvl,4,6,8,4,5,7,4,1965,1970,4,1966,1979,4,Tar&Grv,Tar&Grv,4,Plywood,Wd Sdng,4,Plywood,Wd Sdng,4,0.0,113.0,4,Gd,TA,4,CBlock,PConc,4,0,778,4,0,823,4,111,1043,4,697,2217,4,GasA,GasA,4,Ex,TA,4,Y,Y,4,1277,2217,4,0,915,4,0,0,4,1277,2380,4,0,1,4,0,0,4,1,2,4,0,1,4,2,4,4,1,1,4,Gd,TA,4,5,8,4,Sev,Typ,4,1,1,4,1965.0,1970.0,4,2,2,4,432,621,4,Y,Y,4,0,586,4,0,265,4,0,0,4,0,0,4,0,224,4,0,738,4,0,0,4,4,10,4,2006,2007,4,WD,WD,4,Abnorml,Normal,4,129000,274970,4,114398.839909,211941.4,4,2908950.0,6200573.5,4
Flat,TA,121,1001,9,20,90,9,RL,RL,9,35.0,313.0,5,9400,53227,9,Pave,Pave,9,IR1,Reg,9,HLS,Lvl,9,AllPub,AllPub,9,Corner,Inside,9,Gtl,Sev,9,ClearCr,NAmes,9,Norm,PosA,9,Norm,Norm,9,1Fam,Duplex,9,1Story,SLvl,9,3,7,9,3,8,9,1952,1975,9,1952,2007,9,CompShg,Tar&Grv,9,BrkComm,WdShing,9,Brk Cmn,Wd Sdng,9,0.0,250.0,9,TA,TA,9,CBlock,Slab,9,0,1324,9,0,1080,9,0,248,9,0,1602,9,GasA,GasW,9,Ex,TA,9,N,Y,9,944,2069,9,0,966,9,0,0,9,944,2126,9,0,2,9,0,2,9,1,2,9,0,1,9,1,4,9,0,1,9,Ex,TA,9,4,9,9,Min1,Typ,9,0,2,9,1954.0,1969.0,8,0,2,9,0,564,9,Y,Y,9,0,486,9,0,137,9,0,0,9,0,130,9,0,225,9,0,0,9,0,0,9,3,11,9,2006,2009,9,WD,WD,9,Abnorml,Partial,9,82000,256000,9,101180.757917,572930.7,9,1849100.0,5772800.0,9
Gable,Ex,584,584,1,75,75,1,RM,RM,1,75.0,75.0,1,13500,13500,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Artery,Artery,1,PosA,PosA,1,1Fam,1Fam,1,2.5Unf,2.5Unf,1,10,10,1,9,9,1,1893,1893,1,2000,2000,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Wd Sdng,Wd Sdng,1,0.0,0.0,1,Ex,Ex,1,BrkTil,BrkTil,1,0,0,1,0,0,1,1237,1237,1,1237,1237,1,GasA,GasA,1,Gd,Gd,1,Y,Y,1,1521,1521,1,1254,1254,1,0,0,1,2775,2775,1,0,0,1,0,0,1,3,3,1,1,1,1,3,3,1,1,1,1,Gd,Gd,1,9,9,1,Typ,Typ,1,1,1,1,1988.0,1988.0,1,2,2,1,880,880,1,Y,Y,1,105,105,1,502,502,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,7,7,1,2008,2008,1,WD,WD,1,Normal,Normal,1,325000,325000,1,145312.790626,145312.8,1,7328750.0,7328750.0,1
Gable,Fa,103,1381,22,20,190,22,RH,RM,22,45.0,98.0,19,5000,12180,22,Pave,Pave,22,IR1,Reg,22,Bnk,Lvl,22,AllPub,AllPub,22,Corner,Inside,22,Gtl,Sev,22,BrkSide,SawyerW,22,Artery,Norm,22,Norm,Norm,22,1Fam,Duplex,22,1.5Fin,2Story,22,1,7,22,1,8,22,1900,1979,22,1950,2000,22,CompShg,CompShg,22,AsbShng,Wd Sdng,22,AsbShng,Wd Shng,22,0.0,600.0,22,Fa,TA,22,BrkTil,Stone,22,0,903,22,0,0,22,0,1276,22,0,1276,22,GasA,Grav,22,Ex,TA,22,N,Y,22,334,1766,22,0,1020,22,0,514,22,334,2414,22,0,1,22,0,0,22,0,2,22,0,1,22,1,6,22,1,2,22,Fa,TA,22,2,10,22,Maj1,Typ,22,0,2,22,1910.0,2000.0,15,0,2,22,0,720,22,N,Y,22,0,250,22,0,184,22,0,216,22,0,0,22,0,184,22,0,0,22,0,0,22,1,11,22,2006,2010,22,COD,WD,22,Abnorml,Normal,22,39300,169500,22,53819.552084,131104.4,22,886215.0,3822225.0,22
Gable,Gd,25,1458,116,20,190,116,C (all),RM,116,21.0,174.0,90,1526,36500,116,Grvl,Pave,116,IR1,Reg,116,Bnk,Lvl,116,AllPub,AllPub,116,Corner,Inside,116,Gtl,Mod,116,Blueste,Veenker,116,Artery,RRAn,116,Feedr,RRNn,116,1Fam,TwnhsE,116,1.5Fin,SLvl,116,3,10,116,3,9,116,1872,1997,116,1950,2010,116,CompShg,WdShngl,116,AsbShng,WdShing,116,AsbShng,Wd Shng,116,0.0,1115.0,116,Ex,TA,116,BrkTil,Stone,116,0,1880,116,0,1061,116,0,1362,116,0,2223,116,GasA,OthW,116,Ex,TA,116,N,Y,116,575,2223,116,0,1518,116,0,572,116,630,3608,116,0,2,116,0,1,116,0,3,116,0,1,116,0,5,116,1,2,116,Ex,TA,116,3,12,116,Min1,Typ,116,0,2,116,1910.0,2007.0,108,0,4,116,0,1356,116,N,Y,116,0,550,116,0,547,116,0,552,116,0,216,116,0,480,116,0,512,116,0,2500,116,1,12,116,2006,2010,116,COD,WD,116,Abnorml,Normal,116,68400,475000,116,16425.727296,392882.7,116,1542420.0,10711250.0,116
Gable,Po,251,251,1,30,30,1,RL,RL,1,55.0,55.0,1,5350,5350,1,Pave,Pave,1,IR1,IR1,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,BrkSide,BrkSide,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,1Story,1Story,1,3,3,1,2,2,1,1940,1940,1,1966,1966,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Plywood,Plywood,1,0.0,0.0,1,TA,TA,1,CBlock,CBlock,1,0,0,1,0,0,1,728,728,1,728,728,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,1306,1306,1,0,0,1,0,0,1,1306,1306,1,0,0,1,0,0,1,1,1,1,0,0,1,3,3,1,1,1,1,Fa,Fa,1,6,6,1,Mod,Mod,1,0,0,1,,,0,0,0,1,0,0,1,Y,Y,1,263,263,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,450,450,1,5,5,1,2010,2010,1,WD,WD,1,Normal,Normal,1,76500,76500,1,57586.920729,57586.92,1,1725075.0,1725075.0,1
Gable,TA,1,1460,1001,20,190,1001,C (all),RM,1001,21.0,174.0,824,1300,164660,1001,Grvl,Pave,1001,IR1,Reg,1001,Bnk,Lvl,1001,AllPub,NoSeWa,1001,Corner,Inside,1001,Gtl,Sev,1001,Blmngtn,Veenker,1001,Artery,RRNn,1001,Artery,RRNn,1001,1Fam,TwnhsE,1001,1.5Fin,SLvl,1001,2,10,1001,2,9,1001,1880,2009,1001,1950,2010,1001,CompShg,WdShngl,1001,AsbShng,WdShing,1001,AsbShng,Wd Shng,1001,0.0,1170.0,994,Ex,TA,1001,BrkTil,Wood,1001,0,1810,1001,0,1474,1001,0,2121,1001,0,3206,1001,GasA,Wall,1001,Ex,TA,1001,N,Y,1001,438,2444,1001,0,1872,1001,0,528,1001,438,4316,1001,0,3,1001,0,2,1001,0,3,1001,0,2,1001,0,6,1001,1,3,1001,Ex,TA,1001,3,12,1001,Maj1,Typ,1001,0,3,1001,1906.0,2010.0,946,0,4,1001,0,1248,1001,N,Y,1001,0,857,1001,0,523,1001,0,330,1001,0,508,1001,0,440,1001,0,576,1001,0,2000,1001,1,12,1001,2006,2010,1001,COD,WD,1001,Abnorml,Partial,1001,34900,755000,1001,13993.083542,1772385.0,1001,786995.0,17025250.0,1001
Gambrel,Ex,509,509,1,70,70,1,RM,RM,1,60.0,60.0,1,9600,9600,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,2Story,2Story,1,7,7,1,9,9,1,1928,1928,1,2005,2005,1,CompShg,CompShg,1,MetalSd,MetalSd,1,MetalSd,MetalSd,1,0.0,0.0,1,TA,TA,1,BrkTil,BrkTil,1,141,141,1,0,0,1,548,548,1,689,689,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,689,689,1,689,689,1,0,0,1,1378,1378,1,0,0,1,0,0,1,2,2,1,0,0,1,3,3,1,1,1,1,Gd,Gd,1,7,7,1,Typ,Typ,1,1,1,1,1928.0,1928.0,1,2,2,1,360,360,1,N,N,1,0,0,1,0,0,1,116,116,1,0,0,1,0,0,1,0,0,1,0,0,1,10,10,1,2008,2008,1,WD,WD,1,Normal,Normal,1,161000,161000,1,103333.54,103333.5,1,3630550.0,3630550.0,1
Gambrel,Fa,31,387,2,50,70,2,C (all),RL,2,50.0,58.0,2,8410,8500,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,FR2,Inside,2,Gtl,Gtl,2,Edwards,IDOTRR,2,Feedr,Feedr,2,Norm,Norm,2,1Fam,1Fam,2,1.5Fin,2Story,2,4,5,2,3,4,2,1910,1920,2,1950,1996,2,CompShg,CompShg,2,BrkFace,Wd Sdng,2,BrkFace,VinylSd,2,0.0,0.0,2,TA,TA,2,BrkTil,PConc,2,0,0,2,0,0,2,649,658,2,649,658,2,GasA,GasA,2,TA,TA,2,N,Y,2,649,658,2,526,668,2,0,0,2,1184,1317,2,0,0,2,0,0,2,1,1,2,0,0,2,3,5,2,1,1,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1920.0,1920.0,1,0,1,2,0,250,2,N,N,2,0,0,2,54,151,2,0,172,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2006,2008,2,WD,WD,2,AdjLand,Normal,2,40000,81000,2,90524.486605,91493.24,2,902000.0,1826550.0,2
Gambrel,Gd,75,672,2,50,70,2,RH,RM,2,54.0,60.0,2,5790,6629,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,Corner,Inside,2,Gtl,Gtl,2,Edwards,OldTown,2,Artery,Norm,2,Norm,Norm,2,1Fam,1Fam,2,2Story,2Story,2,3,6,2,6,6,2,1915,1925,2,1950,1950,2,CompShg,CompShg,2,VinylSd,Wd Sdng,2,VinylSd,Wd Sdng,2,0.0,0.0,2,Gd,TA,2,BrkTil,CBlock,2,0,551,2,0,0,2,121,840,2,672,840,2,GasA,GasA,2,Gd,TA,2,N,N,2,697,840,2,672,765,2,0,0,2,1369,1605,2,0,1,2,0,0,2,2,2,2,0,0,2,3,3,2,1,2,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1915.0,1930.0,2,1,1,2,300,379,2,Y,Y,2,0,147,2,0,0,2,0,202,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2009,2010,2,WD,WD,2,Normal,Normal,2,103600,107400,2,62323.041313,71353.96,2,2336180.0,2421870.0,2


In [39]:
filipuv_experiment.columns = ["_".join(values) for values in filipuv_experiment.columns]
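
The cell above flattens the two-level column header produced by `agg`. A minimal sketch of the same idea on a made-up frame follows; `demo` and its values are assumptions for illustration, and the final `reset_index` step is an optional extra that the notebook does not perform.

```python
import pandas as pd

# Toy frame standing in for `houses`; the values are invented for illustration.
demo = pd.DataFrame({
    'RoofStyle': ['Flat', 'Flat', 'Gable'],
    'SalePrice': [129000, 274970, 325000],
    'LotArea': [10628, 19690, 13500],
})

agg = demo.groupby('RoofStyle').agg(['min', 'max'])

# agg.columns is a MultiIndex; iterating it yields tuples
# such as ('SalePrice', 'min')
tuples_before = list(agg.columns)

# Join the levels of each tuple into a single string.
# This assumes every level is a string; other types would need str() first.
agg.columns = ['_'.join(levels) for levels in agg.columns]

# Optional: turn the group key back into an ordinary column
flat = agg.reset_index()
```

After flattening, a column can be selected with a plain string such as `agg['SalePrice_max']` instead of a tuple.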

In [40]:
filipuv_experiment

Unnamed: 0_level_0,Unnamed: 1_level_0,Id_min,Id_max,Id_count,MSSubClass_min,MSSubClass_max,MSSubClass_count,MSZoning_min,MSZoning_max,MSZoning_count,LotFrontage_min,LotFrontage_max,LotFrontage_count,LotArea_min,LotArea_max,LotArea_count,Street_min,Street_max,Street_count,LotShape_min,LotShape_max,LotShape_count,LandContour_min,LandContour_max,LandContour_count,Utilities_min,Utilities_max,Utilities_count,LotConfig_min,LotConfig_max,LotConfig_count,LandSlope_min,LandSlope_max,LandSlope_count,Neighborhood_min,Neighborhood_max,Neighborhood_count,Condition1_min,Condition1_max,Condition1_count,Condition2_min,Condition2_max,Condition2_count,BldgType_min,BldgType_max,BldgType_count,HouseStyle_min,HouseStyle_max,HouseStyle_count,OverallQual_min,OverallQual_max,OverallQual_count,OverallCond_min,OverallCond_max,OverallCond_count,YearBuilt_min,YearBuilt_max,YearBuilt_count,YearRemodAdd_min,YearRemodAdd_max,YearRemodAdd_count,RoofMatl_min,RoofMatl_max,RoofMatl_count,Exterior1st_min,Exterior1st_max,Exterior1st_count,Exterior2nd_min,Exterior2nd_max,Exterior2nd_count,MasVnrArea_min,MasVnrArea_max,MasVnrArea_count,ExterQual_min,ExterQual_max,ExterQual_count,Foundation_min,Foundation_max,Foundation_count,BsmtFinSF1_min,BsmtFinSF1_max,BsmtFinSF1_count,BsmtFinSF2_min,BsmtFinSF2_max,BsmtFinSF2_count,BsmtUnfSF_min,BsmtUnfSF_max,BsmtUnfSF_count,TotalBsmtSF_min,TotalBsmtSF_max,TotalBsmtSF_count,Heating_min,Heating_max,Heating_count,HeatingQC_min,HeatingQC_max,HeatingQC_count,CentralAir_min,CentralAir_max,CentralAir_count,1stFlrSF_min,1stFlrSF_max,1stFlrSF_count,2ndFlrSF_min,2ndFlrSF_max,2ndFlrSF_count,LowQualFinSF_min,LowQualFinSF_max,LowQualFinSF_count,GrLivArea_min,GrLivArea_max,GrLivArea_count,BsmtFullBath_min,BsmtFullBath_max,BsmtFullBath_count,BsmtHalfBath_min,BsmtHalfBath_max,BsmtHalfBath_count,FullBath_min,FullBath_max,FullBath_count,HalfBath_min,HalfBath_max,HalfBath_count,BedroomAbvGr_min,BedroomAbvGr_max,BedroomAbvGr_count,KitchenAbvGr_min,KitchenAbvGr_max,KitchenAbvGr_count,KitchenQual_min,KitchenQual_max,KitchenQual_count,TotRmsAbvGrd_min,TotRmsAbvGrd_max,TotRmsAbvGrd_count,Functional_min,Functional_max,Functional_count,Fireplaces_min,Fireplaces_max,Fireplaces_count,GarageYrBlt_min,GarageYrBlt_max,GarageYrBlt_count,GarageCars_min,GarageCars_max,GarageCars_count,GarageArea_min,GarageArea_max,GarageArea_count,PavedDrive_min,PavedDrive_max,PavedDrive_count,WoodDeckSF_min,WoodDeckSF_max,WoodDeckSF_count,OpenPorchSF_min,OpenPorchSF_max,OpenPorchSF_count,EnclosedPorch_min,EnclosedPorch_max,EnclosedPorch_count,3SsnPorch_min,3SsnPorch_max,3SsnPorch_count,ScreenPorch_min,ScreenPorch_max,ScreenPorch_count,PoolArea_min,PoolArea_max,PoolArea_count,MiscVal_min,MiscVal_max,MiscVal_count,MoSold_min,MoSold_max,MoSold_count,YrSold_min,YrSold_max,YrSold_count,SaleType_min,SaleType_max,SaleType_count,SaleCondition_min,SaleCondition_max,SaleCondition_count,SalePrice_min,SalePrice_max,SalePrice_count,LotAreaMeters_min,LotAreaMeters_max,LotAreaMeters_count,SalePriceCZK_min,SalePriceCZK_max,SalePriceCZK_count
RoofStyle,ExterCond
Flat,Gd,467,1424,4,20,80,4,RL,RL,4,85.0,95.0,2,10628,19690,4,Pave,Pave,4,IR1,Reg,4,Lvl,Lvl,4,AllPub,AllPub,4,CulDSac,Inside,4,Gtl,Mod,4,Edwards,NAmes,4,Norm,Norm,4,Norm,Norm,4,1Fam,1Fam,4,1Story,SLvl,4,6,8,4,5,7,4,1965,1970,4,1966,1979,4,Tar&Grv,Tar&Grv,4,Plywood,Wd Sdng,4,Plywood,Wd Sdng,4,0.0,113.0,4,Gd,TA,4,CBlock,PConc,4,0,778,4,0,823,4,111,1043,4,697,2217,4,GasA,GasA,4,Ex,TA,4,Y,Y,4,1277,2217,4,0,915,4,0,0,4,1277,2380,4,0,1,4,0,0,4,1,2,4,0,1,4,2,4,4,1,1,4,Gd,TA,4,5,8,4,Sev,Typ,4,1,1,4,1965.0,1970.0,4,2,2,4,432,621,4,Y,Y,4,0,586,4,0,265,4,0,0,4,0,0,4,0,224,4,0,738,4,0,0,4,4,10,4,2006,2007,4,WD,WD,4,Abnorml,Normal,4,129000,274970,4,114398.839909,211941.4,4,2908950.0,6200573.5,4
Flat,TA,121,1001,9,20,90,9,RL,RL,9,35.0,313.0,5,9400,53227,9,Pave,Pave,9,IR1,Reg,9,HLS,Lvl,9,AllPub,AllPub,9,Corner,Inside,9,Gtl,Sev,9,ClearCr,NAmes,9,Norm,PosA,9,Norm,Norm,9,1Fam,Duplex,9,1Story,SLvl,9,3,7,9,3,8,9,1952,1975,9,1952,2007,9,CompShg,Tar&Grv,9,BrkComm,WdShing,9,Brk Cmn,Wd Sdng,9,0.0,250.0,9,TA,TA,9,CBlock,Slab,9,0,1324,9,0,1080,9,0,248,9,0,1602,9,GasA,GasW,9,Ex,TA,9,N,Y,9,944,2069,9,0,966,9,0,0,9,944,2126,9,0,2,9,0,2,9,1,2,9,0,1,9,1,4,9,0,1,9,Ex,TA,9,4,9,9,Min1,Typ,9,0,2,9,1954.0,1969.0,8,0,2,9,0,564,9,Y,Y,9,0,486,9,0,137,9,0,0,9,0,130,9,0,225,9,0,0,9,0,0,9,3,11,9,2006,2009,9,WD,WD,9,Abnorml,Partial,9,82000,256000,9,101180.757917,572930.7,9,1849100.0,5772800.0,9
Gable,Ex,584,584,1,75,75,1,RM,RM,1,75.0,75.0,1,13500,13500,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Artery,Artery,1,PosA,PosA,1,1Fam,1Fam,1,2.5Unf,2.5Unf,1,10,10,1,9,9,1,1893,1893,1,2000,2000,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Wd Sdng,Wd Sdng,1,0.0,0.0,1,Ex,Ex,1,BrkTil,BrkTil,1,0,0,1,0,0,1,1237,1237,1,1237,1237,1,GasA,GasA,1,Gd,Gd,1,Y,Y,1,1521,1521,1,1254,1254,1,0,0,1,2775,2775,1,0,0,1,0,0,1,3,3,1,1,1,1,3,3,1,1,1,1,Gd,Gd,1,9,9,1,Typ,Typ,1,1,1,1,1988.0,1988.0,1,2,2,1,880,880,1,Y,Y,1,105,105,1,502,502,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,7,7,1,2008,2008,1,WD,WD,1,Normal,Normal,1,325000,325000,1,145312.790626,145312.8,1,7328750.0,7328750.0,1
Gable,Fa,103,1381,22,20,190,22,RH,RM,22,45.0,98.0,19,5000,12180,22,Pave,Pave,22,IR1,Reg,22,Bnk,Lvl,22,AllPub,AllPub,22,Corner,Inside,22,Gtl,Sev,22,BrkSide,SawyerW,22,Artery,Norm,22,Norm,Norm,22,1Fam,Duplex,22,1.5Fin,2Story,22,1,7,22,1,8,22,1900,1979,22,1950,2000,22,CompShg,CompShg,22,AsbShng,Wd Sdng,22,AsbShng,Wd Shng,22,0.0,600.0,22,Fa,TA,22,BrkTil,Stone,22,0,903,22,0,0,22,0,1276,22,0,1276,22,GasA,Grav,22,Ex,TA,22,N,Y,22,334,1766,22,0,1020,22,0,514,22,334,2414,22,0,1,22,0,0,22,0,2,22,0,1,22,1,6,22,1,2,22,Fa,TA,22,2,10,22,Maj1,Typ,22,0,2,22,1910.0,2000.0,15,0,2,22,0,720,22,N,Y,22,0,250,22,0,184,22,0,216,22,0,0,22,0,184,22,0,0,22,0,0,22,1,11,22,2006,2010,22,COD,WD,22,Abnorml,Normal,22,39300,169500,22,53819.552084,131104.4,22,886215.0,3822225.0,22
Gable,Gd,25,1458,116,20,190,116,C (all),RM,116,21.0,174.0,90,1526,36500,116,Grvl,Pave,116,IR1,Reg,116,Bnk,Lvl,116,AllPub,AllPub,116,Corner,Inside,116,Gtl,Mod,116,Blueste,Veenker,116,Artery,RRAn,116,Feedr,RRNn,116,1Fam,TwnhsE,116,1.5Fin,SLvl,116,3,10,116,3,9,116,1872,1997,116,1950,2010,116,CompShg,WdShngl,116,AsbShng,WdShing,116,AsbShng,Wd Shng,116,0.0,1115.0,116,Ex,TA,116,BrkTil,Stone,116,0,1880,116,0,1061,116,0,1362,116,0,2223,116,GasA,OthW,116,Ex,TA,116,N,Y,116,575,2223,116,0,1518,116,0,572,116,630,3608,116,0,2,116,0,1,116,0,3,116,0,1,116,0,5,116,1,2,116,Ex,TA,116,3,12,116,Min1,Typ,116,0,2,116,1910.0,2007.0,108,0,4,116,0,1356,116,N,Y,116,0,550,116,0,547,116,0,552,116,0,216,116,0,480,116,0,512,116,0,2500,116,1,12,116,2006,2010,116,COD,WD,116,Abnorml,Normal,116,68400,475000,116,16425.727296,392882.7,116,1542420.0,10711250.0,116
Gable,Po,251,251,1,30,30,1,RL,RL,1,55.0,55.0,1,5350,5350,1,Pave,Pave,1,IR1,IR1,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,BrkSide,BrkSide,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,1Story,1Story,1,3,3,1,2,2,1,1940,1940,1,1966,1966,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Plywood,Plywood,1,0.0,0.0,1,TA,TA,1,CBlock,CBlock,1,0,0,1,0,0,1,728,728,1,728,728,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,1306,1306,1,0,0,1,0,0,1,1306,1306,1,0,0,1,0,0,1,1,1,1,0,0,1,3,3,1,1,1,1,Fa,Fa,1,6,6,1,Mod,Mod,1,0,0,1,,,0,0,0,1,0,0,1,Y,Y,1,263,263,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,450,450,1,5,5,1,2010,2010,1,WD,WD,1,Normal,Normal,1,76500,76500,1,57586.920729,57586.92,1,1725075.0,1725075.0,1
Gable,TA,1,1460,1001,20,190,1001,C (all),RM,1001,21.0,174.0,824,1300,164660,1001,Grvl,Pave,1001,IR1,Reg,1001,Bnk,Lvl,1001,AllPub,NoSeWa,1001,Corner,Inside,1001,Gtl,Sev,1001,Blmngtn,Veenker,1001,Artery,RRNn,1001,Artery,RRNn,1001,1Fam,TwnhsE,1001,1.5Fin,SLvl,1001,2,10,1001,2,9,1001,1880,2009,1001,1950,2010,1001,CompShg,WdShngl,1001,AsbShng,WdShing,1001,AsbShng,Wd Shng,1001,0.0,1170.0,994,Ex,TA,1001,BrkTil,Wood,1001,0,1810,1001,0,1474,1001,0,2121,1001,0,3206,1001,GasA,Wall,1001,Ex,TA,1001,N,Y,1001,438,2444,1001,0,1872,1001,0,528,1001,438,4316,1001,0,3,1001,0,2,1001,0,3,1001,0,2,1001,0,6,1001,1,3,1001,Ex,TA,1001,3,12,1001,Maj1,Typ,1001,0,3,1001,1906.0,2010.0,946,0,4,1001,0,1248,1001,N,Y,1001,0,857,1001,0,523,1001,0,330,1001,0,508,1001,0,440,1001,0,576,1001,0,2000,1001,1,12,1001,2006,2010,1001,COD,WD,1001,Abnorml,Partial,1001,34900,755000,1001,13993.083542,1772385.0,1001,786995.0,17025250.0,1001
Gambrel,Ex,509,509,1,70,70,1,RM,RM,1,60.0,60.0,1,9600,9600,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,2Story,2Story,1,7,7,1,9,9,1,1928,1928,1,2005,2005,1,CompShg,CompShg,1,MetalSd,MetalSd,1,MetalSd,MetalSd,1,0.0,0.0,1,TA,TA,1,BrkTil,BrkTil,1,141,141,1,0,0,1,548,548,1,689,689,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,689,689,1,689,689,1,0,0,1,1378,1378,1,0,0,1,0,0,1,2,2,1,0,0,1,3,3,1,1,1,1,Gd,Gd,1,7,7,1,Typ,Typ,1,1,1,1,1928.0,1928.0,1,2,2,1,360,360,1,N,N,1,0,0,1,0,0,1,116,116,1,0,0,1,0,0,1,0,0,1,0,0,1,10,10,1,2008,2008,1,WD,WD,1,Normal,Normal,1,161000,161000,1,103333.54,103333.5,1,3630550.0,3630550.0,1
Gambrel,Fa,31,387,2,50,70,2,C (all),RL,2,50.0,58.0,2,8410,8500,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,FR2,Inside,2,Gtl,Gtl,2,Edwards,IDOTRR,2,Feedr,Feedr,2,Norm,Norm,2,1Fam,1Fam,2,1.5Fin,2Story,2,4,5,2,3,4,2,1910,1920,2,1950,1996,2,CompShg,CompShg,2,BrkFace,Wd Sdng,2,BrkFace,VinylSd,2,0.0,0.0,2,TA,TA,2,BrkTil,PConc,2,0,0,2,0,0,2,649,658,2,649,658,2,GasA,GasA,2,TA,TA,2,N,Y,2,649,658,2,526,668,2,0,0,2,1184,1317,2,0,0,2,0,0,2,1,1,2,0,0,2,3,5,2,1,1,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1920.0,1920.0,1,0,1,2,0,250,2,N,N,2,0,0,2,54,151,2,0,172,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2006,2008,2,WD,WD,2,AdjLand,Normal,2,40000,81000,2,90524.486605,91493.24,2,902000.0,1826550.0,2
Gambrel,Gd,75,672,2,50,70,2,RH,RM,2,54.0,60.0,2,5790,6629,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,Corner,Inside,2,Gtl,Gtl,2,Edwards,OldTown,2,Artery,Norm,2,Norm,Norm,2,1Fam,1Fam,2,2Story,2Story,2,3,6,2,6,6,2,1915,1925,2,1950,1950,2,CompShg,CompShg,2,VinylSd,Wd Sdng,2,VinylSd,Wd Sdng,2,0.0,0.0,2,Gd,TA,2,BrkTil,CBlock,2,0,551,2,0,0,2,121,840,2,672,840,2,GasA,GasA,2,Gd,TA,2,N,N,2,697,840,2,672,765,2,0,0,2,1369,1605,2,0,1,2,0,0,2,2,2,2,0,0,2,3,3,2,1,2,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1915.0,1930.0,2,1,1,2,300,379,2,Y,Y,2,0,147,2,0,0,2,0,202,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2009,2010,2,WD,WD,2,Normal,Normal,2,103600,107400,2,62323.041313,71353.96,2,2336180.0,2421870.0,2


In [41]:
# Move the RoofStyle/ExterCond index levels back into ordinary columns
filipuv_experiment = filipuv_experiment.reset_index()

In [42]:
# Use a combined "RoofStyle_ExterCond" string as the row label
filipuv_experiment.index = filipuv_experiment.RoofStyle + "_" + filipuv_experiment.ExterCond

In [43]:
filipuv_experiment

Unnamed: 0,RoofStyle,ExterCond,Id_min,Id_max,Id_count,MSSubClass_min,MSSubClass_max,MSSubClass_count,MSZoning_min,MSZoning_max,MSZoning_count,LotFrontage_min,LotFrontage_max,LotFrontage_count,LotArea_min,LotArea_max,LotArea_count,Street_min,Street_max,Street_count,LotShape_min,LotShape_max,LotShape_count,LandContour_min,LandContour_max,LandContour_count,Utilities_min,Utilities_max,Utilities_count,LotConfig_min,LotConfig_max,LotConfig_count,LandSlope_min,LandSlope_max,LandSlope_count,Neighborhood_min,Neighborhood_max,Neighborhood_count,Condition1_min,Condition1_max,Condition1_count,Condition2_min,Condition2_max,Condition2_count,BldgType_min,BldgType_max,BldgType_count,HouseStyle_min,HouseStyle_max,HouseStyle_count,OverallQual_min,OverallQual_max,OverallQual_count,OverallCond_min,OverallCond_max,OverallCond_count,YearBuilt_min,YearBuilt_max,YearBuilt_count,YearRemodAdd_min,YearRemodAdd_max,YearRemodAdd_count,RoofMatl_min,RoofMatl_max,RoofMatl_count,Exterior1st_min,Exterior1st_max,Exterior1st_count,Exterior2nd_min,Exterior2nd_max,Exterior2nd_count,MasVnrArea_min,MasVnrArea_max,MasVnrArea_count,ExterQual_min,ExterQual_max,ExterQual_count,Foundation_min,Foundation_max,Foundation_count,BsmtFinSF1_min,BsmtFinSF1_max,BsmtFinSF1_count,BsmtFinSF2_min,BsmtFinSF2_max,BsmtFinSF2_count,BsmtUnfSF_min,BsmtUnfSF_max,BsmtUnfSF_count,TotalBsmtSF_min,TotalBsmtSF_max,TotalBsmtSF_count,Heating_min,Heating_max,Heating_count,HeatingQC_min,HeatingQC_max,HeatingQC_count,CentralAir_min,CentralAir_max,CentralAir_count,1stFlrSF_min,1stFlrSF_max,1stFlrSF_count,2ndFlrSF_min,2ndFlrSF_max,2ndFlrSF_count,LowQualFinSF_min,LowQualFinSF_max,LowQualFinSF_count,GrLivArea_min,GrLivArea_max,GrLivArea_count,BsmtFullBath_min,BsmtFullBath_max,BsmtFullBath_count,BsmtHalfBath_min,BsmtHalfBath_max,BsmtHalfBath_count,FullBath_min,FullBath_max,FullBath_count,HalfBath_min,HalfBath_max,HalfBath_count,BedroomAbvGr_min,BedroomAbvGr_max,BedroomAbvGr_count,KitchenAbvGr_min,KitchenAbvGr_max,KitchenAbvGr_count,KitchenQual_min,KitchenQual_max,KitchenQual_count,TotRmsAbvGrd_min,TotRmsAbvGrd_max,TotRmsAbvGrd_count,Functional_min,Functional_max,Functional_count,Fireplaces_min,Fireplaces_max,Fireplaces_count,GarageYrBlt_min,GarageYrBlt_max,GarageYrBlt_count,GarageCars_min,GarageCars_max,GarageCars_count,GarageArea_min,GarageArea_max,GarageArea_count,PavedDrive_min,PavedDrive_max,PavedDrive_count,WoodDeckSF_min,WoodDeckSF_max,WoodDeckSF_count,OpenPorchSF_min,OpenPorchSF_max,OpenPorchSF_count,EnclosedPorch_min,EnclosedPorch_max,EnclosedPorch_count,3SsnPorch_min,3SsnPorch_max,3SsnPorch_count,ScreenPorch_min,ScreenPorch_max,ScreenPorch_count,PoolArea_min,PoolArea_max,PoolArea_count,MiscVal_min,MiscVal_max,MiscVal_count,MoSold_min,MoSold_max,MoSold_count,YrSold_min,YrSold_max,YrSold_count,SaleType_min,SaleType_max,SaleType_count,SaleCondition_min,SaleCondition_max,SaleCondition_count,SalePrice_min,SalePrice_max,SalePrice_count,LotAreaMeters_min,LotAreaMeters_max,LotAreaMeters_count,SalePriceCZK_min,SalePriceCZK_max,SalePriceCZK_count
Flat_Gd,Flat,Gd,467,1424,4,20,80,4,RL,RL,4,85.0,95.0,2,10628,19690,4,Pave,Pave,4,IR1,Reg,4,Lvl,Lvl,4,AllPub,AllPub,4,CulDSac,Inside,4,Gtl,Mod,4,Edwards,NAmes,4,Norm,Norm,4,Norm,Norm,4,1Fam,1Fam,4,1Story,SLvl,4,6,8,4,5,7,4,1965,1970,4,1966,1979,4,Tar&Grv,Tar&Grv,4,Plywood,Wd Sdng,4,Plywood,Wd Sdng,4,0.0,113.0,4,Gd,TA,4,CBlock,PConc,4,0,778,4,0,823,4,111,1043,4,697,2217,4,GasA,GasA,4,Ex,TA,4,Y,Y,4,1277,2217,4,0,915,4,0,0,4,1277,2380,4,0,1,4,0,0,4,1,2,4,0,1,4,2,4,4,1,1,4,Gd,TA,4,5,8,4,Sev,Typ,4,1,1,4,1965.0,1970.0,4,2,2,4,432,621,4,Y,Y,4,0,586,4,0,265,4,0,0,4,0,0,4,0,224,4,0,738,4,0,0,4,4,10,4,2006,2007,4,WD,WD,4,Abnorml,Normal,4,129000,274970,4,114398.839909,211941.4,4,2908950.0,6200573.5,4
Flat_TA,Flat,TA,121,1001,9,20,90,9,RL,RL,9,35.0,313.0,5,9400,53227,9,Pave,Pave,9,IR1,Reg,9,HLS,Lvl,9,AllPub,AllPub,9,Corner,Inside,9,Gtl,Sev,9,ClearCr,NAmes,9,Norm,PosA,9,Norm,Norm,9,1Fam,Duplex,9,1Story,SLvl,9,3,7,9,3,8,9,1952,1975,9,1952,2007,9,CompShg,Tar&Grv,9,BrkComm,WdShing,9,Brk Cmn,Wd Sdng,9,0.0,250.0,9,TA,TA,9,CBlock,Slab,9,0,1324,9,0,1080,9,0,248,9,0,1602,9,GasA,GasW,9,Ex,TA,9,N,Y,9,944,2069,9,0,966,9,0,0,9,944,2126,9,0,2,9,0,2,9,1,2,9,0,1,9,1,4,9,0,1,9,Ex,TA,9,4,9,9,Min1,Typ,9,0,2,9,1954.0,1969.0,8,0,2,9,0,564,9,Y,Y,9,0,486,9,0,137,9,0,0,9,0,130,9,0,225,9,0,0,9,0,0,9,3,11,9,2006,2009,9,WD,WD,9,Abnorml,Partial,9,82000,256000,9,101180.757917,572930.7,9,1849100.0,5772800.0,9
Gable_Ex,Gable,Ex,584,584,1,75,75,1,RM,RM,1,75.0,75.0,1,13500,13500,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Artery,Artery,1,PosA,PosA,1,1Fam,1Fam,1,2.5Unf,2.5Unf,1,10,10,1,9,9,1,1893,1893,1,2000,2000,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Wd Sdng,Wd Sdng,1,0.0,0.0,1,Ex,Ex,1,BrkTil,BrkTil,1,0,0,1,0,0,1,1237,1237,1,1237,1237,1,GasA,GasA,1,Gd,Gd,1,Y,Y,1,1521,1521,1,1254,1254,1,0,0,1,2775,2775,1,0,0,1,0,0,1,3,3,1,1,1,1,3,3,1,1,1,1,Gd,Gd,1,9,9,1,Typ,Typ,1,1,1,1,1988.0,1988.0,1,2,2,1,880,880,1,Y,Y,1,105,105,1,502,502,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,7,7,1,2008,2008,1,WD,WD,1,Normal,Normal,1,325000,325000,1,145312.790626,145312.8,1,7328750.0,7328750.0,1
Gable_Fa,Gable,Fa,103,1381,22,20,190,22,RH,RM,22,45.0,98.0,19,5000,12180,22,Pave,Pave,22,IR1,Reg,22,Bnk,Lvl,22,AllPub,AllPub,22,Corner,Inside,22,Gtl,Sev,22,BrkSide,SawyerW,22,Artery,Norm,22,Norm,Norm,22,1Fam,Duplex,22,1.5Fin,2Story,22,1,7,22,1,8,22,1900,1979,22,1950,2000,22,CompShg,CompShg,22,AsbShng,Wd Sdng,22,AsbShng,Wd Shng,22,0.0,600.0,22,Fa,TA,22,BrkTil,Stone,22,0,903,22,0,0,22,0,1276,22,0,1276,22,GasA,Grav,22,Ex,TA,22,N,Y,22,334,1766,22,0,1020,22,0,514,22,334,2414,22,0,1,22,0,0,22,0,2,22,0,1,22,1,6,22,1,2,22,Fa,TA,22,2,10,22,Maj1,Typ,22,0,2,22,1910.0,2000.0,15,0,2,22,0,720,22,N,Y,22,0,250,22,0,184,22,0,216,22,0,0,22,0,184,22,0,0,22,0,0,22,1,11,22,2006,2010,22,COD,WD,22,Abnorml,Normal,22,39300,169500,22,53819.552084,131104.4,22,886215.0,3822225.0,22
Gable_Gd,Gable,Gd,25,1458,116,20,190,116,C (all),RM,116,21.0,174.0,90,1526,36500,116,Grvl,Pave,116,IR1,Reg,116,Bnk,Lvl,116,AllPub,AllPub,116,Corner,Inside,116,Gtl,Mod,116,Blueste,Veenker,116,Artery,RRAn,116,Feedr,RRNn,116,1Fam,TwnhsE,116,1.5Fin,SLvl,116,3,10,116,3,9,116,1872,1997,116,1950,2010,116,CompShg,WdShngl,116,AsbShng,WdShing,116,AsbShng,Wd Shng,116,0.0,1115.0,116,Ex,TA,116,BrkTil,Stone,116,0,1880,116,0,1061,116,0,1362,116,0,2223,116,GasA,OthW,116,Ex,TA,116,N,Y,116,575,2223,116,0,1518,116,0,572,116,630,3608,116,0,2,116,0,1,116,0,3,116,0,1,116,0,5,116,1,2,116,Ex,TA,116,3,12,116,Min1,Typ,116,0,2,116,1910.0,2007.0,108,0,4,116,0,1356,116,N,Y,116,0,550,116,0,547,116,0,552,116,0,216,116,0,480,116,0,512,116,0,2500,116,1,12,116,2006,2010,116,COD,WD,116,Abnorml,Normal,116,68400,475000,116,16425.727296,392882.7,116,1542420.0,10711250.0,116
Gable_Po,Gable,Po,251,251,1,30,30,1,RL,RL,1,55.0,55.0,1,5350,5350,1,Pave,Pave,1,IR1,IR1,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,BrkSide,BrkSide,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,1Story,1Story,1,3,3,1,2,2,1,1940,1940,1,1966,1966,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Plywood,Plywood,1,0.0,0.0,1,TA,TA,1,CBlock,CBlock,1,0,0,1,0,0,1,728,728,1,728,728,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,1306,1306,1,0,0,1,0,0,1,1306,1306,1,0,0,1,0,0,1,1,1,1,0,0,1,3,3,1,1,1,1,Fa,Fa,1,6,6,1,Mod,Mod,1,0,0,1,,,0,0,0,1,0,0,1,Y,Y,1,263,263,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,450,450,1,5,5,1,2010,2010,1,WD,WD,1,Normal,Normal,1,76500,76500,1,57586.920729,57586.92,1,1725075.0,1725075.0,1
Gable_TA,Gable,TA,1,1460,1001,20,190,1001,C (all),RM,1001,21.0,174.0,824,1300,164660,1001,Grvl,Pave,1001,IR1,Reg,1001,Bnk,Lvl,1001,AllPub,NoSeWa,1001,Corner,Inside,1001,Gtl,Sev,1001,Blmngtn,Veenker,1001,Artery,RRNn,1001,Artery,RRNn,1001,1Fam,TwnhsE,1001,1.5Fin,SLvl,1001,2,10,1001,2,9,1001,1880,2009,1001,1950,2010,1001,CompShg,WdShngl,1001,AsbShng,WdShing,1001,AsbShng,Wd Shng,1001,0.0,1170.0,994,Ex,TA,1001,BrkTil,Wood,1001,0,1810,1001,0,1474,1001,0,2121,1001,0,3206,1001,GasA,Wall,1001,Ex,TA,1001,N,Y,1001,438,2444,1001,0,1872,1001,0,528,1001,438,4316,1001,0,3,1001,0,2,1001,0,3,1001,0,2,1001,0,6,1001,1,3,1001,Ex,TA,1001,3,12,1001,Maj1,Typ,1001,0,3,1001,1906.0,2010.0,946,0,4,1001,0,1248,1001,N,Y,1001,0,857,1001,0,523,1001,0,330,1001,0,508,1001,0,440,1001,0,576,1001,0,2000,1001,1,12,1001,2006,2010,1001,COD,WD,1001,Abnorml,Partial,1001,34900,755000,1001,13993.083542,1772385.0,1001,786995.0,17025250.0,1001
Gambrel_Ex,Gambrel,Ex,509,509,1,70,70,1,RM,RM,1,60.0,60.0,1,9600,9600,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,2Story,2Story,1,7,7,1,9,9,1,1928,1928,1,2005,2005,1,CompShg,CompShg,1,MetalSd,MetalSd,1,MetalSd,MetalSd,1,0.0,0.0,1,TA,TA,1,BrkTil,BrkTil,1,141,141,1,0,0,1,548,548,1,689,689,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,689,689,1,689,689,1,0,0,1,1378,1378,1,0,0,1,0,0,1,2,2,1,0,0,1,3,3,1,1,1,1,Gd,Gd,1,7,7,1,Typ,Typ,1,1,1,1,1928.0,1928.0,1,2,2,1,360,360,1,N,N,1,0,0,1,0,0,1,116,116,1,0,0,1,0,0,1,0,0,1,0,0,1,10,10,1,2008,2008,1,WD,WD,1,Normal,Normal,1,161000,161000,1,103333.54,103333.5,1,3630550.0,3630550.0,1
Gambrel_Fa,Gambrel,Fa,31,387,2,50,70,2,C (all),RL,2,50.0,58.0,2,8410,8500,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,FR2,Inside,2,Gtl,Gtl,2,Edwards,IDOTRR,2,Feedr,Feedr,2,Norm,Norm,2,1Fam,1Fam,2,1.5Fin,2Story,2,4,5,2,3,4,2,1910,1920,2,1950,1996,2,CompShg,CompShg,2,BrkFace,Wd Sdng,2,BrkFace,VinylSd,2,0.0,0.0,2,TA,TA,2,BrkTil,PConc,2,0,0,2,0,0,2,649,658,2,649,658,2,GasA,GasA,2,TA,TA,2,N,Y,2,649,658,2,526,668,2,0,0,2,1184,1317,2,0,0,2,0,0,2,1,1,2,0,0,2,3,5,2,1,1,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1920.0,1920.0,1,0,1,2,0,250,2,N,N,2,0,0,2,54,151,2,0,172,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2006,2008,2,WD,WD,2,AdjLand,Normal,2,40000,81000,2,90524.486605,91493.24,2,902000.0,1826550.0,2
Gambrel_Gd,Gambrel,Gd,75,672,2,50,70,2,RH,RM,2,54.0,60.0,2,5790,6629,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,Corner,Inside,2,Gtl,Gtl,2,Edwards,OldTown,2,Artery,Norm,2,Norm,Norm,2,1Fam,1Fam,2,2Story,2Story,2,3,6,2,6,6,2,1915,1925,2,1950,1950,2,CompShg,CompShg,2,VinylSd,Wd Sdng,2,VinylSd,Wd Sdng,2,0.0,0.0,2,Gd,TA,2,BrkTil,CBlock,2,0,551,2,0,0,2,121,840,2,672,840,2,GasA,GasA,2,Gd,TA,2,N,N,2,697,840,2,672,765,2,0,0,2,1369,1605,2,0,1,2,0,0,2,2,2,2,0,0,2,3,3,2,1,2,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1915.0,1930.0,2,1,1,2,300,379,2,Y,Y,2,0,147,2,0,0,2,0,202,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2009,2010,2,WD,WD,2,Normal,Normal,2,103600,107400,2,62323.041313,71353.96,2,2336180.0,2421870.0,2
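
The two cells above first turn the group keys back into ordinary columns and then build a single text label for each row. Below is a minimal sketch of the same idea on a small toy table; the `toy`, `agg` and `flat` names and all values are made up for illustration and are not part of the housing data.

```python
import pandas as pd

# Toy data standing in for the housing table (values are made up)
toy = pd.DataFrame({
    "RoofStyle": ["Flat", "Flat", "Gable", "Gable"],
    "ExterCond": ["Gd", "TA", "Gd", "Gd"],
    "SalePrice": [100, 200, 300, 500],
})

# Grouping by two columns produces a two-level MultiIndex on the rows
agg = toy.groupby(["RoofStyle", "ExterCond"])["SalePrice"].agg(["min", "max", "count"])

# reset_index moves both index levels back into ordinary columns
flat = agg.reset_index()

# Build one string label per row from the two key columns
flat.index = flat["RoofStyle"] + "_" + flat["ExterCond"]

print(flat)
```

After this step a row can be selected by its combined label, for example `flat.loc["Gable_Gd"]`.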


In [44]:
# Drop the columns (returns a new DataFrame; filipuv_experiment itself is unchanged)
filipuv_experiment.drop(['RoofStyle', 'ExterCond'], axis=1)

Unnamed: 0,Id_min,Id_max,Id_count,MSSubClass_min,MSSubClass_max,MSSubClass_count,MSZoning_min,MSZoning_max,MSZoning_count,LotFrontage_min,LotFrontage_max,LotFrontage_count,LotArea_min,LotArea_max,LotArea_count,Street_min,Street_max,Street_count,LotShape_min,LotShape_max,LotShape_count,LandContour_min,LandContour_max,LandContour_count,Utilities_min,Utilities_max,Utilities_count,LotConfig_min,LotConfig_max,LotConfig_count,LandSlope_min,LandSlope_max,LandSlope_count,Neighborhood_min,Neighborhood_max,Neighborhood_count,Condition1_min,Condition1_max,Condition1_count,Condition2_min,Condition2_max,Condition2_count,BldgType_min,BldgType_max,BldgType_count,HouseStyle_min,HouseStyle_max,HouseStyle_count,OverallQual_min,OverallQual_max,OverallQual_count,OverallCond_min,OverallCond_max,OverallCond_count,YearBuilt_min,YearBuilt_max,YearBuilt_count,YearRemodAdd_min,YearRemodAdd_max,YearRemodAdd_count,RoofMatl_min,RoofMatl_max,RoofMatl_count,Exterior1st_min,Exterior1st_max,Exterior1st_count,Exterior2nd_min,Exterior2nd_max,Exterior2nd_count,MasVnrArea_min,MasVnrArea_max,MasVnrArea_count,ExterQual_min,ExterQual_max,ExterQual_count,Foundation_min,Foundation_max,Foundation_count,BsmtFinSF1_min,BsmtFinSF1_max,BsmtFinSF1_count,BsmtFinSF2_min,BsmtFinSF2_max,BsmtFinSF2_count,BsmtUnfSF_min,BsmtUnfSF_max,BsmtUnfSF_count,TotalBsmtSF_min,TotalBsmtSF_max,TotalBsmtSF_count,Heating_min,Heating_max,Heating_count,HeatingQC_min,HeatingQC_max,HeatingQC_count,CentralAir_min,CentralAir_max,CentralAir_count,1stFlrSF_min,1stFlrSF_max,1stFlrSF_count,2ndFlrSF_min,2ndFlrSF_max,2ndFlrSF_count,LowQualFinSF_min,LowQualFinSF_max,LowQualFinSF_count,GrLivArea_min,GrLivArea_max,GrLivArea_count,BsmtFullBath_min,BsmtFullBath_max,BsmtFullBath_count,BsmtHalfBath_min,BsmtHalfBath_max,BsmtHalfBath_count,FullBath_min,FullBath_max,FullBath_count,HalfBath_min,HalfBath_max,HalfBath_count,BedroomAbvGr_min,BedroomAbvGr_max,BedroomAbvGr_count,KitchenAbvGr_min,KitchenAbvGr_max,KitchenAbvGr_count,KitchenQual_min,KitchenQual_max,KitchenQual_count,TotRmsAbvGrd_min,TotRmsAbvGrd_max,TotRmsAbvGrd_count,Functional_min,Functional_max,Functional_count,Fireplaces_min,Fireplaces_max,Fireplaces_count,GarageYrBlt_min,GarageYrBlt_max,GarageYrBlt_count,GarageCars_min,GarageCars_max,GarageCars_count,GarageArea_min,GarageArea_max,GarageArea_count,PavedDrive_min,PavedDrive_max,PavedDrive_count,WoodDeckSF_min,WoodDeckSF_max,WoodDeckSF_count,OpenPorchSF_min,OpenPorchSF_max,OpenPorchSF_count,EnclosedPorch_min,EnclosedPorch_max,EnclosedPorch_count,3SsnPorch_min,3SsnPorch_max,3SsnPorch_count,ScreenPorch_min,ScreenPorch_max,ScreenPorch_count,PoolArea_min,PoolArea_max,PoolArea_count,MiscVal_min,MiscVal_max,MiscVal_count,MoSold_min,MoSold_max,MoSold_count,YrSold_min,YrSold_max,YrSold_count,SaleType_min,SaleType_max,SaleType_count,SaleCondition_min,SaleCondition_max,SaleCondition_count,SalePrice_min,SalePrice_max,SalePrice_count,LotAreaMeters_min,LotAreaMeters_max,LotAreaMeters_count,SalePriceCZK_min,SalePriceCZK_max,SalePriceCZK_count
Flat_Gd,467,1424,4,20,80,4,RL,RL,4,85.0,95.0,2,10628,19690,4,Pave,Pave,4,IR1,Reg,4,Lvl,Lvl,4,AllPub,AllPub,4,CulDSac,Inside,4,Gtl,Mod,4,Edwards,NAmes,4,Norm,Norm,4,Norm,Norm,4,1Fam,1Fam,4,1Story,SLvl,4,6,8,4,5,7,4,1965,1970,4,1966,1979,4,Tar&Grv,Tar&Grv,4,Plywood,Wd Sdng,4,Plywood,Wd Sdng,4,0.0,113.0,4,Gd,TA,4,CBlock,PConc,4,0,778,4,0,823,4,111,1043,4,697,2217,4,GasA,GasA,4,Ex,TA,4,Y,Y,4,1277,2217,4,0,915,4,0,0,4,1277,2380,4,0,1,4,0,0,4,1,2,4,0,1,4,2,4,4,1,1,4,Gd,TA,4,5,8,4,Sev,Typ,4,1,1,4,1965.0,1970.0,4,2,2,4,432,621,4,Y,Y,4,0,586,4,0,265,4,0,0,4,0,0,4,0,224,4,0,738,4,0,0,4,4,10,4,2006,2007,4,WD,WD,4,Abnorml,Normal,4,129000,274970,4,114398.839909,211941.4,4,2908950.0,6200573.5,4
Flat_TA,121,1001,9,20,90,9,RL,RL,9,35.0,313.0,5,9400,53227,9,Pave,Pave,9,IR1,Reg,9,HLS,Lvl,9,AllPub,AllPub,9,Corner,Inside,9,Gtl,Sev,9,ClearCr,NAmes,9,Norm,PosA,9,Norm,Norm,9,1Fam,Duplex,9,1Story,SLvl,9,3,7,9,3,8,9,1952,1975,9,1952,2007,9,CompShg,Tar&Grv,9,BrkComm,WdShing,9,Brk Cmn,Wd Sdng,9,0.0,250.0,9,TA,TA,9,CBlock,Slab,9,0,1324,9,0,1080,9,0,248,9,0,1602,9,GasA,GasW,9,Ex,TA,9,N,Y,9,944,2069,9,0,966,9,0,0,9,944,2126,9,0,2,9,0,2,9,1,2,9,0,1,9,1,4,9,0,1,9,Ex,TA,9,4,9,9,Min1,Typ,9,0,2,9,1954.0,1969.0,8,0,2,9,0,564,9,Y,Y,9,0,486,9,0,137,9,0,0,9,0,130,9,0,225,9,0,0,9,0,0,9,3,11,9,2006,2009,9,WD,WD,9,Abnorml,Partial,9,82000,256000,9,101180.757917,572930.7,9,1849100.0,5772800.0,9
Gable_Ex,584,584,1,75,75,1,RM,RM,1,75.0,75.0,1,13500,13500,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Artery,Artery,1,PosA,PosA,1,1Fam,1Fam,1,2.5Unf,2.5Unf,1,10,10,1,9,9,1,1893,1893,1,2000,2000,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Wd Sdng,Wd Sdng,1,0.0,0.0,1,Ex,Ex,1,BrkTil,BrkTil,1,0,0,1,0,0,1,1237,1237,1,1237,1237,1,GasA,GasA,1,Gd,Gd,1,Y,Y,1,1521,1521,1,1254,1254,1,0,0,1,2775,2775,1,0,0,1,0,0,1,3,3,1,1,1,1,3,3,1,1,1,1,Gd,Gd,1,9,9,1,Typ,Typ,1,1,1,1,1988.0,1988.0,1,2,2,1,880,880,1,Y,Y,1,105,105,1,502,502,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,7,7,1,2008,2008,1,WD,WD,1,Normal,Normal,1,325000,325000,1,145312.790626,145312.8,1,7328750.0,7328750.0,1
Gable_Fa,103,1381,22,20,190,22,RH,RM,22,45.0,98.0,19,5000,12180,22,Pave,Pave,22,IR1,Reg,22,Bnk,Lvl,22,AllPub,AllPub,22,Corner,Inside,22,Gtl,Sev,22,BrkSide,SawyerW,22,Artery,Norm,22,Norm,Norm,22,1Fam,Duplex,22,1.5Fin,2Story,22,1,7,22,1,8,22,1900,1979,22,1950,2000,22,CompShg,CompShg,22,AsbShng,Wd Sdng,22,AsbShng,Wd Shng,22,0.0,600.0,22,Fa,TA,22,BrkTil,Stone,22,0,903,22,0,0,22,0,1276,22,0,1276,22,GasA,Grav,22,Ex,TA,22,N,Y,22,334,1766,22,0,1020,22,0,514,22,334,2414,22,0,1,22,0,0,22,0,2,22,0,1,22,1,6,22,1,2,22,Fa,TA,22,2,10,22,Maj1,Typ,22,0,2,22,1910.0,2000.0,15,0,2,22,0,720,22,N,Y,22,0,250,22,0,184,22,0,216,22,0,0,22,0,184,22,0,0,22,0,0,22,1,11,22,2006,2010,22,COD,WD,22,Abnorml,Normal,22,39300,169500,22,53819.552084,131104.4,22,886215.0,3822225.0,22
Gable_Gd,25,1458,116,20,190,116,C (all),RM,116,21.0,174.0,90,1526,36500,116,Grvl,Pave,116,IR1,Reg,116,Bnk,Lvl,116,AllPub,AllPub,116,Corner,Inside,116,Gtl,Mod,116,Blueste,Veenker,116,Artery,RRAn,116,Feedr,RRNn,116,1Fam,TwnhsE,116,1.5Fin,SLvl,116,3,10,116,3,9,116,1872,1997,116,1950,2010,116,CompShg,WdShngl,116,AsbShng,WdShing,116,AsbShng,Wd Shng,116,0.0,1115.0,116,Ex,TA,116,BrkTil,Stone,116,0,1880,116,0,1061,116,0,1362,116,0,2223,116,GasA,OthW,116,Ex,TA,116,N,Y,116,575,2223,116,0,1518,116,0,572,116,630,3608,116,0,2,116,0,1,116,0,3,116,0,1,116,0,5,116,1,2,116,Ex,TA,116,3,12,116,Min1,Typ,116,0,2,116,1910.0,2007.0,108,0,4,116,0,1356,116,N,Y,116,0,550,116,0,547,116,0,552,116,0,216,116,0,480,116,0,512,116,0,2500,116,1,12,116,2006,2010,116,COD,WD,116,Abnorml,Normal,116,68400,475000,116,16425.727296,392882.7,116,1542420.0,10711250.0,116
Gable_Po,251,251,1,30,30,1,RL,RL,1,55.0,55.0,1,5350,5350,1,Pave,Pave,1,IR1,IR1,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,BrkSide,BrkSide,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,1Story,1Story,1,3,3,1,2,2,1,1940,1940,1,1966,1966,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Plywood,Plywood,1,0.0,0.0,1,TA,TA,1,CBlock,CBlock,1,0,0,1,0,0,1,728,728,1,728,728,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,1306,1306,1,0,0,1,0,0,1,1306,1306,1,0,0,1,0,0,1,1,1,1,0,0,1,3,3,1,1,1,1,Fa,Fa,1,6,6,1,Mod,Mod,1,0,0,1,,,0,0,0,1,0,0,1,Y,Y,1,263,263,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,450,450,1,5,5,1,2010,2010,1,WD,WD,1,Normal,Normal,1,76500,76500,1,57586.920729,57586.92,1,1725075.0,1725075.0,1
Gable_TA,1,1460,1001,20,190,1001,C (all),RM,1001,21.0,174.0,824,1300,164660,1001,Grvl,Pave,1001,IR1,Reg,1001,Bnk,Lvl,1001,AllPub,NoSeWa,1001,Corner,Inside,1001,Gtl,Sev,1001,Blmngtn,Veenker,1001,Artery,RRNn,1001,Artery,RRNn,1001,1Fam,TwnhsE,1001,1.5Fin,SLvl,1001,2,10,1001,2,9,1001,1880,2009,1001,1950,2010,1001,CompShg,WdShngl,1001,AsbShng,WdShing,1001,AsbShng,Wd Shng,1001,0.0,1170.0,994,Ex,TA,1001,BrkTil,Wood,1001,0,1810,1001,0,1474,1001,0,2121,1001,0,3206,1001,GasA,Wall,1001,Ex,TA,1001,N,Y,1001,438,2444,1001,0,1872,1001,0,528,1001,438,4316,1001,0,3,1001,0,2,1001,0,3,1001,0,2,1001,0,6,1001,1,3,1001,Ex,TA,1001,3,12,1001,Maj1,Typ,1001,0,3,1001,1906.0,2010.0,946,0,4,1001,0,1248,1001,N,Y,1001,0,857,1001,0,523,1001,0,330,1001,0,508,1001,0,440,1001,0,576,1001,0,2000,1001,1,12,1001,2006,2010,1001,COD,WD,1001,Abnorml,Partial,1001,34900,755000,1001,13993.083542,1772385.0,1001,786995.0,17025250.0,1001
Gambrel_Ex,509,509,1,70,70,1,RM,RM,1,60.0,60.0,1,9600,9600,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,2Story,2Story,1,7,7,1,9,9,1,1928,1928,1,2005,2005,1,CompShg,CompShg,1,MetalSd,MetalSd,1,MetalSd,MetalSd,1,0.0,0.0,1,TA,TA,1,BrkTil,BrkTil,1,141,141,1,0,0,1,548,548,1,689,689,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,689,689,1,689,689,1,0,0,1,1378,1378,1,0,0,1,0,0,1,2,2,1,0,0,1,3,3,1,1,1,1,Gd,Gd,1,7,7,1,Typ,Typ,1,1,1,1,1928.0,1928.0,1,2,2,1,360,360,1,N,N,1,0,0,1,0,0,1,116,116,1,0,0,1,0,0,1,0,0,1,0,0,1,10,10,1,2008,2008,1,WD,WD,1,Normal,Normal,1,161000,161000,1,103333.54,103333.5,1,3630550.0,3630550.0,1
Gambrel_Fa,31,387,2,50,70,2,C (all),RL,2,50.0,58.0,2,8410,8500,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,FR2,Inside,2,Gtl,Gtl,2,Edwards,IDOTRR,2,Feedr,Feedr,2,Norm,Norm,2,1Fam,1Fam,2,1.5Fin,2Story,2,4,5,2,3,4,2,1910,1920,2,1950,1996,2,CompShg,CompShg,2,BrkFace,Wd Sdng,2,BrkFace,VinylSd,2,0.0,0.0,2,TA,TA,2,BrkTil,PConc,2,0,0,2,0,0,2,649,658,2,649,658,2,GasA,GasA,2,TA,TA,2,N,Y,2,649,658,2,526,668,2,0,0,2,1184,1317,2,0,0,2,0,0,2,1,1,2,0,0,2,3,5,2,1,1,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1920.0,1920.0,1,0,1,2,0,250,2,N,N,2,0,0,2,54,151,2,0,172,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2006,2008,2,WD,WD,2,AdjLand,Normal,2,40000,81000,2,90524.486605,91493.24,2,902000.0,1826550.0,2
Gambrel_Gd,75,672,2,50,70,2,RH,RM,2,54.0,60.0,2,5790,6629,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,Corner,Inside,2,Gtl,Gtl,2,Edwards,OldTown,2,Artery,Norm,2,Norm,Norm,2,1Fam,1Fam,2,2Story,2Story,2,3,6,2,6,6,2,1915,1925,2,1950,1950,2,CompShg,CompShg,2,VinylSd,Wd Sdng,2,VinylSd,Wd Sdng,2,0.0,0.0,2,Gd,TA,2,BrkTil,CBlock,2,0,551,2,0,0,2,121,840,2,672,840,2,GasA,GasA,2,Gd,TA,2,N,N,2,697,840,2,672,765,2,0,0,2,1369,1605,2,0,1,2,0,0,2,2,2,2,0,0,2,3,3,2,1,2,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1915.0,1930.0,2,1,1,2,300,379,2,Y,Y,2,0,147,2,0,0,2,0,202,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2009,2010,2,WD,WD,2,Normal,Normal,2,103600,107400,2,62323.041313,71353.96,2,2336180.0,2421870.0,2
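
The output above is only the return value of `drop`; the original table still contains both columns. The sketch below illustrates this on a toy table (the `toy` frame and its values are made up) and also shows the `columns=` keyword, which pandas accepts as an alternative to passing labels with `axis=1`.

```python
import pandas as pd

# Toy table with the two key columns and one aggregated column (values are made up)
toy = pd.DataFrame({
    "RoofStyle": ["Flat", "Gable"],
    "ExterCond": ["Gd", "TA"],
    "Id_min": [467, 1],
})

# Both calls return a NEW DataFrame without the two columns
by_axis = toy.drop(["RoofStyle", "ExterCond"], axis=1)
by_keyword = toy.drop(columns=["RoofStyle", "ExterCond"])

print(list(by_axis.columns))       # columns of the returned copy
print(list(toy.columns))           # the original still has all three columns
print(by_axis.equals(by_keyword))  # the two spellings give the same result
```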


In [45]:
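# Drop the columns for good by modifying filipuv_experiment in place.
# Equivalent alternative using reassignment:
# filipuv_experiment = filipuv_experiment.drop(['RoofStyle', 'ExterCond'], axis=1)
filipuv_experiment.drop(['RoofStyle', 'ExterCond'], axis=1, inplace=True)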
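
The two variants in this cell lead to the same table. A minimal sketch on a toy frame (names and values are made up) shows one practical difference: with `inplace=True` the call returns `None`, so its result should not be assigned back to the variable.

```python
import pandas as pd

# Toy table (values are made up)
toy = pd.DataFrame({
    "RoofStyle": ["Flat", "Gable"],
    "ExterCond": ["Gd", "TA"],
    "Id_min": [467, 1],
})

# Variant 1: reassign the returned copy
reassigned = toy.copy()
reassigned = reassigned.drop(["RoofStyle", "ExterCond"], axis=1)

# Variant 2: modify the object itself; the call returns None
in_place = toy.copy()
result = in_place.drop(["RoofStyle", "ExterCond"], axis=1, inplace=True)

print(result)                       # None
print(in_place.equals(reassigned))  # both variants end with the same table
```

Writing `df = df.drop(..., inplace=True)` would therefore replace `df` with `None`.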

In [46]:
filipuv_experiment

Unnamed: 0,Id_min,Id_max,Id_count,MSSubClass_min,MSSubClass_max,MSSubClass_count,MSZoning_min,MSZoning_max,MSZoning_count,LotFrontage_min,LotFrontage_max,LotFrontage_count,LotArea_min,LotArea_max,LotArea_count,Street_min,Street_max,Street_count,LotShape_min,LotShape_max,LotShape_count,LandContour_min,LandContour_max,LandContour_count,Utilities_min,Utilities_max,Utilities_count,LotConfig_min,LotConfig_max,LotConfig_count,LandSlope_min,LandSlope_max,LandSlope_count,Neighborhood_min,Neighborhood_max,Neighborhood_count,Condition1_min,Condition1_max,Condition1_count,Condition2_min,Condition2_max,Condition2_count,BldgType_min,BldgType_max,BldgType_count,HouseStyle_min,HouseStyle_max,HouseStyle_count,OverallQual_min,OverallQual_max,OverallQual_count,OverallCond_min,OverallCond_max,OverallCond_count,YearBuilt_min,YearBuilt_max,YearBuilt_count,YearRemodAdd_min,YearRemodAdd_max,YearRemodAdd_count,RoofMatl_min,RoofMatl_max,RoofMatl_count,Exterior1st_min,Exterior1st_max,Exterior1st_count,Exterior2nd_min,Exterior2nd_max,Exterior2nd_count,MasVnrArea_min,MasVnrArea_max,MasVnrArea_count,ExterQual_min,ExterQual_max,ExterQual_count,Foundation_min,Foundation_max,Foundation_count,BsmtFinSF1_min,BsmtFinSF1_max,BsmtFinSF1_count,BsmtFinSF2_min,BsmtFinSF2_max,BsmtFinSF2_count,BsmtUnfSF_min,BsmtUnfSF_max,BsmtUnfSF_count,TotalBsmtSF_min,TotalBsmtSF_max,TotalBsmtSF_count,Heating_min,Heating_max,Heating_count,HeatingQC_min,HeatingQC_max,HeatingQC_count,CentralAir_min,CentralAir_max,CentralAir_count,1stFlrSF_min,1stFlrSF_max,1stFlrSF_count,2ndFlrSF_min,2ndFlrSF_max,2ndFlrSF_count,LowQualFinSF_min,LowQualFinSF_max,LowQualFinSF_count,GrLivArea_min,GrLivArea_max,GrLivArea_count,BsmtFullBath_min,BsmtFullBath_max,BsmtFullBath_count,BsmtHalfBath_min,BsmtHalfBath_max,BsmtHalfBath_count,FullBath_min,FullBath_max,FullBath_count,HalfBath_min,HalfBath_max,HalfBath_count,BedroomAbvGr_min,BedroomAbvGr_max,BedroomAbvGr_count,KitchenAbvGr_min,KitchenAbvGr_max,KitchenAbvGr_count,KitchenQual_min,KitchenQual_max,KitchenQual_count,TotRmsAbvGrd_min,TotRmsAbvGrd_max,TotRmsAbvGrd_count,Functional_min,Functional_max,Functional_count,Fireplaces_min,Fireplaces_max,Fireplaces_count,GarageYrBlt_min,GarageYrBlt_max,GarageYrBlt_count,GarageCars_min,GarageCars_max,GarageCars_count,GarageArea_min,GarageArea_max,GarageArea_count,PavedDrive_min,PavedDrive_max,PavedDrive_count,WoodDeckSF_min,WoodDeckSF_max,WoodDeckSF_count,OpenPorchSF_min,OpenPorchSF_max,OpenPorchSF_count,EnclosedPorch_min,EnclosedPorch_max,EnclosedPorch_count,3SsnPorch_min,3SsnPorch_max,3SsnPorch_count,ScreenPorch_min,ScreenPorch_max,ScreenPorch_count,PoolArea_min,PoolArea_max,PoolArea_count,MiscVal_min,MiscVal_max,MiscVal_count,MoSold_min,MoSold_max,MoSold_count,YrSold_min,YrSold_max,YrSold_count,SaleType_min,SaleType_max,SaleType_count,SaleCondition_min,SaleCondition_max,SaleCondition_count,SalePrice_min,SalePrice_max,SalePrice_count,LotAreaMeters_min,LotAreaMeters_max,LotAreaMeters_count,SalePriceCZK_min,SalePriceCZK_max,SalePriceCZK_count
Flat_Gd,467,1424,4,20,80,4,RL,RL,4,85.0,95.0,2,10628,19690,4,Pave,Pave,4,IR1,Reg,4,Lvl,Lvl,4,AllPub,AllPub,4,CulDSac,Inside,4,Gtl,Mod,4,Edwards,NAmes,4,Norm,Norm,4,Norm,Norm,4,1Fam,1Fam,4,1Story,SLvl,4,6,8,4,5,7,4,1965,1970,4,1966,1979,4,Tar&Grv,Tar&Grv,4,Plywood,Wd Sdng,4,Plywood,Wd Sdng,4,0.0,113.0,4,Gd,TA,4,CBlock,PConc,4,0,778,4,0,823,4,111,1043,4,697,2217,4,GasA,GasA,4,Ex,TA,4,Y,Y,4,1277,2217,4,0,915,4,0,0,4,1277,2380,4,0,1,4,0,0,4,1,2,4,0,1,4,2,4,4,1,1,4,Gd,TA,4,5,8,4,Sev,Typ,4,1,1,4,1965.0,1970.0,4,2,2,4,432,621,4,Y,Y,4,0,586,4,0,265,4,0,0,4,0,0,4,0,224,4,0,738,4,0,0,4,4,10,4,2006,2007,4,WD,WD,4,Abnorml,Normal,4,129000,274970,4,114398.839909,211941.4,4,2908950.0,6200573.5,4
Flat_TA,121,1001,9,20,90,9,RL,RL,9,35.0,313.0,5,9400,53227,9,Pave,Pave,9,IR1,Reg,9,HLS,Lvl,9,AllPub,AllPub,9,Corner,Inside,9,Gtl,Sev,9,ClearCr,NAmes,9,Norm,PosA,9,Norm,Norm,9,1Fam,Duplex,9,1Story,SLvl,9,3,7,9,3,8,9,1952,1975,9,1952,2007,9,CompShg,Tar&Grv,9,BrkComm,WdShing,9,Brk Cmn,Wd Sdng,9,0.0,250.0,9,TA,TA,9,CBlock,Slab,9,0,1324,9,0,1080,9,0,248,9,0,1602,9,GasA,GasW,9,Ex,TA,9,N,Y,9,944,2069,9,0,966,9,0,0,9,944,2126,9,0,2,9,0,2,9,1,2,9,0,1,9,1,4,9,0,1,9,Ex,TA,9,4,9,9,Min1,Typ,9,0,2,9,1954.0,1969.0,8,0,2,9,0,564,9,Y,Y,9,0,486,9,0,137,9,0,0,9,0,130,9,0,225,9,0,0,9,0,0,9,3,11,9,2006,2009,9,WD,WD,9,Abnorml,Partial,9,82000,256000,9,101180.757917,572930.7,9,1849100.0,5772800.0,9
Gable_Ex,584,584,1,75,75,1,RM,RM,1,75.0,75.0,1,13500,13500,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Artery,Artery,1,PosA,PosA,1,1Fam,1Fam,1,2.5Unf,2.5Unf,1,10,10,1,9,9,1,1893,1893,1,2000,2000,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Wd Sdng,Wd Sdng,1,0.0,0.0,1,Ex,Ex,1,BrkTil,BrkTil,1,0,0,1,0,0,1,1237,1237,1,1237,1237,1,GasA,GasA,1,Gd,Gd,1,Y,Y,1,1521,1521,1,1254,1254,1,0,0,1,2775,2775,1,0,0,1,0,0,1,3,3,1,1,1,1,3,3,1,1,1,1,Gd,Gd,1,9,9,1,Typ,Typ,1,1,1,1,1988.0,1988.0,1,2,2,1,880,880,1,Y,Y,1,105,105,1,502,502,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,7,7,1,2008,2008,1,WD,WD,1,Normal,Normal,1,325000,325000,1,145312.790626,145312.8,1,7328750.0,7328750.0,1
Gable_Fa,103,1381,22,20,190,22,RH,RM,22,45.0,98.0,19,5000,12180,22,Pave,Pave,22,IR1,Reg,22,Bnk,Lvl,22,AllPub,AllPub,22,Corner,Inside,22,Gtl,Sev,22,BrkSide,SawyerW,22,Artery,Norm,22,Norm,Norm,22,1Fam,Duplex,22,1.5Fin,2Story,22,1,7,22,1,8,22,1900,1979,22,1950,2000,22,CompShg,CompShg,22,AsbShng,Wd Sdng,22,AsbShng,Wd Shng,22,0.0,600.0,22,Fa,TA,22,BrkTil,Stone,22,0,903,22,0,0,22,0,1276,22,0,1276,22,GasA,Grav,22,Ex,TA,22,N,Y,22,334,1766,22,0,1020,22,0,514,22,334,2414,22,0,1,22,0,0,22,0,2,22,0,1,22,1,6,22,1,2,22,Fa,TA,22,2,10,22,Maj1,Typ,22,0,2,22,1910.0,2000.0,15,0,2,22,0,720,22,N,Y,22,0,250,22,0,184,22,0,216,22,0,0,22,0,184,22,0,0,22,0,0,22,1,11,22,2006,2010,22,COD,WD,22,Abnorml,Normal,22,39300,169500,22,53819.552084,131104.4,22,886215.0,3822225.0,22
Gable_Gd,25,1458,116,20,190,116,C (all),RM,116,21.0,174.0,90,1526,36500,116,Grvl,Pave,116,IR1,Reg,116,Bnk,Lvl,116,AllPub,AllPub,116,Corner,Inside,116,Gtl,Mod,116,Blueste,Veenker,116,Artery,RRAn,116,Feedr,RRNn,116,1Fam,TwnhsE,116,1.5Fin,SLvl,116,3,10,116,3,9,116,1872,1997,116,1950,2010,116,CompShg,WdShngl,116,AsbShng,WdShing,116,AsbShng,Wd Shng,116,0.0,1115.0,116,Ex,TA,116,BrkTil,Stone,116,0,1880,116,0,1061,116,0,1362,116,0,2223,116,GasA,OthW,116,Ex,TA,116,N,Y,116,575,2223,116,0,1518,116,0,572,116,630,3608,116,0,2,116,0,1,116,0,3,116,0,1,116,0,5,116,1,2,116,Ex,TA,116,3,12,116,Min1,Typ,116,0,2,116,1910.0,2007.0,108,0,4,116,0,1356,116,N,Y,116,0,550,116,0,547,116,0,552,116,0,216,116,0,480,116,0,512,116,0,2500,116,1,12,116,2006,2010,116,COD,WD,116,Abnorml,Normal,116,68400,475000,116,16425.727296,392882.7,116,1542420.0,10711250.0,116
Gable_Po,251,251,1,30,30,1,RL,RL,1,55.0,55.0,1,5350,5350,1,Pave,Pave,1,IR1,IR1,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,BrkSide,BrkSide,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,1Story,1Story,1,3,3,1,2,2,1,1940,1940,1,1966,1966,1,CompShg,CompShg,1,Wd Sdng,Wd Sdng,1,Plywood,Plywood,1,0.0,0.0,1,TA,TA,1,CBlock,CBlock,1,0,0,1,0,0,1,728,728,1,728,728,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,1306,1306,1,0,0,1,0,0,1,1306,1306,1,0,0,1,0,0,1,1,1,1,0,0,1,3,3,1,1,1,1,Fa,Fa,1,6,6,1,Mod,Mod,1,0,0,1,,,0,0,0,1,0,0,1,Y,Y,1,263,263,1,0,0,1,0,0,1,0,0,1,0,0,1,0,0,1,450,450,1,5,5,1,2010,2010,1,WD,WD,1,Normal,Normal,1,76500,76500,1,57586.920729,57586.92,1,1725075.0,1725075.0,1
Gable_TA,1,1460,1001,20,190,1001,C (all),RM,1001,21.0,174.0,824,1300,164660,1001,Grvl,Pave,1001,IR1,Reg,1001,Bnk,Lvl,1001,AllPub,NoSeWa,1001,Corner,Inside,1001,Gtl,Sev,1001,Blmngtn,Veenker,1001,Artery,RRNn,1001,Artery,RRNn,1001,1Fam,TwnhsE,1001,1.5Fin,SLvl,1001,2,10,1001,2,9,1001,1880,2009,1001,1950,2010,1001,CompShg,WdShngl,1001,AsbShng,WdShing,1001,AsbShng,Wd Shng,1001,0.0,1170.0,994,Ex,TA,1001,BrkTil,Wood,1001,0,1810,1001,0,1474,1001,0,2121,1001,0,3206,1001,GasA,Wall,1001,Ex,TA,1001,N,Y,1001,438,2444,1001,0,1872,1001,0,528,1001,438,4316,1001,0,3,1001,0,2,1001,0,3,1001,0,2,1001,0,6,1001,1,3,1001,Ex,TA,1001,3,12,1001,Maj1,Typ,1001,0,3,1001,1906.0,2010.0,946,0,4,1001,0,1248,1001,N,Y,1001,0,857,1001,0,523,1001,0,330,1001,0,508,1001,0,440,1001,0,576,1001,0,2000,1001,1,12,1001,2006,2010,1001,COD,WD,1001,Abnorml,Partial,1001,34900,755000,1001,13993.083542,1772385.0,1001,786995.0,17025250.0,1001
Gambrel_Ex,509,509,1,70,70,1,RM,RM,1,60.0,60.0,1,9600,9600,1,Pave,Pave,1,Reg,Reg,1,Lvl,Lvl,1,AllPub,AllPub,1,Inside,Inside,1,Gtl,Gtl,1,OldTown,OldTown,1,Norm,Norm,1,Norm,Norm,1,1Fam,1Fam,1,2Story,2Story,1,7,7,1,9,9,1,1928,1928,1,2005,2005,1,CompShg,CompShg,1,MetalSd,MetalSd,1,MetalSd,MetalSd,1,0.0,0.0,1,TA,TA,1,BrkTil,BrkTil,1,141,141,1,0,0,1,548,548,1,689,689,1,GasA,GasA,1,Ex,Ex,1,Y,Y,1,689,689,1,689,689,1,0,0,1,1378,1378,1,0,0,1,0,0,1,2,2,1,0,0,1,3,3,1,1,1,1,Gd,Gd,1,7,7,1,Typ,Typ,1,1,1,1,1928.0,1928.0,1,2,2,1,360,360,1,N,N,1,0,0,1,0,0,1,116,116,1,0,0,1,0,0,1,0,0,1,0,0,1,10,10,1,2008,2008,1,WD,WD,1,Normal,Normal,1,161000,161000,1,103333.54,103333.5,1,3630550.0,3630550.0,1
Gambrel_Fa,31,387,2,50,70,2,C (all),RL,2,50.0,58.0,2,8410,8500,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,FR2,Inside,2,Gtl,Gtl,2,Edwards,IDOTRR,2,Feedr,Feedr,2,Norm,Norm,2,1Fam,1Fam,2,1.5Fin,2Story,2,4,5,2,3,4,2,1910,1920,2,1950,1996,2,CompShg,CompShg,2,BrkFace,Wd Sdng,2,BrkFace,VinylSd,2,0.0,0.0,2,TA,TA,2,BrkTil,PConc,2,0,0,2,0,0,2,649,658,2,649,658,2,GasA,GasA,2,TA,TA,2,N,Y,2,649,658,2,526,668,2,0,0,2,1184,1317,2,0,0,2,0,0,2,1,1,2,0,0,2,3,5,2,1,1,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1920.0,1920.0,1,0,1,2,0,250,2,N,N,2,0,0,2,54,151,2,0,172,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2006,2008,2,WD,WD,2,AdjLand,Normal,2,40000,81000,2,90524.486605,91493.24,2,902000.0,1826550.0,2
Gambrel_Gd,75,672,2,50,70,2,RH,RM,2,54.0,60.0,2,5790,6629,2,Pave,Pave,2,Reg,Reg,2,Lvl,Lvl,2,AllPub,AllPub,2,Corner,Inside,2,Gtl,Gtl,2,Edwards,OldTown,2,Artery,Norm,2,Norm,Norm,2,1Fam,1Fam,2,2Story,2Story,2,3,6,2,6,6,2,1915,1925,2,1950,1950,2,CompShg,CompShg,2,VinylSd,Wd Sdng,2,VinylSd,Wd Sdng,2,0.0,0.0,2,Gd,TA,2,BrkTil,CBlock,2,0,551,2,0,0,2,121,840,2,672,840,2,GasA,GasA,2,Gd,TA,2,N,N,2,697,840,2,672,765,2,0,0,2,1369,1605,2,0,1,2,0,0,2,2,2,2,0,0,2,3,3,2,1,2,2,TA,TA,2,6,8,2,Typ,Typ,2,0,0,2,1915.0,1930.0,2,1,1,2,300,379,2,Y,Y,2,0,147,2,0,0,2,0,202,2,0,0,2,0,0,2,0,0,2,0,0,2,5,7,2,2009,2010,2,WD,WD,2,Normal,Normal,2,103600,107400,2,62323.041313,71353.96,2,2336180.0,2421870.0,2


## Exporting to a file

As with loading data, we will most often use [`to_csv`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_csv.html). Unlike `read_csv`, it is a method called on the `DataFrame` we want to save. There are also [other export functions](https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html).

In [47]:
# If we do not want the index written to the file, we set index=False
houses.to_csv('houses_processed.csv', index=False)
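The effect of `index=False` is easiest to see in a round trip. The following is a minimal sketch, assuming a small made-up `DataFrame` (the variable `df` and its two rows are illustrative, not the full `houses` data) and writing to an in-memory string instead of a file:

```python
import io

import pandas as pd

# Hypothetical toy data standing in for the `houses` DataFrame
df = pd.DataFrame({"Id": [1, 2], "SalePrice": [208500, 181500]})

# Without a path, to_csv returns the CSV text as a string.
# Default: the index is written as an extra first column without a name
with_index = df.to_csv()
# index=False: only the real columns are written
without_index = df.to_csv(index=False)

# Read both variants back and compare the resulting column names
cols_with_index = list(pd.read_csv(io.StringIO(with_index)).columns)
cols_without_index = list(pd.read_csv(io.StringIO(without_index)).columns)

print(cols_with_index)     # ['Unnamed: 0', 'Id', 'SalePrice']
print(cols_without_index)  # ['Id', 'SalePrice']
```

When the index was written, `read_csv` loads it back as an ordinary column named `Unnamed: 0`, which is usually not wanted for a default numeric index.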

# Working with files

[Dask](http://docs.dask.org/en/latest/dataframe.html) makes it possible to parallelize `DataFrame` processing.

Data from an SQL database can also be loaded, using [`read_sql`](https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html#io-sql).

`Pickle` is [not](https://www.benfrederickson.com/dont-pickle-your-data/) the most suitable format for storing data.
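As a rough illustration of `read_sql`, the sketch below builds a throwaway in-memory SQLite database with the standard-library `sqlite3` module and queries it into a `DataFrame`. The table name `houses` and its three rows are made up for this example; a real project would connect to its own database instead.

```python
import sqlite3

import pandas as pd

# Hypothetical in-memory database; nothing is written to disk
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE houses (Id INTEGER, SalePrice INTEGER)")
conn.executemany(
    "INSERT INTO houses VALUES (?, ?)",
    [(1, 208500), (2, 181500), (3, 223500)],
)
conn.commit()

# The query result is returned as a DataFrame;
# `params` fills the `?` placeholder safely
expensive = pd.read_sql(
    "SELECT Id, SalePrice FROM houses WHERE SalePrice > ? ORDER BY Id",
    conn,
    params=(200000,),
)
conn.close()

print(expensive)
```

Filtering in the SQL query rather than after loading keeps the transferred data small, which matters once the table no longer fits comfortably in memory.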