## Overview

This notebook shows how to create and query a table or DataFrame from a file that you uploaded to DBFS. [DBFS](https://docs.databricks.com/user-guide/dbfs-databricks-file-system.html) is the Databricks File System, which lets you store data for querying inside Databricks. This notebook assumes that the file you want to read is already in DBFS.

This notebook is written in **Python**, so the default cell type is Python. You can switch to a different language in a cell with the `%LANGUAGE` syntax; Python, Scala, SQL, and R are all supported.

## STAGING DATA
### Raw CSV -> Raw PARQUET (optional, depending on file size; affects the processing time needed for transformation)

In [3]:
# Sberbank Russian Housing Market
# https://www.kaggle.com/c/sberbank-russian-housing-market
# train.csv and macro.csv were used in this data engineering flow example

# File location and type
file_location = "/FileStore/tables/train.csv"
file_type = "csv"

# CSV options
infer_schema = "true"
first_row_is_header = "true"
delimiter = ","

# The applied options are for CSV files. For other file types, these will be ignored.
df1 = spark.read.format(file_type) \
  .option("inferSchema", infer_schema) \
  .option("header", first_row_is_header) \
  .option("sep", delimiter) \
  .load(file_location)

display(df1)

id,timestamp,full_sq,life_sq,floor,max_floor,material,build_year,num_room,kitch_sq,state,product_type,sub_area,area_m,raion_popul,green_zone_part,indust_part,children_preschool,preschool_quota,preschool_education_centers_raion,children_school,school_quota,school_education_centers_raion,school_education_centers_top_20_raion,hospital_beds_raion,healthcare_centers_raion,university_top_20_raion,sport_objects_raion,additional_education_raion,culture_objects_top_25,culture_objects_top_25_raion,shopping_centers_raion,office_raion,thermal_power_plant_raion,incineration_raion,oil_chemistry_raion,radiation_raion,railroad_terminal_raion,big_market_raion,nuclear_reactor_raion,detention_facility_raion,full_all,male_f,female_f,young_all,young_male,young_female,work_all,work_male,work_female,ekder_all,ekder_male,ekder_female,0_6_all,0_6_male,0_6_female,7_14_all,7_14_male,7_14_female,0_17_all,0_17_male,0_17_female,16_29_all,16_29_male,16_29_female,0_13_all,0_13_male,0_13_female,raion_build_count_with_material_info,build_count_block,build_count_wood,build_count_frame,build_count_brick,build_count_monolith,build_count_panel,build_count_foam,build_count_slag,build_count_mix,raion_build_count_with_builddate_info,build_count_before_1920,build_count_1921-1945,build_count_1946-1970,build_count_1971-1995,build_count_after_1995,ID_metro,metro_min_avto,metro_km_avto,metro_min_walk,metro_km_walk,kindergarten_km,school_km,park_km,green_zone_km,industrial_km,water_treatment_km,cemetery_km,incineration_km,railroad_station_walk_km,railroad_station_walk_min,ID_railroad_station_walk,railroad_station_avto_km,railroad_station_avto_min,ID_railroad_station_avto,public_transport_station_km,public_transport_station_min_walk,water_km,water_1line,mkad_km,ttk_km,sadovoe_km,bulvar_ring_km,kremlin_km,big_road1_km,ID_big_road1,big_road1_1line,big_road2_km,ID_big_road2,railroad_km,railroad_1line,zd_vokzaly_avto_km,ID_railroad_terminal,bus_terminal_avto_km,ID_bus_terminal,oil_chemistry_km,nuclear_reactor_km,radiation_km,power_transmission_line_km,thermal_power_plant_km,ts_km,big_market_km,market_shop_km,fitness_km,swim_pool_km,ice_rink_km,stadium_km,basketball_km,hospice_morgue_km,detention_facility_km,public_healthcare_km,university_km,workplaces_km,shopping_centers_km,office_km,additional_education_km,preschool_km,big_church_km,church_synagogue_km,mosque_km,theater_km,museum_km,exhibition_km,catering_km,ecology,green_part_500,prom_part_500,office_count_500,office_sqm_500,trc_count_500,trc_sqm_500,cafe_count_500,cafe_sum_500_min_price_avg,cafe_sum_500_max_price_avg,cafe_avg_price_500,cafe_count_500_na_price,cafe_count_500_price_500,cafe_count_500_price_1000,cafe_count_500_price_1500,cafe_count_500_price_2500,cafe_count_500_price_4000,cafe_count_500_price_high,big_church_count_500,church_count_500,mosque_count_500,leisure_count_500,sport_count_500,market_count_500,green_part_1000,prom_part_1000,office_count_1000,office_sqm_1000,trc_count_1000,trc_sqm_1000,cafe_count_1000,cafe_sum_1000_min_price_avg,cafe_sum_1000_max_price_avg,cafe_avg_price_1000,cafe_count_1000_na_price,cafe_count_1000_price_500,cafe_count_1000_price_1000,cafe_count_1000_price_1500,cafe_count_1000_price_2500,cafe_count_1000_price_4000,cafe_count_1000_price_high,big_church_count_1000,church_count_1000,mosque_count_1000,leisure_count_1000,sport_count_1000,market_count_1000,green_part_1500,prom_part_1500,office_count_1500,office_sqm_1500,trc_count_1500,trc_sqm_1500,cafe_count_1500,cafe_sum_1500_min_price_avg,cafe_sum_1500_max_price_avg,cafe_avg_price_1500,cafe_count_1500_na_price,cafe_count_1500_price_500,cafe_count_1500_price_1000,cafe_count_1500_price_1500,cafe_count_1500_price_2500,cafe_count_1500_price_4000,cafe_count_1500_price_high,big_church_count_1500,church_count_1500,mosque_count_1500,leisure_count_1500,sport_count_1500,market_count_1500,green_part_2000,prom_part_2000,office_count_2000,office_sqm_2000,trc_count_2000,trc_sqm_2000,cafe_count_2000,cafe_sum_2000_min_price_avg,cafe_sum_2000_max_price_avg,cafe_avg_price_2000,cafe_count_2000_na_price,cafe_count_2000_price_500,cafe_count_2000_price_1000,cafe_count_2000_price_1500,cafe_count_2000_price_2500,cafe_count_2000_price_4000,cafe_count_2000_price_high,big_church_count_2000,church_count_2000,mosque_count_2000,leisure_count_2000,sport_count_2000,market_count_2000,green_part_3000,prom_part_3000,office_count_3000,office_sqm_3000,trc_count_3000,trc_sqm_3000,cafe_count_3000,cafe_sum_3000_min_price_avg,cafe_sum_3000_max_price_avg,cafe_avg_price_3000,cafe_count_3000_na_price,cafe_count_3000_price_500,cafe_count_3000_price_1000,cafe_count_3000_price_1500,cafe_count_3000_price_2500,cafe_count_3000_price_4000,cafe_count_3000_price_high,big_church_count_3000,church_count_3000,mosque_count_3000,leisure_count_3000,sport_count_3000,market_count_3000,green_part_5000,prom_part_5000,office_count_5000,office_sqm_5000,trc_count_5000,trc_sqm_5000,cafe_count_5000,cafe_sum_5000_min_price_avg,cafe_sum_5000_max_price_avg,cafe_avg_price_5000,cafe_count_5000_na_price,cafe_count_5000_price_500,cafe_count_5000_price_1000,cafe_count_5000_price_1500,cafe_count_5000_price_2500,cafe_count_5000_price_4000,cafe_count_5000_price_high,big_church_count_5000,church_count_5000,mosque_count_5000,leisure_count_5000,sport_count_5000,market_count_5000,price_doc
1,2011-08-20T00:00:00.000+0000,43,27.0,4.0,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,1,2.590241095,1.131259906,13.57511887,1.131259906,0.145699552,0.17797535,2.158587074,0.600973099,1.080934313,23.68346,1.804127,3.633334,5.419893032,65.03871639,1,5.419893032,6.905892968,1,0.274985143,3.299821714,0.992631058,no,1.42239141,10.9185867,13.10061764,13.67565705,15.15621058,1.422391404,1,no,3.830951404,5,1.305159492,no,14.23196091,101,24.2924061,1,18.152338,5.718518835,1.210027392,1.062513046,5.814134663,4.308127002,10.81417151,1.676258313,0.485841388,3.065047099,1.107594209,8.148590774,3.516512911,2.392353035,4.248035887,0.974742843,6.715025787,0.884350021,0.648487637,0.637188832,0.947961657,0.17797535,0.625783434,0.628186549,3.932040333,14.05304655,7.389497904,7.023704919,0.516838085,good,0.0,0.0,0,0,0,0,0,,,,0,0,0,0,0,0,0,0,0,0,0,1,0,7.36,0.0,1,30500,3,55600,19,527.78,888.89,708.33,1,10,4,3,1,0,0,1,2,0,0,6,1,14.27,6.92,3,39554,9,171420,34,566.67,969.7,768.18,1,14,11,6,2,0,0,1,2,0,0,7,1,11.77,15.97,9,188854,19,1244891,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,1,2,0,0,10,1,11.98,13.55,12,251554,23,1419204,68,639.68,1079.37,859.52,5,21,22,16,3,1,0,2,4,0,0,21,1,13.09,13.31,29,807385,52,4036616,152,708.57,1185.71,947.14,12,39,48,40,9,4,0,13,22,1,0,52,4,5850000
2,2011-08-23T00:00:00.000+0000,34,19.0,3.0,,,,,,,Investment,Nagatinskij Zaton,9589336.912,115352,0.372602044,0.049637257,6880,3119,5,7759,6237,8,0,229.0,1,0,6,1,yes,1,3,0,no,no,no,no,no,no,no,no,76284,34200,42084,15727,7925,7802,70194,35622,34572,29431,9266,20165,6880,3466,3414,7759,3909,3850,17700,8998,8702,15164,7571,7593,13729,6929,6800,245,83,1,0,67,4,90,0,0,0,244,1,1,143,84,15,2,0.936699728,0.647336757,7.620630408,0.635052534,0.147754269,0.273345319,0.550689737,0.065321162,0.966479097,1.317476,4.655004,8.648587,3.411993084,40.943917,2,3.641772591,4.679744508,2,0.065263344,0.78316013,0.698081318,no,9.503405157,3.103995954,6.444333466,8.132640073,8.698054189,2.887376585,2,no,3.103995974,4,0.694535727,no,9.242585522,32,5.706113234,2,9.034641872,3.489954443,2.72429538,1.246148739,3.419574049,0.725560431,6.910567711,3.424716092,0.668363679,2.000153804,8.97282283,6.127072782,1.161578983,2.543746975,12.64987875,1.47772267,1.852560245,0.686251693,0.519311324,0.688796317,1.072315063,0.273345319,0.967820571,0.471446524,4.841543888,6.829888847,0.709260033,2.358840498,0.23028691,excellent,25.14,0.0,0,0,0,0,5,860.0,1500.0,1180.0,0,1,3,0,0,1,0,0,1,0,0,0,0,26.66,0.07,2,86600,5,94065,13,615.38,1076.92,846.15,0,5,6,1,0,1,0,1,2,0,4,2,0,21.53,7.71,3,102910,7,127065,17,694.12,1205.88,950.0,0,6,7,1,2,1,0,1,5,0,4,9,0,22.37,19.25,4,165510,8,179065,21,695.24,1190.48,942.86,0,7,8,3,2,1,0,1,5,0,4,11,0,18.07,27.32,12,821986,14,491565,30,631.03,1086.21,858.62,1,11,11,4,2,1,0,1,7,0,6,19,1,10.26,27.47,66,2690465,40,2034942,177,673.81,1148.81,911.31,9,49,65,36,15,3,0,15,29,1,10,66,14,6000000
3,2011-08-27T00:00:00.000+0000,43,29.0,2.0,,,,,,,Investment,Tekstil'shhiki,4808269.831,101708,0.112559644,0.118537385,5879,1463,4,6207,5580,7,0,1183.0,1,0,5,1,no,0,0,1,no,no,no,yes,no,no,no,no,101982,46076,55906,13028,6835,6193,63388,31813,31575,25292,7609,17683,5879,3095,2784,6207,3269,2938,14884,7821,7063,19401,9045,10356,11252,5916,5336,330,59,0,0,206,4,60,0,1,0,330,1,0,246,63,20,3,2.120998901,1.637996285,17.3515154,1.445959617,0.049101536,0.158071895,0.374847751,0.453172405,0.939275144,4.91266,3.381083,11.99648,1.277658039,15.33189647,3,1.277658039,1.701419537,3,0.328756044,3.945072522,0.468264622,no,5.60479992,2.927487097,6.963402995,8.054252314,9.067884956,0.647249803,3,no,2.927487099,4,0.70069112,no,9.540544478,5,6.710302485,3,5.777393501,7.50661249,0.772216104,1.60218297,3.682454651,3.562187704,5.75236835,1.375442778,0.733101062,1.239303854,1.978517187,0.767568769,1.952770629,0.621357002,7.682302975,0.097143527,0.841254102,1.510088854,1.48653302,1.543048836,0.391957389,0.158071895,3.178751487,0.755946015,7.92215157,4.273200485,3.156422843,4.958214283,0.190461977,poor,1.67,0.0,0,0,0,0,3,666.67,1166.67,916.67,0,0,2,1,0,0,0,0,0,0,0,0,0,4.99,0.29,0,0,0,0,9,642.86,1142.86,892.86,2,0,5,2,0,0,0,0,1,0,0,5,3,9.92,6.73,0,0,1,2600,14,516.67,916.67,716.67,2,4,6,2,0,0,0,0,4,0,0,6,5,12.99,12.75,4,100200,7,52550,24,563.64,977.27,770.45,2,8,9,4,1,0,0,0,4,0,0,8,5,12.14,26.46,8,110856,7,52550,41,697.44,1192.31,944.87,2,9,17,9,3,1,0,0,11,0,0,20,6,13.69,21.58,43,1478160,35,1572990,122,702.68,1196.43,949.55,10,29,45,25,10,3,0,11,27,0,4,67,10,5700000
4,2011-09-01T00:00:00.000+0000,89,50.0,9.0,,,,,,,Investment,Mitino,12583535.69,178473,0.194702869,0.069753361,13087,6839,9,13670,17063,10,0,,1,0,17,6,no,0,11,4,no,no,no,no,no,no,no,no,21155,9828,11327,28563,14680,13883,120381,60040,60341,29529,9083,20446,13087,6645,6442,13670,7126,6544,32063,16513,15550,3292,1450,1842,24934,12782,12152,458,9,51,12,124,50,201,0,9,2,459,13,24,40,130,252,4,1.489049154,0.984536582,11.56562408,0.963802007,0.179440956,0.236455018,0.078090293,0.106124506,0.451173311,15.62371,2.01708,14.31764,4.2914325,51.49719001,4,3.816044582,5.271136062,4,0.131596959,1.579163513,1.200336487,no,2.677824281,14.60650078,17.45719794,18.30943312,19.48700542,2.677824284,1,no,2.780448941,17,1.999265421,no,17.47838035,83,6.734618018,1,27.6678632,9.522537611,6.348716334,1.767612439,11.17833328,0.583024969,27.89271688,0.811275289,0.62348431,1.950316967,6.483171621,7.385520691,4.923843177,3.549557568,8.789894266,2.163735157,10.9031613,0.622271644,0.599913582,0.934273498,0.8926743,0.236455018,1.03177679,1.561504846,15.30044908,16.99067736,16.04152067,5.02969633,0.465820158,good,17.36,0.57,0,0,0,0,2,1000.0,1500.0,1250.0,0,0,0,2,0,0,0,0,0,0,0,0,0,19.25,10.35,1,11000,6,80780,12,658.33,1083.33,870.83,0,3,4,5,0,0,0,0,0,0,0,3,1,28.38,6.57,2,11000,7,89492,23,673.91,1130.43,902.17,0,5,9,8,1,0,0,1,0,0,0,9,2,32.29,5.73,2,11000,7,89492,25,660.0,1120.0,890.0,0,5,11,8,1,0,0,1,1,0,0,13,2,20.79,3.57,4,167000,12,205756,32,718.75,1218.75,968.75,0,5,14,10,3,0,0,1,2,0,0,18,3,14.18,3.89,8,244166,22,942180,61,931.58,1552.63,1242.11,4,7,21,15,11,2,1,4,4,0,0,26,3,13100000
5,2011-09-05T00:00:00.000+0000,77,77.0,4.0,,,,,,,Investment,Basmannoe,8398460.622,108171,0.015233744,0.037316452,5706,3240,7,6748,7770,9,0,562.0,4,2,25,2,no,0,10,93,no,no,no,yes,yes,no,no,no,28179,13522,14657,13368,7159,6209,68043,34236,33807,26760,8563,18197,5706,2982,2724,6748,3664,3084,15237,8113,7124,5164,2583,2581,11631,6223,5408,746,48,0,0,643,16,35,0,3,1,746,371,114,146,62,53,5,1.257186453,0.876620232,8.266305238,0.68885877,0.247901208,0.376838057,0.258288769,0.236214054,0.392870988,10.68354,2.936581,11.90391,0.853960072,10.24752087,5,1.59589817,2.156283865,113,0.071480323,0.857763874,0.820294318,no,11.61665314,1.721833675,0.046809568,0.787593311,2.578670647,1.721833683,4,no,3.133530966,10,0.084112545,yes,1.59589817,113,1.423427954,4,6.515857089,8.671015673,1.638318096,3.632640421,4.587916559,2.60941961,9.15505713,1.969737724,0.220287667,2.544696,3.975401349,3.610753828,0.307915375,1.864637406,3.779781109,1.121702845,0.991682626,0.892667526,0.429052137,0.077900959,0.810801456,0.376838057,0.378755838,0.121680643,2.584369607,1.11248589,1.800124877,1.339652258,0.026102416,excellent,3.56,4.44,15,293699,1,45000,48,702.22,1166.67,934.44,3,17,10,11,7,0,0,1,4,0,2,3,0,3.34,8.29,46,420952,3,158200,153,763.45,1272.41,1017.93,8,39,45,39,19,2,1,7,12,0,6,7,0,4.12,4.83,93,1195735,9,445900,272,766.8,1272.73,1019.76,19,70,74,72,30,6,1,18,30,0,10,14,2,4.53,5.02,149,1625130,17,564843,483,765.93,1269.23,1017.58,28,130,129,131,50,14,1,35,61,0,17,21,3,5.06,8.62,305,3420907,60,2296870,1068,853.03,1410.45,1131.74,63,266,267,262,149,57,4,70,121,1,40,77,5,8.38,10.92,689,8404624,114,3503058,2283,853.88,1411.45,1132.66,143,566,578,552,319,108,17,135,236,2,91,195,14,16331452
6,2011-09-06T00:00:00.000+0000,67,46.0,14.0,,,,,,,Investment,Nizhegorodskoe,7506452.02,43795,0.007670134,0.486245621,2418,852,2,2514,2012,3,0,,0,0,7,0,no,0,6,19,yes,no,no,yes,no,no,no,no,19940,9400,10540,5291,2744,2547,29660,15793,13867,8844,2608,6236,2418,1224,1194,2514,1328,1186,5866,3035,2831,4851,2329,2522,4632,2399,2233,188,24,0,0,147,2,15,0,0,0,188,0,5,152,25,6,6,2.735883907,1.593246481,18.37816963,1.531514136,0.145954816,0.113466218,1.073495427,1.497902638,0.256487453,7.18674,0.78033,14.07514,0.375311695,4.503740339,6,0.375311695,1.407418835,6,0.189227153,2.270725835,0.612447325,no,8.296086727,0.284868107,3.519388985,4.395057477,5.645795859,0.284868136,4,no,1.478528507,3,0.244670412,no,5.070196504,5,6.682088764,4,3.95950924,8.757686082,0.193126987,2.34156168,1.272894442,1.438003448,5.374563767,3.447863628,0.81041306,1.911842782,2.108923435,4.233094726,1.450974874,3.391116928,4.356122442,1.698723584,3.830021305,1.042261834,0.440707312,0.422357874,3.066285203,0.113466218,0.686931702,0.870446514,4.787705729,3.388809733,3.71355663,2.553423533,0.004469307,poor,0.0,19.42,5,227705,3,102000,7,1000.0,1625.0,1312.5,3,0,1,2,1,0,0,0,0,0,0,0,0,0.0,40.27,10,275135,5,164000,9,883.33,1416.67,1150.0,3,1,1,3,1,0,0,3,1,0,0,1,0,0.0,50.64,18,431090,6,186400,14,718.18,1181.82,950.0,3,3,3,4,1,0,0,4,2,0,0,11,0,0.38,51.58,21,471290,14,683945,33,741.38,1258.62,1000.0,4,5,13,8,2,1,0,6,5,0,0,21,1,1.82,39.99,54,1181009,29,1059171,120,737.96,1231.48,984.72,12,24,37,35,11,1,0,12,12,0,2,31,7,5.92,25.79,253,4274339,63,2010320,567,769.92,1280.08,1025.0,35,137,163,155,62,14,1,53,78,1,20,113,17,9100000
7,2011-09-08T00:00:00.000+0000,25,14.0,10.0,,,,,,,Investment,Sokol'niki,10320473.23,57405,0.523439075,0.042307271,2459,933,5,2810,5050,5,0,4849.0,3,0,17,6,no,0,6,9,no,no,no,no,no,no,no,yes,85956,40724,45232,5682,2925,2757,35003,17490,17513,16720,5351,11369,2459,1241,1218,2810,1472,1338,6510,3345,3165,19445,10085,9360,4884,2507,2377,217,23,1,0,139,12,41,0,0,1,216,11,38,90,58,19,7,1.453761834,1.026916052,12.32299262,1.026916052,0.501165569,0.516979177,2.393238738,0.632444746,0.707268095,13.9007,1.371596,10.32988,2.603670532,31.24404639,7,3.031337419,3.959288563,7,0.050211053,0.602532635,0.399865062,no,8.618597103,2.026622694,3.940565824,4.686895435,6.468718768,2.026622672,4,no,3.657904476,20,0.464015082,no,3.028547461,97,4.062549158,4,6.164593843,11.80753189,2.10648677,1.298839815,5.106633095,3.345431932,13.70496751,1.825870151,0.132256233,1.278911604,1.733264299,1.877747146,0.588118977,0.198894455,1.374586648,0.868403997,3.168781583,0.544311589,0.513689337,0.067724564,0.27301919,0.516979177,0.822377431,0.318230275,4.48494144,0.75222301,2.969314705,3.373342934,0.447173592,poor,0.0,0.0,3,7719,0,0,2,750.0,1250.0,1000.0,0,0,1,1,0,0,0,0,2,0,1,2,0,14.59,4.16,8,151911,3,19400,16,760.0,1300.0,1030.0,1,3,7,3,1,1,0,2,4,0,2,7,0,20.5,5.57,20,453371,9,105241,44,673.17,1158.54,915.85,3,12,17,7,4,1,0,4,11,0,3,20,1,23.45,5.25,42,791467,17,510141,71,720.0,1215.38,967.69,6,16,24,17,7,1,0,9,14,0,4,28,1,22.2,7.57,72,1387991,24,743439,160,698.04,1183.01,940.52,7,41,57,37,16,2,0,17,29,0,8,56,2,25.23,12.76,228,3305831,49,1656529,635,771.05,1291.18,1031.11,34,163,194,144,81,16,3,38,80,1,27,127,8,5500000
8,2011-09-09T00:00:00.000+0000,44,44.0,5.0,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,8,3.623270141,1.593203382,11.52778764,0.96064897,0.134365152,0.581510695,1.602056341,0.165124113,1.166991896,23.11838,1.131798,4.042541,4.309574655,51.71489587,8,5.622394422,7.453825079,1,0.25481389,3.057766679,0.567006418,no,1.04229051,11.53414568,13.71375173,14.2861768,15.77042634,1.042290487,1,no,3.303551988,36,1.964850803,no,14.4344623,101,23.39166932,1,18.84810404,5.33292585,1.149883196,0.952974662,5.713887982,4.871096986,9.913434734,2.795147723,0.342545086,2.090237161,2.359385347,8.114016035,3.495441189,1.69654313,4.213461148,0.746259906,6.917527177,1.09857248,0.657587332,0.872231483,0.371856693,0.581510695,1.079769307,0.558238725,4.469517135,13.15230977,8.018089004,7.718966846,0.373715085,good,17.62,0.0,0,0,0,0,4,1000.0,1625.0,1312.5,0,0,1,2,1,0,0,0,0,0,0,1,0,26.27,0.0,1,30500,3,13200,13,784.62,1269.23,1026.92,0,4,2,5,2,0,0,0,1,0,0,3,0,17.42,1.97,2,70500,11,207671,29,585.71,982.14,783.93,1,13,7,6,2,0,0,1,3,0,0,7,1,15.02,7.13,6,113554,12,226291,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,2,4,0,0,9,1,11.47,13.56,12,251554,21,1377234,55,623.53,1049.02,836.27,4,21,15,11,3,1,0,3,4,0,0,18,2,15.8,11.44,24,711780,45,2197401,143,709.09,1189.39,949.24,11,37,46,36,9,4,0,11,18,1,0,47,4,2000000
9,2011-09-10T00:00:00.000+0000,42,27.0,5.0,,,,,,,Investment,Koptevo,4787424.272,96959,0.03845587,0.122090347,6507,3272,4,6566,8751,6,0,1894.0,4,0,7,6,no,0,0,7,no,no,no,no,no,no,no,no,78810,36091,42719,13901,7344,6557,59120,29664,29456,23938,6980,16958,6507,3456,3051,6566,3453,3113,15510,8201,7309,17662,8488,9174,12269,6487,5782,390,101,2,0,216,11,60,0,0,0,390,1,9,290,39,51,9,2.175430788,1.338707436,14.74228941,1.228524117,0.11290453,0.256589329,0.939584123,0.148956733,0.670431737,20.94097,1.759887,3.925497,1.540958494,18.49150192,9,1.540958494,2.428244611,9,0.219775841,2.637310091,0.322954506,no,6.53914608,5.514867811,8.290836714,8.963023472,10.357535,2.438125457,5,no,2.725990867,14,0.83357986,no,7.50071515,121,8.571925744,5,16.911467,1.652274682,2.679181439,1.307362663,6.178131848,4.421560784,18.18034021,1.263583558,0.0,2.467325771,0.540760283,0.731141912,0.799921119,1.782439237,3.432144582,1.032523625,1.204220449,0.256589329,2.332387982,1.143674241,0.217477464,0.256589329,0.554880314,0.555962558,4.43275291,6.788241041,5.273171011,2.449453233,0.123609825,poor,7.71,0.0,0,0,0,0,7,633.33,1083.33,858.33,1,1,3,2,0,0,0,0,0,0,0,5,0,19.25,6.55,0,0,0,0,10,566.67,1000.0,783.33,1,2,5,2,0,0,0,2,2,0,0,6,0,19.83,19.76,5,116353,0,0,15,592.31,1038.46,815.38,2,4,6,2,1,0,0,2,3,0,0,9,2,22.0,19.72,12,144351,0,0,18,593.75,1031.25,812.5,2,5,7,3,1,0,0,2,3,0,0,14,3,17.87,21.59,33,872741,12,503283,98,572.41,965.52,768.97,11,36,28,21,1,1,0,4,8,0,1,24,6,18.73,14.52,94,2283313,41,1051314,292,711.36,1185.61,948.48,28,86,81,69,19,8,1,18,34,1,3,85,11,5300000
10,2011-09-13T00:00:00.000+0000,36,21.0,9.0,,,,,,,Investment,Kuncevo,52351766.54,142462,0.070662054,0.035145267,9347,4050,7,9292,11234,8,0,2620.0,0,0,5,2,no,0,3,3,no,no,no,yes,no,no,no,no,78507,37052,41455,19784,10339,9445,85551,43352,42199,37127,11024,26103,9347,4806,4541,9292,4919,4373,22071,11484,10587,15929,8541,7388,17469,9082,8387,737,68,204,14,237,21,180,2,10,1,738,47,88,413,94,96,10,1.383373263,0.396274723,4.755296675,0.396274723,0.309672803,0.150686282,1.969773852,0.191191251,2.024970785,16.75309,2.790526,16.5619,2.522421641,30.26905969,10,1.673206368,2.735303613,22,0.218409543,2.62091451,1.021499072,no,3.953500268,5.885973152,8.955785282,10.00345994,10.86912007,1.101813754,6,no,1.102384108,9,0.147894252,no,10.22914523,50,13.77381942,5,20.09408343,7.353920421,0.61502045,0.453543783,4.154171002,2.12435988,20.46122679,5.174844257,0.508801924,1.281865943,5.741238918,5.076912538,2.969620143,1.35479157,5.673912846,2.604184808,5.23895598,0.150686282,0.912378535,0.459233791,1.794687627,0.150686282,1.167638745,1.150550082,3.557616808,4.373416959,5.148563093,0.711874878,0.345225166,satisfactory,33.79,0.0,1,15565,0,0,2,400.0,750.0,575.0,0,1,1,0,0,0,0,0,0,0,0,0,0,29.35,0.0,2,46565,1,16600,5,620.0,1100.0,860.0,0,2,2,0,1,0,0,0,0,0,1,7,0,25.02,0.0,2,46565,1,16600,9,788.89,1388.89,1088.89,0,2,4,0,3,0,0,2,2,0,1,10,0,20.47,0.0,2,46565,4,33440,22,861.9,1476.19,1169.05,1,2,10,4,4,1,0,2,2,0,1,13,0,22.56,7.15,10,635269,12,545761,56,1050.94,1735.85,1393.4,3,9,15,12,9,8,0,5,9,0,1,32,0,23.35,10.5,30,1132519,32,1404650,143,921.71,1531.01,1226.36,14,28,39,30,19,13,0,10,20,1,3,67,1,2000000
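
The `inferSchema` option makes Spark take an extra pass over the file to guess column types, which can matter for a file this wide. One possible alternative is to pass an explicit schema as a DDL string. The sketch below is not part of the original notebook: the helper name `ddl_schema` is hypothetical, the types are assumptions guessed from the sample rows above, and only the first five columns are listed, whereas a real schema would need every column in file order.

```python
# Hypothetical helper (not in the original notebook): build a DDL schema
# string that could be passed to spark.read.schema(...) instead of inferSchema.
def ddl_schema(columns):
    """Return a DDL string such as '`id` INT, `full_sq` INT'.

    Names are wrapped in backticks so that columns such as
    build_count_1921-1945 are treated as plain identifiers.
    """
    return ", ".join(f"`{name}` {dtype}" for name, dtype in columns.items())


# Assumed types for the first five columns, guessed from the displayed rows.
housing_columns = {
    "id": "INT",
    "timestamp": "TIMESTAMP",
    "full_sq": "INT",
    "life_sq": "DOUBLE",
    "floor": "DOUBLE",
}

schema = ddl_schema(housing_columns)
print(schema)

# In the notebook, the schema would replace the inferSchema option:
# df1 = (spark.read.format("csv")
#        .option("header", "true")
#        .option("sep", ",")
#        .schema(schema)
#        .load("/FileStore/tables/train.csv"))
```

Keeping the column types in a dictionary makes it easy to review them next to the sample output before committing to a full schema.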


In [4]:
# Save the DataFrame as a permanent table in Parquet format.
# Unlike a temp view, which is only available to this particular notebook, a saved table persists
# across cluster restarts and lets other users query the data from different notebooks.

permanent_table_name = "sberbank_housing"
df1.write.format("parquet").mode('overwrite').saveAsTable(permanent_table_name)

In [5]:
# Sberbank Russian Housing Market
# https://www.kaggle.com/c/sberbank-russian-housing-market

# File location and type
file_location = "/FileStore/tables/macro.csv"
file_type = "csv"

# CSV options
infer_schema = "true"
first_row_is_header = "true"
delimiter = ","

# The applied options are for CSV files. For other file types, these will be ignored.
df2 = spark.read.format(file_type) \
  .option("inferSchema", infer_schema) \
  .option("header", first_row_is_header) \
  .option("sep", delimiter) \
  .load(file_location)

display(df2)

timestamp,oil_urals,gdp_quart,gdp_quart_growth,cpi,ppi,gdp_deflator,balance_trade,balance_trade_growth,usdrub,eurrub,brent,net_capital_export,gdp_annual,gdp_annual_growth,average_provision_of_build_contract,average_provision_of_build_contract_moscow,rts,micex,micex_rgbi_tr,micex_cbi_tr,deposits_value,deposits_growth,deposits_rate,mortgage_value,mortgage_growth,mortgage_rate,grp,grp_growth,income_per_cap,real_dispos_income_per_cap_growth,salary,salary_growth,fixed_basket,retail_trade_turnover,retail_trade_turnover_per_cap,retail_trade_turnover_growth,labor_force,unemployment,employment,invest_fixed_capital_per_cap,invest_fixed_assets,profitable_enterpr_share,unprofitable_enterpr_share,share_own_revenues,overdue_wages_per_cap,fin_res_per_cap,marriages_per_1000_cap,divorce_rate,construction_value,invest_fixed_assets_phys,pop_natural_increase,pop_migration,pop_total_inc,childbirth,mortality,housing_fund_sqm,lodging_sqm_per_cap,water_pipes_share,baths_share,sewerage_share,gas_share,hot_water_share,electric_stove_share,heating_share,old_house_share,average_life_exp,infant_mortarity_per_1000_cap,perinatal_mort_per_1000_cap,incidence_population,rent_price_4+room_bus,rent_price_3room_bus,rent_price_2room_bus,rent_price_1room_bus,rent_price_3room_eco,rent_price_2room_eco,rent_price_1room_eco,load_of_teachers_preschool_per_teacher,child_on_acc_pre_school,load_of_teachers_school_per_teacher,students_state_oneshift,modern_education_share,old_education_build_share,provision_doctors,provision_nurse,load_on_doctors,power_clinics,hospital_beds_available_per_cap,hospital_bed_occupancy_per_year,provision_retail_space_sqm,provision_retail_space_modern_sqm,turnover_catering_per_cap,theaters_viewers_per_1000_cap,seats_theather_rfmin_per_100000_cap,museum_visitis_per_100_cap,bandwidth_sports,population_reg_sports_share,students_reg_sports_share,apartment_build,apartment_fund_sqm
2010-01-01T00:00:00.000+0000,76.1,,,,,,,,,,,,38807.2,-0.078208582,5.0,,,,,,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-02T00:00:00.000+0000,76.1,,,,,,,,,,,,38807.2,-0.078208582,5.0,,,,,175.15,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-03T00:00:00.000+0000,76.1,,,,,,,,,,,,38807.2,-0.078208582,5.0,,,,,178.66,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-04T00:00:00.000+0000,76.1,,,,,,,,29.905,43.4054,80.12,,38807.2,-0.078208582,5.0,,,,,183.44,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-05T00:00:00.000+0000,76.1,,,,,,,,29.836,42.96,80.59,,38807.2,-0.078208582,5.0,,,,,183.44,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-06T00:00:00.000+0000,76.1,,,,,,,,29.715,42.9138,81.89,,38807.2,-0.078208582,5.0,,,,,184.87,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-07T00:00:00.000+0000,76.1,,,,,,,,29.775,42.7143,81.51,,38807.2,-0.078208582,5.0,,,,,187.97,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-08T00:00:00.000+0000,76.1,,,,,,,,29.775,42.9359,81.37,,38807.2,-0.078208582,5.0,,,,,187.97,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-09T00:00:00.000+0000,76.1,,,,,,,,29.775,42.9359,81.37,,38807.2,-0.078208582,5.0,,,,,190.77,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
2010-01-10T00:00:00.000+0000,76.1,,,,,,,,29.775,42.9359,81.37,,38807.2,-0.078208582,5.0,,,,,192.39,7484970,,,142968,,13.72,8375.8638,,30789.2,0.038,38410.5,,11443.63,2882.4169,251.484,106.8,6436.244,0.018,0.69,63932.0,732.760604,0.716,0.284,0.962915058,12865,233.3515293,8.0,3.9,553874.9,95.8,-0.3,14.1,13.8,10.7,11.0,216,18.71588251,99.9,99.8,99.6,44.4,95.7,55.0,99.9,0.3,74.2,6.1,5.87,696.6,,,,,,,,721.4777653,45713,1356.112607,,,,18.0,99.4,7872.848285,162.9,,,,690.0,6221,527,0.41,993,,,63.03,22825,
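
Cells In [3] and In [5] differ only in the file path, so the read could be factored into a small helper. This is a sketch under that assumption: `read_csv` is a hypothetical name, and the `spark` session is passed in explicitly rather than taken from the notebook's global scope.

```python
def read_csv(spark, path, header=True, infer_schema=True, sep=","):
    """Read a CSV file into a DataFrame using the same options as the cells above."""
    return (
        spark.read.format("csv")
        # Spark expects these option values as strings such as "true"/"false".
        .option("inferSchema", str(infer_schema).lower())
        .option("header", str(header).lower())
        .option("sep", sep)
        .load(path)
    )


# In the notebook, where `spark` is already defined:
# df1 = read_csv(spark, "/FileStore/tables/train.csv")
# df2 = read_csv(spark, "/FileStore/tables/macro.csv")
```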


In [6]:
# Save the DataFrame as a permanent table in Parquet format.
# Unlike a temp view, which is only available to this particular notebook, a saved table persists
# across cluster restarts and lets other users query the data from different notebooks.

permanent_table_name = "macro"
df2.write.format("parquet").mode('overwrite').saveAsTable(permanent_table_name)

In [7]:
# Cleanup
del df1
del df2
del permanent_table_name

## TRANSFORMATION
### Raw CSV or Raw PARQUET -> Parquet

In [9]:
macro = spark.sql("SELECT * FROM macro")
print(macro.count())
print(macro.dtypes)

In [10]:
sberbank_housing = spark.sql("SELECT * FROM sberbank_housing")
print(sberbank_housing.count())
print(sberbank_housing.dtypes)

In [11]:
# Wide representation of the data in one table:
# macro adds supplemental data to sberbank_housing.
# Rename the timestamp columns so the two sides stay distinguishable after the join.
sberbank_housing = sberbank_housing.withColumnRenamed("timestamp", "sberbank_timestamp")
macro = macro.withColumnRenamed("timestamp", "macro_timestamp")
# Left outer join: keep every housing row, even if no macro row shares its timestamp.
sberbank_housing_macro = sberbank_housing.join(
    macro,
    sberbank_housing.sberbank_timestamp == macro.macro_timestamp,
    "left_outer",
)
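
To make the join semantics concrete, here is a small plain-Python sketch (no Spark involved) of what a left outer join on the timestamp does: every housing row is kept, the macro columns are `None` when no macro row shares the timestamp, and a duplicated macro timestamp would duplicate the housing row. The function name `left_outer_join` is hypothetical and the toy rows are made up for illustration; they are not values from the dataset.

```python
def left_outer_join(left_rows, right_rows, left_key, right_key):
    """Plain-Python illustration of a left outer join on a single key."""
    # Index the right side by key; one key may map to several rows.
    index = {}
    for row in right_rows:
        index.setdefault(row[right_key], []).append(row)

    right_columns = sorted({col for row in right_rows for col in row})
    joined = []
    for row in left_rows:
        matches = index.get(row[left_key])
        if matches:
            # One output row per matching right row.
            joined.extend({**row, **match} for match in matches)
        else:
            # No match: keep the left row and fill the right columns with None.
            joined.append({**row, **dict.fromkeys(right_columns)})
    return joined


# Made-up toy rows, not real data.
housing = [
    {"id": 1, "sberbank_timestamp": "2020-01-01"},
    {"id": 2, "sberbank_timestamp": "2020-01-02"},
]
macro_rows = [
    {"macro_timestamp": "2020-01-01", "oil_urals": 50.0},
]

joined = left_outer_join(housing, macro_rows, "sberbank_timestamp", "macro_timestamp")
for row in joined:
    print(row)
```

If the macro timestamps are unique, the joined row count equals the housing row count, which is one way to read the `count()` printed in the next cell.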

In [12]:
print(sberbank_housing_macro.count())
print(sberbank_housing_macro.columns)

In [13]:
sberbank_housing_macro.createOrReplaceTempView("sberbank_housing_macro")
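
Some columns in the joined view, such as `build_count_1921-1945` and `rent_price_4+room_bus`, contain characters that Spark SQL would otherwise read as operators, so they generally need backtick quoting when selected by name. The sketch below shows one way to build such a query; `quote_identifier` and `select_statement` are hypothetical helper names, not part of the notebook or of the Spark API.

```python
def quote_identifier(name):
    """Wrap a name in backticks, doubling any backtick it already contains."""
    return "`" + name.replace("`", "``") + "`"


def select_statement(table, columns, limit=10):
    """Build a SELECT statement over explicitly quoted column names."""
    column_list = ", ".join(quote_identifier(c) for c in columns)
    return f"SELECT {column_list} FROM {quote_identifier(table)} LIMIT {int(limit)}"


query = select_statement(
    "sberbank_housing_macro",
    ["id", "build_count_1921-1945", "rent_price_4+room_bus", "price_doc"],
)
print(query)

# In the notebook, where `spark` is already defined:
# display(spark.sql(query))
```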

In [14]:
%sql
SELECT *
FROM sberbank_housing_macro
LIMIT 10

id,sberbank_timestamp,full_sq,life_sq,floor,max_floor,material,build_year,num_room,kitch_sq,state,product_type,sub_area,area_m,raion_popul,green_zone_part,indust_part,children_preschool,preschool_quota,preschool_education_centers_raion,children_school,school_quota,school_education_centers_raion,school_education_centers_top_20_raion,hospital_beds_raion,healthcare_centers_raion,university_top_20_raion,sport_objects_raion,additional_education_raion,culture_objects_top_25,culture_objects_top_25_raion,shopping_centers_raion,office_raion,thermal_power_plant_raion,incineration_raion,oil_chemistry_raion,radiation_raion,railroad_terminal_raion,big_market_raion,nuclear_reactor_raion,detention_facility_raion,full_all,male_f,female_f,young_all,young_male,young_female,work_all,work_male,work_female,ekder_all,ekder_male,ekder_female,0_6_all,0_6_male,0_6_female,7_14_all,7_14_male,7_14_female,0_17_all,0_17_male,0_17_female,16_29_all,16_29_male,16_29_female,0_13_all,0_13_male,0_13_female,raion_build_count_with_material_info,build_count_block,build_count_wood,build_count_frame,build_count_brick,build_count_monolith,build_count_panel,build_count_foam,build_count_slag,build_count_mix,raion_build_count_with_builddate_info,build_count_before_1920,build_count_1921-1945,build_count_1946-1970,build_count_1971-1995,build_count_after_1995,ID_metro,metro_min_avto,metro_km_avto,metro_min_walk,metro_km_walk,kindergarten_km,school_km,park_km,green_zone_km,industrial_km,water_treatment_km,cemetery_km,incineration_km,railroad_station_walk_km,railroad_station_walk_min,ID_railroad_station_walk,railroad_station_avto_km,railroad_station_avto_min,ID_railroad_station_avto,public_transport_station_km,public_transport_station_min_walk,water_km,water_1line,mkad_km,ttk_km,sadovoe_km,bulvar_ring_km,kremlin_km,big_road1_km,ID_big_road1,big_road1_1line,big_road2_km,ID_big_road2,railroad_km,railroad_1line,zd_vokzaly_avto_km,ID_railroad_terminal,bus_terminal_avto_km,ID_bus_terminal,oil_chemistry_km,nuclear_reacto
r_km,radiation_km,power_transmission_line_km,thermal_power_plant_km,ts_km,big_market_km,market_shop_km,fitness_km,swim_pool_km,ice_rink_km,stadium_km,basketball_km,hospice_morgue_km,detention_facility_km,public_healthcare_km,university_km,workplaces_km,shopping_centers_km,office_km,additional_education_km,preschool_km,big_church_km,church_synagogue_km,mosque_km,theater_km,museum_km,exhibition_km,catering_km,ecology,green_part_500,prom_part_500,office_count_500,office_sqm_500,trc_count_500,trc_sqm_500,cafe_count_500,cafe_sum_500_min_price_avg,cafe_sum_500_max_price_avg,cafe_avg_price_500,cafe_count_500_na_price,cafe_count_500_price_500,cafe_count_500_price_1000,cafe_count_500_price_1500,cafe_count_500_price_2500,cafe_count_500_price_4000,cafe_count_500_price_high,big_church_count_500,church_count_500,mosque_count_500,leisure_count_500,sport_count_500,market_count_500,green_part_1000,prom_part_1000,office_count_1000,office_sqm_1000,trc_count_1000,trc_sqm_1000,cafe_count_1000,cafe_sum_1000_min_price_avg,cafe_sum_1000_max_price_avg,cafe_avg_price_1000,cafe_count_1000_na_price,cafe_count_1000_price_500,cafe_count_1000_price_1000,cafe_count_1000_price_1500,cafe_count_1000_price_2500,cafe_count_1000_price_4000,cafe_count_1000_price_high,big_church_count_1000,church_count_1000,mosque_count_1000,leisure_count_1000,sport_count_1000,market_count_1000,green_part_1500,prom_part_1500,office_count_1500,office_sqm_1500,trc_count_1500,trc_sqm_1500,cafe_count_1500,cafe_sum_1500_min_price_avg,cafe_sum_1500_max_price_avg,cafe_avg_price_1500,cafe_count_1500_na_price,cafe_count_1500_price_500,cafe_count_1500_price_1000,cafe_count_1500_price_1500,cafe_count_1500_price_2500,cafe_count_1500_price_4000,cafe_count_1500_price_high,big_church_count_1500,church_count_1500,mosque_count_1500,leisure_count_1500,sport_count_1500,market_count_1500,green_part_2000,prom_part_2000,office_count_2000,office_sqm_2000,trc_count_2000,trc_sqm_2000,cafe_count_2000,cafe_sum_2000_min_price_avg,cafe_sum_2000_max_
price_avg,cafe_avg_price_2000,cafe_count_2000_na_price,cafe_count_2000_price_500,cafe_count_2000_price_1000,cafe_count_2000_price_1500,cafe_count_2000_price_2500,cafe_count_2000_price_4000,cafe_count_2000_price_high,big_church_count_2000,church_count_2000,mosque_count_2000,leisure_count_2000,sport_count_2000,market_count_2000,green_part_3000,prom_part_3000,office_count_3000,office_sqm_3000,trc_count_3000,trc_sqm_3000,cafe_count_3000,cafe_sum_3000_min_price_avg,cafe_sum_3000_max_price_avg,cafe_avg_price_3000,cafe_count_3000_na_price,cafe_count_3000_price_500,cafe_count_3000_price_1000,cafe_count_3000_price_1500,cafe_count_3000_price_2500,cafe_count_3000_price_4000,cafe_count_3000_price_high,big_church_count_3000,church_count_3000,mosque_count_3000,leisure_count_3000,sport_count_3000,market_count_3000,green_part_5000,prom_part_5000,office_count_5000,office_sqm_5000,trc_count_5000,trc_sqm_5000,cafe_count_5000,cafe_sum_5000_min_price_avg,cafe_sum_5000_max_price_avg,cafe_avg_price_5000,cafe_count_5000_na_price,cafe_count_5000_price_500,cafe_count_5000_price_1000,cafe_count_5000_price_1500,cafe_count_5000_price_2500,cafe_count_5000_price_4000,cafe_count_5000_price_high,big_church_count_5000,church_count_5000,mosque_count_5000,leisure_count_5000,sport_count_5000,market_count_5000,price_doc,macro_timestamp,oil_urals,gdp_quart,gdp_quart_growth,cpi,ppi,gdp_deflator,balance_trade,balance_trade_growth,usdrub,eurrub,brent,net_capital_export,gdp_annual,gdp_annual_growth,average_provision_of_build_contract,average_provision_of_build_contract_moscow,rts,micex,micex_rgbi_tr,micex_cbi_tr,deposits_value,deposits_growth,deposits_rate,mortgage_value,mortgage_growth,mortgage_rate,grp,grp_growth,income_per_cap,real_dispos_income_per_cap_growth,salary,salary_growth,fixed_basket,retail_trade_turnover,retail_trade_turnover_per_cap,retail_trade_turnover_growth,labor_force,unemployment,employment,invest_fixed_capital_per_cap,invest_fixed_assets,profitable_enterpr_share,unprofitable_enterpr_sha
re,share_own_revenues,overdue_wages_per_cap,fin_res_per_cap,marriages_per_1000_cap,divorce_rate,construction_value,invest_fixed_assets_phys,pop_natural_increase,pop_migration,pop_total_inc,childbirth,mortality,housing_fund_sqm,lodging_sqm_per_cap,water_pipes_share,baths_share,sewerage_share,gas_share,hot_water_share,electric_stove_share,heating_share,old_house_share,average_life_exp,infant_mortarity_per_1000_cap,perinatal_mort_per_1000_cap,incidence_population,rent_price_4+room_bus,rent_price_3room_bus,rent_price_2room_bus,rent_price_1room_bus,rent_price_3room_eco,rent_price_2room_eco,rent_price_1room_eco,load_of_teachers_preschool_per_teacher,child_on_acc_pre_school,load_of_teachers_school_per_teacher,students_state_oneshift,modern_education_share,old_education_build_share,provision_doctors,provision_nurse,load_on_doctors,power_clinics,hospital_beds_available_per_cap,hospital_bed_occupancy_per_year,provision_retail_space_sqm,provision_retail_space_modern_sqm,turnover_catering_per_cap,theaters_viewers_per_1000_cap,seats_theather_rfmin_per_100000_cap,museum_visitis_per_100_cap,bandwidth_sports,population_reg_sports_share,students_reg_sports_share,apartment_build,apartment_fund_sqm
1,2011-08-20T00:00:00.000+0000,43,27,4,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,1,2.590241095,1.131259906,13.57511887,1.131259906,0.145699552,0.17797535,2.158587074,0.600973099,1.080934313,23.68346,1.804127,3.633334,5.419893032,65.03871639,1,5.419893032,6.905892968,1,0.274985143,3.299821714,0.992631058,no,1.42239141,10.9185867,13.10061764,13.67565705,15.15621058,1.422391404,1,no,3.830951404,5,1.305159492,no,14.23196091,101,24.2924061,1,18.152338,5.718518835,1.210027392,1.062513046,5.814134663,4.308127002,10.81417151,1.676258313,0.485841388,3.065047099,1.107594209,8.148590774,3.516512911,2.392353035,4.248035887,0.974742843,6.715025787,0.884350021,0.648487637,0.637188832,0.947961657,0.17797535,0.625783434,0.628186549,3.932040333,14.05304655,7.389497904,7.023704919,0.516838085,good,0.0,0.0,0,0,0,0,0,,,,0,0,0,0,0,0,0,0,0,0,0,1,0,7.36,0.0,1,30500,3,55600,19,527.78,888.89,708.33,1,10,4,3,1,0,0,1,2,0,0,6,1,14.27,6.92,3,39554,9,171420,34,566.67,969.7,768.18,1,14,11,6,2,0,0,1,2,0,0,7,1,11.77,15.97,9,188854,19,1244891,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,1,2,0,0,10,1,11.98,13.55,12,251554,23,1419204,68,639.68,1079.37,859.52,5,21,22,16,3,1,0,2,4,0,0,21,1,13.09,13.31,29,807385,52,4036616,152,708.57,1185.71,947.14,12,39,48,40,9,4,0,13,22,1,0,52,4,5850000,2011-08-20T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,29.0048,41.7681,108.62,0.301810526,46308.5,0.045037137,5.76,6.74,1575.33,1438.74,131.16,204.78,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8
,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
2,2011-08-23T00:00:00.000+0000,34,19,3,,,,,,,Investment,Nagatinskij Zaton,9589336.912,115352,0.372602044,0.049637257,6880,3119,5,7759,6237,8,0,229.0,1,0,6,1,yes,1,3,0,no,no,no,no,no,no,no,no,76284,34200,42084,15727,7925,7802,70194,35622,34572,29431,9266,20165,6880,3466,3414,7759,3909,3850,17700,8998,8702,15164,7571,7593,13729,6929,6800,245,83,1,0,67,4,90,0,0,0,244,1,1,143,84,15,2,0.936699728,0.647336757,7.620630408,0.635052534,0.147754269,0.273345319,0.550689737,0.065321162,0.966479097,1.317476,4.655004,8.648587,3.411993084,40.943917,2,3.641772591,4.679744508,2,0.065263344,0.78316013,0.698081318,no,9.503405157,3.103995954,6.444333466,8.132640073,8.698054189,2.887376585,2,no,3.103995974,4,0.694535727,no,9.242585522,32,5.706113234,2,9.034641872,3.489954443,2.72429538,1.246148739,3.419574049,0.725560431,6.910567711,3.424716092,0.668363679,2.000153804,8.97282283,6.127072782,1.161578983,2.543746975,12.64987875,1.47772267,1.852560245,0.686251693,0.519311324,0.688796317,1.072315063,0.273345319,0.967820571,0.471446524,4.841543888,6.829888847,0.709260033,2.358840498,0.23028691,excellent,25.14,0.0,0,0,0,0,5,860.0,1500.0,1180.0,0,1,3,0,0,1,0,0,1,0,0,0,0,26.66,0.07,2,86600,5,94065,13,615.38,1076.92,846.15,0,5,6,1,0,1,0,1,2,0,4,2,0,21.53,7.71,3,102910,7,127065,17,694.12,1205.88,950.0,0,6,7,1,2,1,0,1,5,0,4,9,0,22.37,19.25,4,165510,8,179065,21,695.24,1190.48,942.86,0,7,8,3,2,1,0,1,5,0,4,11,0,18.07,27.32,12,821986,14,491565,30,631.03,1086.21,858.62,1,11,11,4,2,1,0,1,7,0,6,19,1,10.26,27.47,66,2690465,40,2034942,177,673.81,1148.81,911.31,9,49,65,36,15,3,0,15,29,1,10,66,14,6000000,2011-08-23T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,28.9525,41.7537,109.31,0.301810526,46308.5,0.045037137,5.76,6.74,1578.91,1444.11,131.45,204.92,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,5490
75.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
3,2011-08-27T00:00:00.000+0000,43,29,2,,,,,,,Investment,Tekstil'shhiki,4808269.831,101708,0.112559644,0.118537385,5879,1463,4,6207,5580,7,0,1183.0,1,0,5,1,no,0,0,1,no,no,no,yes,no,no,no,no,101982,46076,55906,13028,6835,6193,63388,31813,31575,25292,7609,17683,5879,3095,2784,6207,3269,2938,14884,7821,7063,19401,9045,10356,11252,5916,5336,330,59,0,0,206,4,60,0,1,0,330,1,0,246,63,20,3,2.120998901,1.637996285,17.3515154,1.445959617,0.049101536,0.158071895,0.374847751,0.453172405,0.939275144,4.91266,3.381083,11.99648,1.277658039,15.33189647,3,1.277658039,1.701419537,3,0.328756044,3.945072522,0.468264622,no,5.60479992,2.927487097,6.963402995,8.054252314,9.067884956,0.647249803,3,no,2.927487099,4,0.70069112,no,9.540544478,5,6.710302485,3,5.777393501,7.50661249,0.772216104,1.60218297,3.682454651,3.562187704,5.75236835,1.375442778,0.733101062,1.239303854,1.978517187,0.767568769,1.952770629,0.621357002,7.682302975,0.097143527,0.841254102,1.510088854,1.48653302,1.543048836,0.391957389,0.158071895,3.178751487,0.755946015,7.92215157,4.273200485,3.156422843,4.958214283,0.190461977,poor,1.67,0.0,0,0,0,0,3,666.67,1166.67,916.67,0,0,2,1,0,0,0,0,0,0,0,0,0,4.99,0.29,0,0,0,0,9,642.86,1142.86,892.86,2,0,5,2,0,0,0,0,1,0,0,5,3,9.92,6.73,0,0,1,2600,14,516.67,916.67,716.67,2,4,6,2,0,0,0,0,4,0,0,6,5,12.99,12.75,4,100200,7,52550,24,563.64,977.27,770.45,2,8,9,4,1,0,0,0,4,0,0,8,5,12.14,26.46,8,110856,7,52550,41,697.44,1192.31,944.87,2,9,17,9,3,1,0,0,11,0,0,20,6,13.69,21.58,43,1478160,35,1572990,122,702.68,1196.43,949.55,10,29,45,25,10,3,0,11,27,0,4,67,10,5700000,2011-08-27T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,28.8082,41.7114,111.36,0.301810526,46308.5,0.045037137,5.76,6.74,1596.17,1458.84,131.08,204.84,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.
7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
4,2011-09-01T00:00:00.000+0000,89,50,9,,,,,,,Investment,Mitino,12583535.69,178473,0.194702869,0.069753361,13087,6839,9,13670,17063,10,0,,1,0,17,6,no,0,11,4,no,no,no,no,no,no,no,no,21155,9828,11327,28563,14680,13883,120381,60040,60341,29529,9083,20446,13087,6645,6442,13670,7126,6544,32063,16513,15550,3292,1450,1842,24934,12782,12152,458,9,51,12,124,50,201,0,9,2,459,13,24,40,130,252,4,1.489049154,0.984536582,11.56562408,0.963802007,0.179440956,0.236455018,0.078090293,0.106124506,0.451173311,15.62371,2.01708,14.31764,4.2914325,51.49719001,4,3.816044582,5.271136062,4,0.131596959,1.579163513,1.200336487,no,2.677824281,14.60650078,17.45719794,18.30943312,19.48700542,2.677824284,1,no,2.780448941,17,1.999265421,no,17.47838035,83,6.734618018,1,27.6678632,9.522537611,6.348716334,1.767612439,11.17833328,0.583024969,27.89271688,0.811275289,0.62348431,1.950316967,6.483171621,7.385520691,4.923843177,3.549557568,8.789894266,2.163735157,10.9031613,0.622271644,0.599913582,0.934273498,0.8926743,0.236455018,1.03177679,1.561504846,15.30044908,16.99067736,16.04152067,5.02969633,0.465820158,good,17.36,0.57,0,0,0,0,2,1000.0,1500.0,1250.0,0,0,0,2,0,0,0,0,0,0,0,0,0,19.25,10.35,1,11000,6,80780,12,658.33,1083.33,870.83,0,3,4,5,0,0,0,0,0,0,0,3,1,28.38,6.57,2,11000,7,89492,23,673.91,1130.43,902.17,0,5,9,8,1,0,0,1,0,0,0,9,2,32.29,5.73,2,11000,7,89492,25,660.0,1120.0,890.0,0,5,11,8,1,0,0,1,1,0,0,13,2,20.79,3.57,4,167000,12,205756,32,718.75,1218.75,968.75,0,5,14,10,3,0,0,1,2,0,0,18,3,14.18,3.89,8,244166,22,942180,61,931.58,1552.63,1242.11,4,7,21,15,11,2,1,4,4,0,0,26,3,13100000,2011-09-01T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,28.9655,41.4014,114.29,0.599708879,46308.5,0.045037137,5.76,6.74,1703.86,1551.58,131.45,205.42,10720911,0.009606741,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.
1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
5,2011-09-05T00:00:00.000+0000,77,77,4,,,,,,,Investment,Basmannoe,8398460.622,108171,0.015233744,0.037316452,5706,3240,7,6748,7770,9,0,562.0,4,2,25,2,no,0,10,93,no,no,no,yes,yes,no,no,no,28179,13522,14657,13368,7159,6209,68043,34236,33807,26760,8563,18197,5706,2982,2724,6748,3664,3084,15237,8113,7124,5164,2583,2581,11631,6223,5408,746,48,0,0,643,16,35,0,3,1,746,371,114,146,62,53,5,1.257186453,0.876620232,8.266305238,0.68885877,0.247901208,0.376838057,0.258288769,0.236214054,0.392870988,10.68354,2.936581,11.90391,0.853960072,10.24752087,5,1.59589817,2.156283865,113,0.071480323,0.857763874,0.820294318,no,11.61665314,1.721833675,0.046809568,0.787593311,2.578670647,1.721833683,4,no,3.133530966,10,0.084112545,yes,1.59589817,113,1.423427954,4,6.515857089,8.671015673,1.638318096,3.632640421,4.587916559,2.60941961,9.15505713,1.969737724,0.220287667,2.544696,3.975401349,3.610753828,0.307915375,1.864637406,3.779781109,1.121702845,0.991682626,0.892667526,0.429052137,0.077900959,0.810801456,0.376838057,0.378755838,0.121680643,2.584369607,1.11248589,1.800124877,1.339652258,0.026102416,excellent,3.56,4.44,15,293699,1,45000,48,702.22,1166.67,934.44,3,17,10,11,7,0,0,1,4,0,2,3,0,3.34,8.29,46,420952,3,158200,153,763.45,1272.41,1017.93,8,39,45,39,19,2,1,7,12,0,6,7,0,4.12,4.83,93,1195735,9,445900,272,766.8,1272.73,1019.76,19,70,74,72,30,6,1,18,30,0,10,14,2,4.53,5.02,149,1625130,17,564843,483,765.93,1269.23,1017.58,28,130,129,131,50,14,1,35,61,0,17,21,3,5.06,8.62,305,3420907,60,2296870,1068,853.03,1410.45,1131.74,63,266,267,262,149,57,4,70,121,1,40,77,5,8.38,10.92,689,8404624,114,3503058,2283,853.88,1411.45,1132.66,143,566,578,552,319,108,17,135,236,2,91,195,14,16331452,2011-09-05T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.4625,41.5902,110.08,0.599708879,46308.5,0.045037137,5.76,6.74,1617.01,1492.83,131.19,198.24,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,
0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
6,2011-09-06T00:00:00.000+0000,67,46,14,,,,,,,Investment,Nizhegorodskoe,7506452.02,43795,0.007670134,0.486245621,2418,852,2,2514,2012,3,0,,0,0,7,0,no,0,6,19,yes,no,no,yes,no,no,no,no,19940,9400,10540,5291,2744,2547,29660,15793,13867,8844,2608,6236,2418,1224,1194,2514,1328,1186,5866,3035,2831,4851,2329,2522,4632,2399,2233,188,24,0,0,147,2,15,0,0,0,188,0,5,152,25,6,6,2.735883907,1.593246481,18.37816963,1.531514136,0.145954816,0.113466218,1.073495427,1.497902638,0.256487453,7.18674,0.78033,14.07514,0.375311695,4.503740339,6,0.375311695,1.407418835,6,0.189227153,2.270725835,0.612447325,no,8.296086727,0.284868107,3.519388985,4.395057477,5.645795859,0.284868136,4,no,1.478528507,3,0.244670412,no,5.070196504,5,6.682088764,4,3.95950924,8.757686082,0.193126987,2.34156168,1.272894442,1.438003448,5.374563767,3.447863628,0.81041306,1.911842782,2.108923435,4.233094726,1.450974874,3.391116928,4.356122442,1.698723584,3.830021305,1.042261834,0.440707312,0.422357874,3.066285203,0.113466218,0.686931702,0.870446514,4.787705729,3.388809733,3.71355663,2.553423533,0.004469307,poor,0.0,19.42,5,227705,3,102000,7,1000.0,1625.0,1312.5,3,0,1,2,1,0,0,0,0,0,0,0,0,0.0,40.27,10,275135,5,164000,9,883.33,1416.67,1150.0,3,1,1,3,1,0,0,3,1,0,0,1,0,0.0,50.64,18,431090,6,186400,14,718.18,1181.82,950.0,3,3,3,4,1,0,0,4,2,0,0,11,0,0.38,51.58,21,471290,14,683945,33,741.38,1258.62,1000.0,4,5,13,8,2,1,0,6,5,0,0,21,1,1.82,39.99,54,1181009,29,1059171,120,737.96,1231.48,984.72,12,24,37,35,11,1,0,12,12,0,2,31,7,5.92,25.79,253,4274339,63,2010320,567,769.92,1280.08,1025.0,35,137,163,155,62,14,1,53,78,1,20,113,17,9100000,2011-09-06T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.6676,41.5031,112.89,0.599708879,46308.5,0.045037137,5.76,6.74,1618.63,1499.7,131.24,203.16,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3
.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
7,2011-09-08T00:00:00.000+0000,25,14,10,,,,,,,Investment,Sokol'niki,10320473.23,57405,0.523439075,0.042307271,2459,933,5,2810,5050,5,0,4849.0,3,0,17,6,no,0,6,9,no,no,no,no,no,no,no,yes,85956,40724,45232,5682,2925,2757,35003,17490,17513,16720,5351,11369,2459,1241,1218,2810,1472,1338,6510,3345,3165,19445,10085,9360,4884,2507,2377,217,23,1,0,139,12,41,0,0,1,216,11,38,90,58,19,7,1.453761834,1.026916052,12.32299262,1.026916052,0.501165569,0.516979177,2.393238738,0.632444746,0.707268095,13.9007,1.371596,10.32988,2.603670532,31.24404639,7,3.031337419,3.959288563,7,0.050211053,0.602532635,0.399865062,no,8.618597103,2.026622694,3.940565824,4.686895435,6.468718768,2.026622672,4,no,3.657904476,20,0.464015082,no,3.028547461,97,4.062549158,4,6.164593843,11.80753189,2.10648677,1.298839815,5.106633095,3.345431932,13.70496751,1.825870151,0.132256233,1.278911604,1.733264299,1.877747146,0.588118977,0.198894455,1.374586648,0.868403997,3.168781583,0.544311589,0.513689337,0.067724564,0.27301919,0.516979177,0.822377431,0.318230275,4.48494144,0.75222301,2.969314705,3.373342934,0.447173592,poor,0.0,0.0,3,7719,0,0,2,750.0,1250.0,1000.0,0,0,1,1,0,0,0,0,2,0,1,2,0,14.59,4.16,8,151911,3,19400,16,760.0,1300.0,1030.0,1,3,7,3,1,1,0,2,4,0,2,7,0,20.5,5.57,20,453371,9,105241,44,673.17,1158.54,915.85,3,12,17,7,4,1,0,4,11,0,3,20,1,23.45,5.25,42,791467,17,510141,71,720.0,1215.38,967.69,6,16,24,17,7,1,0,9,14,0,4,28,1,22.2,7.57,72,1387991,24,743439,160,698.04,1183.01,940.52,7,41,57,37,16,2,0,17,29,0,8,56,2,25.23,12.76,228,3305831,49,1656529,635,771.05,1291.18,1031.11,34,163,194,144,81,16,3,38,80,1,27,127,8,5500000,2011-09-08T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.6304,41.2839,114.55,0.599708879,46308.5,0.045037137,5.76,6.74,1675.14,1556.46,131.41,205.19,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565
,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
8,2011-09-09T00:00:00.000+0000,44,44,5,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,8,3.623270141,1.593203382,11.52778764,0.96064897,0.134365152,0.581510695,1.602056341,0.165124113,1.166991896,23.11838,1.131798,4.042541,4.309574655,51.71489587,8,5.622394422,7.453825079,1,0.25481389,3.057766679,0.567006418,no,1.04229051,11.53414568,13.71375173,14.2861768,15.77042634,1.042290487,1,no,3.303551988,36,1.964850803,no,14.4344623,101,23.39166932,1,18.84810404,5.33292585,1.149883196,0.952974662,5.713887982,4.871096986,9.913434734,2.795147723,0.342545086,2.090237161,2.359385347,8.114016035,3.495441189,1.69654313,4.213461148,0.746259906,6.917527177,1.09857248,0.657587332,0.872231483,0.371856693,0.581510695,1.079769307,0.558238725,4.469517135,13.15230977,8.018089004,7.718966846,0.373715085,good,17.62,0.0,0,0,0,0,4,1000.0,1625.0,1312.5,0,0,1,2,1,0,0,0,0,0,0,1,0,26.27,0.0,1,30500,3,13200,13,784.62,1269.23,1026.92,0,4,2,5,2,0,0,0,1,0,0,3,0,17.42,1.97,2,70500,11,207671,29,585.71,982.14,783.93,1,13,7,6,2,0,0,1,3,0,0,7,1,15.02,7.13,6,113554,12,226291,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,2,4,0,0,9,1,11.47,13.56,12,251554,21,1377234,55,623.53,1049.02,836.27,4,21,15,11,3,1,0,3,4,0,0,18,2,15.8,11.44,24,711780,45,2197401,143,709.09,1189.39,949.24,11,37,46,36,9,4,0,11,18,1,0,47,4,2000000,2011-09-09T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.0495,41.1192,112.77,0.599708879,46308.5,0.045037137,5.76,6.74,1625.29,1518.22,131.33,205.75,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106
.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
9,2011-09-10T00:00:00.000+0000,42,27,5,,,,,,,Investment,Koptevo,4787424.272,96959,0.03845587,0.122090347,6507,3272,4,6566,8751,6,0,1894.0,4,0,7,6,no,0,0,7,no,no,no,no,no,no,no,no,78810,36091,42719,13901,7344,6557,59120,29664,29456,23938,6980,16958,6507,3456,3051,6566,3453,3113,15510,8201,7309,17662,8488,9174,12269,6487,5782,390,101,2,0,216,11,60,0,0,0,390,1,9,290,39,51,9,2.175430788,1.338707436,14.74228941,1.228524117,0.11290453,0.256589329,0.939584123,0.148956733,0.670431737,20.94097,1.759887,3.925497,1.540958494,18.49150192,9,1.540958494,2.428244611,9,0.219775841,2.637310091,0.322954506,no,6.53914608,5.514867811,8.290836714,8.963023472,10.357535,2.438125457,5,no,2.725990867,14,0.83357986,no,7.50071515,121,8.571925744,5,16.911467,1.652274682,2.679181439,1.307362663,6.178131848,4.421560784,18.18034021,1.263583558,0.0,2.467325771,0.540760283,0.731141912,0.799921119,1.782439237,3.432144582,1.032523625,1.204220449,0.256589329,2.332387982,1.143674241,0.217477464,0.256589329,0.554880314,0.555962558,4.43275291,6.788241041,5.273171011,2.449453233,0.123609825,poor,7.71,0.0,0,0,0,0,7,633.33,1083.33,858.33,1,1,3,2,0,0,0,0,0,0,0,5,0,19.25,6.55,0,0,0,0,10,566.67,1000.0,783.33,1,2,5,2,0,0,0,2,2,0,0,6,0,19.83,19.76,5,116353,0,0,15,592.31,1038.46,815.38,2,4,6,2,1,0,0,2,3,0,0,9,2,22.0,19.72,12,144351,0,0,18,593.75,1031.25,812.5,2,5,7,3,1,0,0,2,3,0,0,14,3,17.87,21.59,33,872741,12,503283,98,572.41,965.52,768.97,11,36,28,21,1,1,0,4,8,0,1,24,6,18.73,14.52,94,2283313,41,1051314,292,711.36,1185.61,948.48,28,86,81,69,19,8,1,18,34,1,3,85,11,5300000,2011-09-10T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.0495,41.1192,112.77,0.599708879,46308.5,0.045037137,5.76,6.74,1625.29,1518.22,131.33,205.75,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18
.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
10,2011-09-13T00:00:00.000+0000,36,21,9,,,,,,,Investment,Kuncevo,52351766.54,142462,0.070662054,0.035145267,9347,4050,7,9292,11234,8,0,2620.0,0,0,5,2,no,0,3,3,no,no,no,yes,no,no,no,no,78507,37052,41455,19784,10339,9445,85551,43352,42199,37127,11024,26103,9347,4806,4541,9292,4919,4373,22071,11484,10587,15929,8541,7388,17469,9082,8387,737,68,204,14,237,21,180,2,10,1,738,47,88,413,94,96,10,1.383373263,0.396274723,4.755296675,0.396274723,0.309672803,0.150686282,1.969773852,0.191191251,2.024970785,16.75309,2.790526,16.5619,2.522421641,30.26905969,10,1.673206368,2.735303613,22,0.218409543,2.62091451,1.021499072,no,3.953500268,5.885973152,8.955785282,10.00345994,10.86912007,1.101813754,6,no,1.102384108,9,0.147894252,no,10.22914523,50,13.77381942,5,20.09408343,7.353920421,0.61502045,0.453543783,4.154171002,2.12435988,20.46122679,5.174844257,0.508801924,1.281865943,5.741238918,5.076912538,2.969620143,1.35479157,5.673912846,2.604184808,5.23895598,0.150686282,0.912378535,0.459233791,1.794687627,0.150686282,1.167638745,1.150550082,3.557616808,4.373416959,5.148563093,0.711874878,0.345225166,satisfactory,33.79,0.0,1,15565,0,0,2,400.0,750.0,575.0,0,1,1,0,0,0,0,0,0,0,0,0,0,29.35,0.0,2,46565,1,16600,5,620.0,1100.0,860.0,0,2,2,0,1,0,0,0,0,0,1,7,0,25.02,0.0,2,46565,1,16600,9,788.89,1388.89,1088.89,0,2,4,0,3,0,0,2,2,0,1,10,0,20.47,0.0,2,46565,4,33440,22,861.9,1476.19,1169.05,1,2,10,4,4,1,0,2,2,0,1,13,0,22.56,7.15,10,635269,12,545761,56,1050.94,1735.85,1393.4,3,9,15,12,9,8,0,5,9,0,1,32,0,23.35,10.5,30,1132519,32,1404650,143,921.71,1531.01,1226.36,14,28,39,30,19,13,0,10,20,1,3,67,1,2000000,2011-09-13T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.2323,41.4451,111.89,0.599708879,46308.5,0.045037137,5.76,6.74,1572.0,1498.16,131.19,205.94,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8
,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310


In [15]:
spark.catalog.dropTempView("sberbank_housing_macro")

In [16]:
# Persist the wide, joined dataset before any of its columns are altered

permanent_table_name = "sberbank_housing_macro"
sberbank_housing_macro.write.format("parquet").mode('overwrite').saveAsTable(permanent_table_name)

## FEATURE ENGINEERING
### PARQUET -> PARQUET

In [18]:
from pyspark.sql.functions import udf
from pyspark.sql.types import StringType, IntegerType

def yn_conversion(val):
  """Map 'yes'/'y' to 1 and 'no'/'n' to 0 (case-insensitive); return None for anything else."""
  text = str(val).lower()
  if text in ('yes', 'y'):
    return 1
  if text in ('no', 'n'):
    return 0
  return None


# Replace the literal string "NA" with a real null; leave other values unchanged.
to_none_udf = udf(lambda val: None if val == "NA" else val, StringType())
# Convert yes/no flag values to 1/0 integers.
yn_conversion_udf = udf(yn_conversion, IntegerType())

spark.udf.register("to_none_udf", to_none_udf)
spark.udf.register("yn_conversion_udf", yn_conversion_udf)
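
UDF logic is easier to check as plain Python before it runs on a cluster. The sketch below copies the two conversion rules into ordinary functions and runs them over a few sample values. The name `to_none` and the sample list are hypothetical additions for illustration; the rules themselves mirror the cell above.

```python
def to_none(val):
    """Same rule as to_none_udf: the literal string "NA" becomes None."""
    return None if val == "NA" else val


def yn_conversion(val):
    """Same rule as yn_conversion_udf: yes/y -> 1, no/n -> 0, else None."""
    text = str(val).lower()
    if text in ("yes", "y"):
        return 1
    if text in ("no", "n"):
        return 0
    return None


# Hypothetical sample values covering mixed case, "NA", nulls and other text.
samples = ["yes", "No", "Y", "n", "NA", None, "maybe"]
results = [yn_conversion(to_none(value)) for value in samples]

for value, result in zip(samples, results):
    print(repr(value), "->", result)
```

Note that any value outside the four recognized spellings, including `"NA"` and nulls, maps to `None` rather than raising an error, so unexpected values in a flag column surface as nulls after conversion.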

In [19]:
%sql
SELECT *
FROM sberbank_housing_macro
LIMIT 10

id,sberbank_timestamp,full_sq,life_sq,floor,max_floor,material,build_year,num_room,kitch_sq,state,product_type,sub_area,area_m,raion_popul,green_zone_part,indust_part,children_preschool,preschool_quota,preschool_education_centers_raion,children_school,school_quota,school_education_centers_raion,school_education_centers_top_20_raion,hospital_beds_raion,healthcare_centers_raion,university_top_20_raion,sport_objects_raion,additional_education_raion,culture_objects_top_25,culture_objects_top_25_raion,shopping_centers_raion,office_raion,thermal_power_plant_raion,incineration_raion,oil_chemistry_raion,radiation_raion,railroad_terminal_raion,big_market_raion,nuclear_reactor_raion,detention_facility_raion,full_all,male_f,female_f,young_all,young_male,young_female,work_all,work_male,work_female,ekder_all,ekder_male,ekder_female,0_6_all,0_6_male,0_6_female,7_14_all,7_14_male,7_14_female,0_17_all,0_17_male,0_17_female,16_29_all,16_29_male,16_29_female,0_13_all,0_13_male,0_13_female,raion_build_count_with_material_info,build_count_block,build_count_wood,build_count_frame,build_count_brick,build_count_monolith,build_count_panel,build_count_foam,build_count_slag,build_count_mix,raion_build_count_with_builddate_info,build_count_before_1920,build_count_1921-1945,build_count_1946-1970,build_count_1971-1995,build_count_after_1995,ID_metro,metro_min_avto,metro_km_avto,metro_min_walk,metro_km_walk,kindergarten_km,school_km,park_km,green_zone_km,industrial_km,water_treatment_km,cemetery_km,incineration_km,railroad_station_walk_km,railroad_station_walk_min,ID_railroad_station_walk,railroad_station_avto_km,railroad_station_avto_min,ID_railroad_station_avto,public_transport_station_km,public_transport_station_min_walk,water_km,water_1line,mkad_km,ttk_km,sadovoe_km,bulvar_ring_km,kremlin_km,big_road1_km,ID_big_road1,big_road1_1line,big_road2_km,ID_big_road2,railroad_km,railroad_1line,zd_vokzaly_avto_km,ID_railroad_terminal,bus_terminal_avto_km,ID_bus_terminal,oil_chemistry_km,nuclear_reacto
r_km,radiation_km,power_transmission_line_km,thermal_power_plant_km,ts_km,big_market_km,market_shop_km,fitness_km,swim_pool_km,ice_rink_km,stadium_km,basketball_km,hospice_morgue_km,detention_facility_km,public_healthcare_km,university_km,workplaces_km,shopping_centers_km,office_km,additional_education_km,preschool_km,big_church_km,church_synagogue_km,mosque_km,theater_km,museum_km,exhibition_km,catering_km,ecology,green_part_500,prom_part_500,office_count_500,office_sqm_500,trc_count_500,trc_sqm_500,cafe_count_500,cafe_sum_500_min_price_avg,cafe_sum_500_max_price_avg,cafe_avg_price_500,cafe_count_500_na_price,cafe_count_500_price_500,cafe_count_500_price_1000,cafe_count_500_price_1500,cafe_count_500_price_2500,cafe_count_500_price_4000,cafe_count_500_price_high,big_church_count_500,church_count_500,mosque_count_500,leisure_count_500,sport_count_500,market_count_500,green_part_1000,prom_part_1000,office_count_1000,office_sqm_1000,trc_count_1000,trc_sqm_1000,cafe_count_1000,cafe_sum_1000_min_price_avg,cafe_sum_1000_max_price_avg,cafe_avg_price_1000,cafe_count_1000_na_price,cafe_count_1000_price_500,cafe_count_1000_price_1000,cafe_count_1000_price_1500,cafe_count_1000_price_2500,cafe_count_1000_price_4000,cafe_count_1000_price_high,big_church_count_1000,church_count_1000,mosque_count_1000,leisure_count_1000,sport_count_1000,market_count_1000,green_part_1500,prom_part_1500,office_count_1500,office_sqm_1500,trc_count_1500,trc_sqm_1500,cafe_count_1500,cafe_sum_1500_min_price_avg,cafe_sum_1500_max_price_avg,cafe_avg_price_1500,cafe_count_1500_na_price,cafe_count_1500_price_500,cafe_count_1500_price_1000,cafe_count_1500_price_1500,cafe_count_1500_price_2500,cafe_count_1500_price_4000,cafe_count_1500_price_high,big_church_count_1500,church_count_1500,mosque_count_1500,leisure_count_1500,sport_count_1500,market_count_1500,green_part_2000,prom_part_2000,office_count_2000,office_sqm_2000,trc_count_2000,trc_sqm_2000,cafe_count_2000,cafe_sum_2000_min_price_avg,cafe_sum_2000_max_
price_avg,cafe_avg_price_2000,cafe_count_2000_na_price,cafe_count_2000_price_500,cafe_count_2000_price_1000,cafe_count_2000_price_1500,cafe_count_2000_price_2500,cafe_count_2000_price_4000,cafe_count_2000_price_high,big_church_count_2000,church_count_2000,mosque_count_2000,leisure_count_2000,sport_count_2000,market_count_2000,green_part_3000,prom_part_3000,office_count_3000,office_sqm_3000,trc_count_3000,trc_sqm_3000,cafe_count_3000,cafe_sum_3000_min_price_avg,cafe_sum_3000_max_price_avg,cafe_avg_price_3000,cafe_count_3000_na_price,cafe_count_3000_price_500,cafe_count_3000_price_1000,cafe_count_3000_price_1500,cafe_count_3000_price_2500,cafe_count_3000_price_4000,cafe_count_3000_price_high,big_church_count_3000,church_count_3000,mosque_count_3000,leisure_count_3000,sport_count_3000,market_count_3000,green_part_5000,prom_part_5000,office_count_5000,office_sqm_5000,trc_count_5000,trc_sqm_5000,cafe_count_5000,cafe_sum_5000_min_price_avg,cafe_sum_5000_max_price_avg,cafe_avg_price_5000,cafe_count_5000_na_price,cafe_count_5000_price_500,cafe_count_5000_price_1000,cafe_count_5000_price_1500,cafe_count_5000_price_2500,cafe_count_5000_price_4000,cafe_count_5000_price_high,big_church_count_5000,church_count_5000,mosque_count_5000,leisure_count_5000,sport_count_5000,market_count_5000,price_doc,macro_timestamp,oil_urals,gdp_quart,gdp_quart_growth,cpi,ppi,gdp_deflator,balance_trade,balance_trade_growth,usdrub,eurrub,brent,net_capital_export,gdp_annual,gdp_annual_growth,average_provision_of_build_contract,average_provision_of_build_contract_moscow,rts,micex,micex_rgbi_tr,micex_cbi_tr,deposits_value,deposits_growth,deposits_rate,mortgage_value,mortgage_growth,mortgage_rate,grp,grp_growth,income_per_cap,real_dispos_income_per_cap_growth,salary,salary_growth,fixed_basket,retail_trade_turnover,retail_trade_turnover_per_cap,retail_trade_turnover_growth,labor_force,unemployment,employment,invest_fixed_capital_per_cap,invest_fixed_assets,profitable_enterpr_share,unprofitable_enterpr_sha
re,share_own_revenues,overdue_wages_per_cap,fin_res_per_cap,marriages_per_1000_cap,divorce_rate,construction_value,invest_fixed_assets_phys,pop_natural_increase,pop_migration,pop_total_inc,childbirth,mortality,housing_fund_sqm,lodging_sqm_per_cap,water_pipes_share,baths_share,sewerage_share,gas_share,hot_water_share,electric_stove_share,heating_share,old_house_share,average_life_exp,infant_mortarity_per_1000_cap,perinatal_mort_per_1000_cap,incidence_population,rent_price_4+room_bus,rent_price_3room_bus,rent_price_2room_bus,rent_price_1room_bus,rent_price_3room_eco,rent_price_2room_eco,rent_price_1room_eco,load_of_teachers_preschool_per_teacher,child_on_acc_pre_school,load_of_teachers_school_per_teacher,students_state_oneshift,modern_education_share,old_education_build_share,provision_doctors,provision_nurse,load_on_doctors,power_clinics,hospital_beds_available_per_cap,hospital_bed_occupancy_per_year,provision_retail_space_sqm,provision_retail_space_modern_sqm,turnover_catering_per_cap,theaters_viewers_per_1000_cap,seats_theather_rfmin_per_100000_cap,museum_visitis_per_100_cap,bandwidth_sports,population_reg_sports_share,students_reg_sports_share,apartment_build,apartment_fund_sqm
1,2011-08-20T00:00:00.000+0000,43,27,4,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,1,2.590241095,1.131259906,13.57511887,1.131259906,0.145699552,0.17797535,2.158587074,0.600973099,1.080934313,23.68346,1.804127,3.633334,5.419893032,65.03871639,1,5.419893032,6.905892968,1,0.274985143,3.299821714,0.992631058,no,1.42239141,10.9185867,13.10061764,13.67565705,15.15621058,1.422391404,1,no,3.830951404,5,1.305159492,no,14.23196091,101,24.2924061,1,18.152338,5.718518835,1.210027392,1.062513046,5.814134663,4.308127002,10.81417151,1.676258313,0.485841388,3.065047099,1.107594209,8.148590774,3.516512911,2.392353035,4.248035887,0.974742843,6.715025787,0.884350021,0.648487637,0.637188832,0.947961657,0.17797535,0.625783434,0.628186549,3.932040333,14.05304655,7.389497904,7.023704919,0.516838085,good,0.0,0.0,0,0,0,0,0,,,,0,0,0,0,0,0,0,0,0,0,0,1,0,7.36,0.0,1,30500,3,55600,19,527.78,888.89,708.33,1,10,4,3,1,0,0,1,2,0,0,6,1,14.27,6.92,3,39554,9,171420,34,566.67,969.7,768.18,1,14,11,6,2,0,0,1,2,0,0,7,1,11.77,15.97,9,188854,19,1244891,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,1,2,0,0,10,1,11.98,13.55,12,251554,23,1419204,68,639.68,1079.37,859.52,5,21,22,16,3,1,0,2,4,0,0,21,1,13.09,13.31,29,807385,52,4036616,152,708.57,1185.71,947.14,12,39,48,40,9,4,0,13,22,1,0,52,4,5850000,2011-08-20T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,29.0048,41.7681,108.62,0.301810526,46308.5,0.045037137,5.76,6.74,1575.33,1438.74,131.16,204.78,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8
,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
2,2011-08-23T00:00:00.000+0000,34,19,3,,,,,,,Investment,Nagatinskij Zaton,9589336.912,115352,0.372602044,0.049637257,6880,3119,5,7759,6237,8,0,229.0,1,0,6,1,yes,1,3,0,no,no,no,no,no,no,no,no,76284,34200,42084,15727,7925,7802,70194,35622,34572,29431,9266,20165,6880,3466,3414,7759,3909,3850,17700,8998,8702,15164,7571,7593,13729,6929,6800,245,83,1,0,67,4,90,0,0,0,244,1,1,143,84,15,2,0.936699728,0.647336757,7.620630408,0.635052534,0.147754269,0.273345319,0.550689737,0.065321162,0.966479097,1.317476,4.655004,8.648587,3.411993084,40.943917,2,3.641772591,4.679744508,2,0.065263344,0.78316013,0.698081318,no,9.503405157,3.103995954,6.444333466,8.132640073,8.698054189,2.887376585,2,no,3.103995974,4,0.694535727,no,9.242585522,32,5.706113234,2,9.034641872,3.489954443,2.72429538,1.246148739,3.419574049,0.725560431,6.910567711,3.424716092,0.668363679,2.000153804,8.97282283,6.127072782,1.161578983,2.543746975,12.64987875,1.47772267,1.852560245,0.686251693,0.519311324,0.688796317,1.072315063,0.273345319,0.967820571,0.471446524,4.841543888,6.829888847,0.709260033,2.358840498,0.23028691,excellent,25.14,0.0,0,0,0,0,5,860.0,1500.0,1180.0,0,1,3,0,0,1,0,0,1,0,0,0,0,26.66,0.07,2,86600,5,94065,13,615.38,1076.92,846.15,0,5,6,1,0,1,0,1,2,0,4,2,0,21.53,7.71,3,102910,7,127065,17,694.12,1205.88,950.0,0,6,7,1,2,1,0,1,5,0,4,9,0,22.37,19.25,4,165510,8,179065,21,695.24,1190.48,942.86,0,7,8,3,2,1,0,1,5,0,4,11,0,18.07,27.32,12,821986,14,491565,30,631.03,1086.21,858.62,1,11,11,4,2,1,0,1,7,0,6,19,1,10.26,27.47,66,2690465,40,2034942,177,673.81,1148.81,911.31,9,49,65,36,15,3,0,15,29,1,10,66,14,6000000,2011-08-23T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,28.9525,41.7537,109.31,0.301810526,46308.5,0.045037137,5.76,6.74,1578.91,1444.11,131.45,204.92,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,5490
75.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
3,2011-08-27T00:00:00.000+0000,43,29,2,,,,,,,Investment,Tekstil'shhiki,4808269.831,101708,0.112559644,0.118537385,5879,1463,4,6207,5580,7,0,1183.0,1,0,5,1,no,0,0,1,no,no,no,yes,no,no,no,no,101982,46076,55906,13028,6835,6193,63388,31813,31575,25292,7609,17683,5879,3095,2784,6207,3269,2938,14884,7821,7063,19401,9045,10356,11252,5916,5336,330,59,0,0,206,4,60,0,1,0,330,1,0,246,63,20,3,2.120998901,1.637996285,17.3515154,1.445959617,0.049101536,0.158071895,0.374847751,0.453172405,0.939275144,4.91266,3.381083,11.99648,1.277658039,15.33189647,3,1.277658039,1.701419537,3,0.328756044,3.945072522,0.468264622,no,5.60479992,2.927487097,6.963402995,8.054252314,9.067884956,0.647249803,3,no,2.927487099,4,0.70069112,no,9.540544478,5,6.710302485,3,5.777393501,7.50661249,0.772216104,1.60218297,3.682454651,3.562187704,5.75236835,1.375442778,0.733101062,1.239303854,1.978517187,0.767568769,1.952770629,0.621357002,7.682302975,0.097143527,0.841254102,1.510088854,1.48653302,1.543048836,0.391957389,0.158071895,3.178751487,0.755946015,7.92215157,4.273200485,3.156422843,4.958214283,0.190461977,poor,1.67,0.0,0,0,0,0,3,666.67,1166.67,916.67,0,0,2,1,0,0,0,0,0,0,0,0,0,4.99,0.29,0,0,0,0,9,642.86,1142.86,892.86,2,0,5,2,0,0,0,0,1,0,0,5,3,9.92,6.73,0,0,1,2600,14,516.67,916.67,716.67,2,4,6,2,0,0,0,0,4,0,0,6,5,12.99,12.75,4,100200,7,52550,24,563.64,977.27,770.45,2,8,9,4,1,0,0,0,4,0,0,8,5,12.14,26.46,8,110856,7,52550,41,697.44,1192.31,944.87,2,9,17,9,3,1,0,0,11,0,0,20,6,13.69,21.58,43,1478160,35,1572990,122,702.68,1196.43,949.55,10,29,45,25,10,3,0,11,27,0,4,67,10,5700000,2011-08-27T00:00:00.000+0000,109.31,14313.7,3.3,354.0,420.7,86.721,15.459,10.1,28.8082,41.7114,111.36,0.301810526,46308.5,0.045037137,5.76,6.74,1596.17,1458.84,131.08,204.84,10618898,0.009740346,4.1,323275,1.051914337,11.84,9948.7728,0.187790661,42688.6,-0.005,44898.7,0.168917353,12838.36,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.
7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,136.11,77.93,62.89,47.85,41.8,36.77,29.07,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
4,2011-09-01T00:00:00.000+0000,89,50,9,,,,,,,Investment,Mitino,12583535.69,178473,0.194702869,0.069753361,13087,6839,9,13670,17063,10,0,,1,0,17,6,no,0,11,4,no,no,no,no,no,no,no,no,21155,9828,11327,28563,14680,13883,120381,60040,60341,29529,9083,20446,13087,6645,6442,13670,7126,6544,32063,16513,15550,3292,1450,1842,24934,12782,12152,458,9,51,12,124,50,201,0,9,2,459,13,24,40,130,252,4,1.489049154,0.984536582,11.56562408,0.963802007,0.179440956,0.236455018,0.078090293,0.106124506,0.451173311,15.62371,2.01708,14.31764,4.2914325,51.49719001,4,3.816044582,5.271136062,4,0.131596959,1.579163513,1.200336487,no,2.677824281,14.60650078,17.45719794,18.30943312,19.48700542,2.677824284,1,no,2.780448941,17,1.999265421,no,17.47838035,83,6.734618018,1,27.6678632,9.522537611,6.348716334,1.767612439,11.17833328,0.583024969,27.89271688,0.811275289,0.62348431,1.950316967,6.483171621,7.385520691,4.923843177,3.549557568,8.789894266,2.163735157,10.9031613,0.622271644,0.599913582,0.934273498,0.8926743,0.236455018,1.03177679,1.561504846,15.30044908,16.99067736,16.04152067,5.02969633,0.465820158,good,17.36,0.57,0,0,0,0,2,1000.0,1500.0,1250.0,0,0,0,2,0,0,0,0,0,0,0,0,0,19.25,10.35,1,11000,6,80780,12,658.33,1083.33,870.83,0,3,4,5,0,0,0,0,0,0,0,3,1,28.38,6.57,2,11000,7,89492,23,673.91,1130.43,902.17,0,5,9,8,1,0,0,1,0,0,0,9,2,32.29,5.73,2,11000,7,89492,25,660.0,1120.0,890.0,0,5,11,8,1,0,0,1,1,0,0,13,2,20.79,3.57,4,167000,12,205756,32,718.75,1218.75,968.75,0,5,14,10,3,0,0,1,2,0,0,18,3,14.18,3.89,8,244166,22,942180,61,931.58,1552.63,1242.11,4,7,21,15,11,2,1,4,4,0,0,26,3,13100000,2011-09-01T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,28.9655,41.4014,114.29,0.599708879,46308.5,0.045037137,5.76,6.74,1703.86,1551.58,131.45,205.42,10720911,0.009606741,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.
1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
5,2011-09-05T00:00:00.000+0000,77,77,4,,,,,,,Investment,Basmannoe,8398460.622,108171,0.015233744,0.037316452,5706,3240,7,6748,7770,9,0,562.0,4,2,25,2,no,0,10,93,no,no,no,yes,yes,no,no,no,28179,13522,14657,13368,7159,6209,68043,34236,33807,26760,8563,18197,5706,2982,2724,6748,3664,3084,15237,8113,7124,5164,2583,2581,11631,6223,5408,746,48,0,0,643,16,35,0,3,1,746,371,114,146,62,53,5,1.257186453,0.876620232,8.266305238,0.68885877,0.247901208,0.376838057,0.258288769,0.236214054,0.392870988,10.68354,2.936581,11.90391,0.853960072,10.24752087,5,1.59589817,2.156283865,113,0.071480323,0.857763874,0.820294318,no,11.61665314,1.721833675,0.046809568,0.787593311,2.578670647,1.721833683,4,no,3.133530966,10,0.084112545,yes,1.59589817,113,1.423427954,4,6.515857089,8.671015673,1.638318096,3.632640421,4.587916559,2.60941961,9.15505713,1.969737724,0.220287667,2.544696,3.975401349,3.610753828,0.307915375,1.864637406,3.779781109,1.121702845,0.991682626,0.892667526,0.429052137,0.077900959,0.810801456,0.376838057,0.378755838,0.121680643,2.584369607,1.11248589,1.800124877,1.339652258,0.026102416,excellent,3.56,4.44,15,293699,1,45000,48,702.22,1166.67,934.44,3,17,10,11,7,0,0,1,4,0,2,3,0,3.34,8.29,46,420952,3,158200,153,763.45,1272.41,1017.93,8,39,45,39,19,2,1,7,12,0,6,7,0,4.12,4.83,93,1195735,9,445900,272,766.8,1272.73,1019.76,19,70,74,72,30,6,1,18,30,0,10,14,2,4.53,5.02,149,1625130,17,564843,483,765.93,1269.23,1017.58,28,130,129,131,50,14,1,35,61,0,17,21,3,5.06,8.62,305,3420907,60,2296870,1068,853.03,1410.45,1131.74,63,266,267,262,149,57,4,70,121,1,40,77,5,8.38,10.92,689,8404624,114,3503058,2283,853.88,1411.45,1132.66,143,566,578,552,319,108,17,135,236,2,91,195,14,16331452,2011-09-05T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.4625,41.5902,110.08,0.599708879,46308.5,0.045037137,5.76,6.74,1617.01,1492.83,131.19,198.24,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,
0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
6,2011-09-06T00:00:00.000+0000,67,46,14,,,,,,,Investment,Nizhegorodskoe,7506452.02,43795,0.007670134,0.486245621,2418,852,2,2514,2012,3,0,,0,0,7,0,no,0,6,19,yes,no,no,yes,no,no,no,no,19940,9400,10540,5291,2744,2547,29660,15793,13867,8844,2608,6236,2418,1224,1194,2514,1328,1186,5866,3035,2831,4851,2329,2522,4632,2399,2233,188,24,0,0,147,2,15,0,0,0,188,0,5,152,25,6,6,2.735883907,1.593246481,18.37816963,1.531514136,0.145954816,0.113466218,1.073495427,1.497902638,0.256487453,7.18674,0.78033,14.07514,0.375311695,4.503740339,6,0.375311695,1.407418835,6,0.189227153,2.270725835,0.612447325,no,8.296086727,0.284868107,3.519388985,4.395057477,5.645795859,0.284868136,4,no,1.478528507,3,0.244670412,no,5.070196504,5,6.682088764,4,3.95950924,8.757686082,0.193126987,2.34156168,1.272894442,1.438003448,5.374563767,3.447863628,0.81041306,1.911842782,2.108923435,4.233094726,1.450974874,3.391116928,4.356122442,1.698723584,3.830021305,1.042261834,0.440707312,0.422357874,3.066285203,0.113466218,0.686931702,0.870446514,4.787705729,3.388809733,3.71355663,2.553423533,0.004469307,poor,0.0,19.42,5,227705,3,102000,7,1000.0,1625.0,1312.5,3,0,1,2,1,0,0,0,0,0,0,0,0,0.0,40.27,10,275135,5,164000,9,883.33,1416.67,1150.0,3,1,1,3,1,0,0,3,1,0,0,1,0,0.0,50.64,18,431090,6,186400,14,718.18,1181.82,950.0,3,3,3,4,1,0,0,4,2,0,0,11,0,0.38,51.58,21,471290,14,683945,33,741.38,1258.62,1000.0,4,5,13,8,2,1,0,6,5,0,0,21,1,1.82,39.99,54,1181009,29,1059171,120,737.96,1231.48,984.72,12,24,37,35,11,1,0,12,12,0,2,31,7,5.92,25.79,253,4274339,63,2010320,567,769.92,1280.08,1025.0,35,137,163,155,62,14,1,53,78,1,20,113,17,9100000,2011-09-06T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.6676,41.5031,112.89,0.599708879,46308.5,0.045037137,5.76,6.74,1618.63,1499.7,131.24,203.16,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3
.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
7,2011-09-08T00:00:00.000+0000,25,14,10,,,,,,,Investment,Sokol'niki,10320473.23,57405,0.523439075,0.042307271,2459,933,5,2810,5050,5,0,4849.0,3,0,17,6,no,0,6,9,no,no,no,no,no,no,no,yes,85956,40724,45232,5682,2925,2757,35003,17490,17513,16720,5351,11369,2459,1241,1218,2810,1472,1338,6510,3345,3165,19445,10085,9360,4884,2507,2377,217,23,1,0,139,12,41,0,0,1,216,11,38,90,58,19,7,1.453761834,1.026916052,12.32299262,1.026916052,0.501165569,0.516979177,2.393238738,0.632444746,0.707268095,13.9007,1.371596,10.32988,2.603670532,31.24404639,7,3.031337419,3.959288563,7,0.050211053,0.602532635,0.399865062,no,8.618597103,2.026622694,3.940565824,4.686895435,6.468718768,2.026622672,4,no,3.657904476,20,0.464015082,no,3.028547461,97,4.062549158,4,6.164593843,11.80753189,2.10648677,1.298839815,5.106633095,3.345431932,13.70496751,1.825870151,0.132256233,1.278911604,1.733264299,1.877747146,0.588118977,0.198894455,1.374586648,0.868403997,3.168781583,0.544311589,0.513689337,0.067724564,0.27301919,0.516979177,0.822377431,0.318230275,4.48494144,0.75222301,2.969314705,3.373342934,0.447173592,poor,0.0,0.0,3,7719,0,0,2,750.0,1250.0,1000.0,0,0,1,1,0,0,0,0,2,0,1,2,0,14.59,4.16,8,151911,3,19400,16,760.0,1300.0,1030.0,1,3,7,3,1,1,0,2,4,0,2,7,0,20.5,5.57,20,453371,9,105241,44,673.17,1158.54,915.85,3,12,17,7,4,1,0,4,11,0,3,20,1,23.45,5.25,42,791467,17,510141,71,720.0,1215.38,967.69,6,16,24,17,7,1,0,9,14,0,4,28,1,22.2,7.57,72,1387991,24,743439,160,698.04,1183.01,940.52,7,41,57,37,16,2,0,17,29,0,8,56,2,25.23,12.76,228,3305831,49,1656529,635,771.05,1291.18,1031.11,34,163,194,144,81,16,3,38,80,1,27,127,8,5500000,2011-09-08T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,29.6304,41.2839,114.55,0.599708879,46308.5,0.045037137,5.76,6.74,1675.14,1556.46,131.41,205.19,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565
,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
8,2011-09-09T00:00:00.000+0000,44,44,5,,,,,,,Investment,Bibirevo,6407578.1,155572,0.189727117,6.99893e-05,9576,5001,5,10309,11065,5,0,240.0,1,0,7,3,no,0,16,1,no,no,no,no,no,no,no,no,86206,40477,45729,21154,11007,10147,98207,52277,45930,36211,10580,25631,9576,4899,4677,10309,5463,4846,23603,12286,11317,17508,9425,8083,18654,9709,8945,211,25,0,0,0,2,184,0,0,0,211,0,0,0,206,5,8,3.623270141,1.593203382,11.52778764,0.96064897,0.134365152,0.581510695,1.602056341,0.165124113,1.166991896,23.11838,1.131798,4.042541,4.309574655,51.71489587,8,5.622394422,7.453825079,1,0.25481389,3.057766679,0.567006418,no,1.04229051,11.53414568,13.71375173,14.2861768,15.77042634,1.042290487,1,no,3.303551988,36,1.964850803,no,14.4344623,101,23.39166932,1,18.84810404,5.33292585,1.149883196,0.952974662,5.713887982,4.871096986,9.913434734,2.795147723,0.342545086,2.090237161,2.359385347,8.114016035,3.495441189,1.69654313,4.213461148,0.746259906,6.917527177,1.09857248,0.657587332,0.872231483,0.371856693,0.581510695,1.079769307,0.558238725,4.469517135,13.15230977,8.018089004,7.718966846,0.373715085,good,17.62,0.0,0,0,0,0,4,1000.0,1625.0,1312.5,0,0,1,2,1,0,0,0,0,0,0,1,0,26.27,0.0,1,30500,3,13200,13,784.62,1269.23,1026.92,0,4,2,5,2,0,0,0,1,0,0,3,0,17.42,1.97,2,70500,11,207671,29,585.71,982.14,783.93,1,13,7,6,2,0,0,1,3,0,0,7,1,15.02,7.13,6,113554,12,226291,36,614.29,1042.86,828.57,1,15,11,6,2,1,0,2,4,0,0,9,1,11.47,13.56,12,251554,21,1377234,55,623.53,1049.02,836.27,4,21,15,11,3,1,0,3,4,0,0,18,2,15.8,11.44,24,711780,45,2197401,143,709.09,1189.39,949.24,11,37,46,36,9,4,0,11,18,1,0,47,4,2000000,2011-09-09T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.0495,41.1192,112.77,0.599708879,46308.5,0.045037137,5.76,6.74,1625.29,1518.22,131.33,205.75,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106
.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
9,2011-09-10T00:00:00.000+0000,42,27,5,,,,,,,Investment,Koptevo,4787424.272,96959,0.03845587,0.122090347,6507,3272,4,6566,8751,6,0,1894.0,4,0,7,6,no,0,0,7,no,no,no,no,no,no,no,no,78810,36091,42719,13901,7344,6557,59120,29664,29456,23938,6980,16958,6507,3456,3051,6566,3453,3113,15510,8201,7309,17662,8488,9174,12269,6487,5782,390,101,2,0,216,11,60,0,0,0,390,1,9,290,39,51,9,2.175430788,1.338707436,14.74228941,1.228524117,0.11290453,0.256589329,0.939584123,0.148956733,0.670431737,20.94097,1.759887,3.925497,1.540958494,18.49150192,9,1.540958494,2.428244611,9,0.219775841,2.637310091,0.322954506,no,6.53914608,5.514867811,8.290836714,8.963023472,10.357535,2.438125457,5,no,2.725990867,14,0.83357986,no,7.50071515,121,8.571925744,5,16.911467,1.652274682,2.679181439,1.307362663,6.178131848,4.421560784,18.18034021,1.263583558,0.0,2.467325771,0.540760283,0.731141912,0.799921119,1.782439237,3.432144582,1.032523625,1.204220449,0.256589329,2.332387982,1.143674241,0.217477464,0.256589329,0.554880314,0.555962558,4.43275291,6.788241041,5.273171011,2.449453233,0.123609825,poor,7.71,0.0,0,0,0,0,7,633.33,1083.33,858.33,1,1,3,2,0,0,0,0,0,0,0,5,0,19.25,6.55,0,0,0,0,10,566.67,1000.0,783.33,1,2,5,2,0,0,0,2,2,0,0,6,0,19.83,19.76,5,116353,0,0,15,592.31,1038.46,815.38,2,4,6,2,1,0,0,2,3,0,0,9,2,22.0,19.72,12,144351,0,0,18,593.75,1031.25,812.5,2,5,7,3,1,0,0,2,3,0,0,14,3,17.87,21.59,33,872741,12,503283,98,572.41,965.52,768.97,11,36,28,21,1,1,0,4,8,0,1,24,6,18.73,14.52,94,2283313,41,1051314,292,711.36,1185.61,948.48,28,86,81,69,19,8,1,18,34,1,3,85,11,5300000,2011-09-10T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.0495,41.1192,112.77,0.599708879,46308.5,0.045037137,5.76,6.74,1625.29,1518.22,131.33,205.75,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18
.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310
10,2011-09-13T00:00:00.000+0000,36,21,9,,,,,,,Investment,Kuncevo,52351766.54,142462,0.070662054,0.035145267,9347,4050,7,9292,11234,8,0,2620.0,0,0,5,2,no,0,3,3,no,no,no,yes,no,no,no,no,78507,37052,41455,19784,10339,9445,85551,43352,42199,37127,11024,26103,9347,4806,4541,9292,4919,4373,22071,11484,10587,15929,8541,7388,17469,9082,8387,737,68,204,14,237,21,180,2,10,1,738,47,88,413,94,96,10,1.383373263,0.396274723,4.755296675,0.396274723,0.309672803,0.150686282,1.969773852,0.191191251,2.024970785,16.75309,2.790526,16.5619,2.522421641,30.26905969,10,1.673206368,2.735303613,22,0.218409543,2.62091451,1.021499072,no,3.953500268,5.885973152,8.955785282,10.00345994,10.86912007,1.101813754,6,no,1.102384108,9,0.147894252,no,10.22914523,50,13.77381942,5,20.09408343,7.353920421,0.61502045,0.453543783,4.154171002,2.12435988,20.46122679,5.174844257,0.508801924,1.281865943,5.741238918,5.076912538,2.969620143,1.35479157,5.673912846,2.604184808,5.23895598,0.150686282,0.912378535,0.459233791,1.794687627,0.150686282,1.167638745,1.150550082,3.557616808,4.373416959,5.148563093,0.711874878,0.345225166,satisfactory,33.79,0.0,1,15565,0,0,2,400.0,750.0,575.0,0,1,1,0,0,0,0,0,0,0,0,0,0,29.35,0.0,2,46565,1,16600,5,620.0,1100.0,860.0,0,2,2,0,1,0,0,0,0,0,1,7,0,25.02,0.0,2,46565,1,16600,9,788.89,1388.89,1088.89,0,2,4,0,3,0,0,2,2,0,1,10,0,20.47,0.0,2,46565,4,33440,22,861.9,1476.19,1169.05,1,2,10,4,4,1,0,2,2,0,1,13,0,22.56,7.15,10,635269,12,545761,56,1050.94,1735.85,1393.4,3,9,15,12,9,8,0,5,9,0,1,32,0,23.35,10.5,30,1132519,32,1404650,143,921.71,1531.01,1226.36,14,28,39,30,19,13,0,10,20,1,3,67,1,2000000,2011-09-13T00:00:00.000+0000,111.29,14313.7,3.3,353.2,434.4,86.721,15.386,10.1,30.2323,41.4451,111.89,0.599708879,46308.5,0.045037137,5.76,6.74,1572.0,1498.16,131.19,205.94,10920215,0.018590211,4.0,386388,1.049542764,11.92,9948.7728,0.187790661,40311.3,-0.005,44898.7,0.168917353,12856.97,3322.047,286.952,106.6,6643.626,0.014,0.708,73976.19863,856.424079,0.708,0.292,0.891477685,53636,226.2141565,8.5,3.8
,549075.8,106.6,1.1,5.1,6.2,10.8,9.7,218,18.77206579,99.9,99.8,99.5,43.9,95.7,55.3,99.9,0.3,75.79,6.2,5.53,715.1,155.22,94.02,66.28,51.15,44.25,37.73,30.63,793.3195607,#!,1391.710938,89.0495,,,65.9,99.6,8180.755454,375.8,846,302,741,271,6943,565,0.45356,1240,269768,22.37,64.12,23587,230310


In [20]:
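# Select the columns of interest and derive engineered features
# (date parts, upper-cased categoricals, cleaned missing values, yes/no flags).
# to_none_udf and yn_conversion_udf must already be registered as Spark SQL UDFs.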
engineered_df = spark.sql("""SELECT
                                year(sberbank_timestamp) as year,
                                month(sberbank_timestamp) as month,
                                day(sberbank_timestamp) as day,
                                full_sq,
                                life_sq,
                                floor,
                                upper(sub_area) as sub_area,            --normalize in case of capitalization diffs
                                upper(product_type) as product_type,    --normalize in case of capitalization diffs
                                to_none_udf(max_floor) as max_floor,
                                to_none_udf(material) as material,
                                to_none_udf(build_year) as build_year,
                                yn_conversion_udf(big_market_raion) as big_market_raion,
                                yn_conversion_udf(nuclear_reactor_raion) as nuclear_reactor_raion,
                                yn_conversion_udf(detention_facility_raion) as detention_facility_raion,
                                cast(work_all as Double) as work_all
                           FROM sberbank_housing_macro""")
engineered_df.show()
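
The query above calls two SQL UDFs, `to_none_udf` and `yn_conversion_udf`, whose definitions are not shown in this part of the notebook. The sketch below is one plausible way they could be written and registered; it is an assumption, not the notebook's actual code. In particular, the set of missing-value placeholders (`''`, `'NA'`, `'NULL'`), the 1/0 encoding for yes/no, and the helper names `to_none`, `yn_conversion`, and `register_udfs` are all hypothetical.

```python
from typing import Optional


def to_none(value: Optional[str]) -> Optional[str]:
    """Map missing-value placeholders (assumed: '', 'NA', 'NULL') to None."""
    if value is None:
        return None
    cleaned = str(value).strip()
    return None if cleaned.upper() in {"", "NA", "NULL"} else cleaned


def yn_conversion(value: Optional[str]) -> Optional[int]:
    """Map 'yes'/'no' (any capitalization) to 1/0; anything else becomes None."""
    if value is None:
        return None
    return {"yes": 1, "no": 0}.get(str(value).strip().lower())


def register_udfs(spark) -> None:
    """Expose the helpers to Spark SQL under the names used in the query."""
    # Imported here so the helpers above stay usable without a Spark installation.
    from pyspark.sql.types import IntegerType, StringType

    spark.udf.register("to_none_udf", to_none, StringType())
    spark.udf.register("yn_conversion_udf", yn_conversion, IntegerType())
```

In a notebook, `register_udfs(spark)` would need to run before the `spark.sql(...)` cell. Because this version of `to_none` returns a string, numeric columns such as `build_year` would still need a `cast` afterwards if a numeric type is wanted.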

In [21]:
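# Persist the engineered dataset as a Parquet-backed table.
# Note: with the default save mode, saveAsTable raises an error if the table
# already exists; add .mode("overwrite") before saveAsTable to replace it on re-runs.

permanent_table_name = "sberbank_train_01"
engineered_df.write.format("parquet").saveAsTable(permanent_table_name)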