# Get Data

This notebook walks through downloading all the data needed for this project.
It assumes that the working data directory is `./data`.

We perform this work for the Philippines, but the same process works for any country.
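To make the "any country" idea concrete, here is a minimal sketch that derives the file locations used in this notebook from a country code. The helper names `data_paths` and `ensure_dirs` are hypothetical (they are not part of the original notebook), and the layout simply mirrors the paths in the cells below. The download URLs are not derived here, since the dataset and resource IDs differ per country and have to be looked up on the data portal.

```python
from pathlib import Path


def data_paths(iso3, data_dir="./data"):
    """Return the per-country paths used in this notebook (hypothetical helper)."""
    code = iso3.lower()
    root = Path(data_dir)
    return {
        "pop_zip": root / f"{code}_pop_geotiff.zip",           # population archive
        "pop_dir": root / "pop" / code,                         # extracted GeoTIFFs
        "bounds_dir": root / "bounds",                          # extracted boundaries
        "osm_pbf": root / "osm_extracts" / f"{code}.osm.pbf",   # OSM extract
    }


def ensure_dirs(paths):
    """Create the directories the download cells write into (hypothetical helper)."""
    for key in ("pop_dir", "bounds_dir"):
        paths[key].mkdir(parents=True, exist_ok=True)
    paths["osm_pbf"].parent.mkdir(parents=True, exist_ok=True)
```

Calling `ensure_dirs(data_paths("phl"))` before running the download cells creates `./data/pop/phl`, `./data/bounds` and `./data/osm_extracts` if they do not exist yet.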

### High Resolution Population Density Maps + Demographic Estimates
[Philippines: High Resolution Population Density Maps + Demographic Estimates from Facebook](https://data.humdata.org/dataset/6d9f35c0-4764-49ee-b364-329db0b7a47d) provides the data in CSV and GeoTIFF formats, with optional breakdowns by demographic group.

We focus on the total population, but the analysis works with any of the available GeoTIFF demographic subsets.

In [3]:
# Get the PHL (Philippines) population GeoTIFF
!wget -O "./data/phl_pop_geotiff.zip" "https://data.humdata.org/dataset/6d9f35c0-4764-49ee-b364-329db0b7a47d/resource/4a178155-b746-4f04-8f1b-2a79cc6f5153/download/population_phl_2018-10-01_geotiff.zip"

# Unzip
!unzip ./data/phl_pop_geotiff.zip -d ./data/pop/phl/

--2020-08-26 18:03:37--  https://data.humdata.org/dataset/6d9f35c0-4764-49ee-b364-329db0b7a47d/resource/4a178155-b746-4f04-8f1b-2a79cc6f5153/download/population_phl_2018-10-01_geotiff.zip
Resolving data.humdata.org (data.humdata.org)... 34.206.254.225, 3.227.32.143, 54.161.199.142
Connecting to data.humdata.org (data.humdata.org)|34.206.254.225|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/4a178155-b746-4f04-8f1b-2a79cc6f5153/population_phl_2018-10-01_geotiff.zip?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Expires=180&X-Amz-Credential=AKIARZNKTAO7U6UN77MP%2F20200826%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-SignedHeaders=host&X-Amz-Date=20200826T180338Z&X-Amz-Signature=6ce11e28cf23124fb7499eecaa8d7e624c37186cdd419e26b53128a813ddf415 [following]
--2020-08-26 18:03:39--  https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/4a178155-b746-4f04-8f1b-2a79cc6f5153/population_p

In [3]:
# Get the IDN (Indonesia) population GeoTIFF, as an example of a second country
!wget -O "./data/idn_pop_geotiff.zip" "https://data.humdata.org/dataset/0474df44-62b5-4a4c-a4fd-fd733979e2cc/resource/2b5f1310-ef98-44cb-b8b6-0d314add751c/download/population_idn_2018-10-01_geotiff.zip"

# Unzip
!unzip ./data/idn_pop_geotiff.zip -d ./data/pop/idn

--2020-08-28 12:06:31--  https://data.humdata.org/dataset/0474df44-62b5-4a4c-a4fd-fd733979e2cc/resource/2b5f1310-ef98-44cb-b8b6-0d314add751c/download/population_idn_2018-10-01_geotiff.zip
Resolving data.humdata.org (data.humdata.org)... 3.227.32.143, 54.161.199.142, 34.206.254.225
Connecting to data.humdata.org (data.humdata.org)|3.227.32.143|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/2b5f1310-ef98-44cb-b8b6-0d314add751c/population_idn_2018-10-01_geotiff.zip?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Expires=180&X-Amz-Credential=AKIARZNKTAO7U6UN77MP%2F20200828%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-SignedHeaders=host&X-Amz-Date=20200828T120633Z&X-Amz-Signature=285e35b82cb623fcc4ced60b4e00385bb32e46021dcec12adb18184c898719e7 [following]
--2020-08-28 12:06:33--  https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/2b5f1310-ef98-44cb-b8b6-0d314add751c/population_idn
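The unzip step in the two cells above can also be done from Python, which avoids depending on the `unzip` command being installed. The following is a minimal sketch using only the standard library; `unzip_to` is a hypothetical helper name, not something defined elsewhere in this project.

```python
import zipfile
from pathlib import Path


def unzip_to(zip_path, dest_dir):
    """Extract every member of zip_path into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)  # same effect as `unzip -d` creating the target
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return archive.namelist()
```

For example, `unzip_to("./data/phl_pop_geotiff.zip", "./data/pop/phl")` mirrors the Philippines cell. The returned list is a quick way to see which GeoTIFF files the archive contained.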

### Subnational Boundaries
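The [PHL Subnational Boundaries](https://data.humdata.org/dataset/caf116df-f984-4deb-85ca-41b349d3f313/resource/12457689-6a86-4474-8032-5ca9464d38a8) dataset provides the administrative boundaries of the Philippines as a zipped shapefile archive.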

In [4]:
# Getting Administrative Boundaries for the Philippines
!wget -O "./data/phl_adm_psa_namria_20200529_shp.zip" "https://data.humdata.org/dataset/caf116df-f984-4deb-85ca-41b349d3f313/resource/12457689-6a86-4474-8032-5ca9464d38a8/download/phl_adm_psa_namria_20200529_shp.zip"

# Unzip
!unzip ./data/phl_adm_psa_namria_20200529_shp.zip -d ./data/bounds

--2020-08-26 18:03:43--  https://data.humdata.org/dataset/caf116df-f984-4deb-85ca-41b349d3f313/resource/12457689-6a86-4474-8032-5ca9464d38a8/download/phl_adm_psa_namria_20200529_shp.zip
Resolving data.humdata.org (data.humdata.org)... 34.206.254.225, 3.227.32.143, 54.161.199.142
Connecting to data.humdata.org (data.humdata.org)|34.206.254.225|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/12457689-6a86-4474-8032-5ca9464d38a8/phl_adm_psa_namria_20200529_shp.zip?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Expires=180&X-Amz-Credential=AKIARZNKTAO7U6UN77MP%2F20200826%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-SignedHeaders=host&X-Amz-Date=20200826T180344Z&X-Amz-Signature=6f8558bc3897f8b901b975af39495d0561fa7206b324dd0f598a35d771cbc7b0 [following]
--2020-08-26 18:03:44--  https://s3.eu-central-1.amazonaws.com/hdx-ckan-filestore-prod/resources/12457689-6a86-4474-8032-5ca9464d38a8/phl_adm_psa_namr
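A shapefile is really a set of sibling files: each layer needs at least a `.shp`, a `.shx` and a `.dbf` file to be readable. As an optional sanity check after the download, the sketch below inspects the archive and reports layers that lack one of these parts. The helper name `incomplete_shapefiles` is hypothetical, and the check makes no assumption about which layers this particular archive contains.

```python
import zipfile
from pathlib import PurePosixPath

# Mandatory components of a shapefile layer
REQUIRED_SUFFIXES = {".shp", ".shx", ".dbf"}


def incomplete_shapefiles(zip_path):
    """Map each .shp base name in the archive to the mandatory suffixes it lacks."""
    found = {}
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            member = PurePosixPath(name)  # zip member names use forward slashes
            suffix = member.suffix.lower()
            if suffix in REQUIRED_SUFFIXES:
                found.setdefault(str(member.with_suffix("")), set()).add(suffix)
    return {
        base: REQUIRED_SUFFIXES - suffixes
        for base, suffixes in found.items()
        if ".shp" in suffixes and suffixes != REQUIRED_SUFFIXES
    }
```

An empty dictionary from `incomplete_shapefiles("./data/phl_adm_psa_namria_20200529_shp.zip")` means every layer in the archive has its three mandatory files.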

### OSM Extracts
The [Geofabrik](https://download.geofabrik.de/) download server hosts OSM extracts for various regions. We download the Philippines extract from it and use it to build our road network.

*This step is only necessary if you plan to build your own network.*

In [6]:
!mkdir -p "./data/osm_extracts/"
!wget -O "./data/osm_extracts/phl.osm.pbf" "https://download.geofabrik.de/asia/philippines-latest.osm.pbf"

--2020-08-26 18:09:35--  https://download.geofabrik.de/asia/philippines-latest.osm.pbf
Resolving download.geofabrik.de (download.geofabrik.de)... 88.99.142.44, 116.202.112.212
Connecting to download.geofabrik.de (download.geofabrik.de)|88.99.142.44|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 361130167 (344M) [application/octet-stream]
Saving to: ‘./data/osm_extracts/phl.osm.pbf’


2020-08-26 18:10:01 (13.4 MB/s) - ‘./data/osm_extracts/phl.osm.pbf’ saved [361130167/361130167]
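The extract above is about 344 MB, so re-running the cell repeats a sizeable download. One possible way to avoid that is sketched below with the standard library only; `download_if_missing` is a hypothetical helper, not part of the original notebook. Note the trade-off: because the URL points at the `latest` extract, skipping the download means keeping whatever copy is already on disk.

```python
import shutil
import urllib.request
from pathlib import Path


def download_if_missing(url, dest):
    """Stream url to dest unless dest already exists; return True if a download ran."""
    dest = Path(dest)
    if dest.exists() and dest.stat().st_size > 0:
        return False  # keep the existing copy
    dest.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temporary name first so an interrupted download
    # is not mistaken for a complete file on the next run.
    tmp = dest.with_name(dest.name + ".part")
    with urllib.request.urlopen(url) as response, open(tmp, "wb") as out:
        shutil.copyfileobj(response, out)
    tmp.replace(dest)
    return True
```

Used with the URL from the cell above, this would be `download_if_missing("https://download.geofabrik.de/asia/philippines-latest.osm.pbf", "./data/osm_extracts/phl.osm.pbf")`. Delete the local file first to force a fresh copy.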

