# Interesting Findings
### Clouds
Clouds obscure land features and are often misclassified as buildings.

### Roof solar panels
Solar panels on roofs often appear as damaged buildings.

_Tip: Include more cloud images in the training dataset to improve building and damage detection._

# File Formats and Datasets

### [GeoJSON](https://geojson.org/)
Geographic JSON format with support for multi-polygons and custom properties.

```json
{
  "type": "FeatureCollection",
  "crs": {
      "type": "name", 
      "properties": { 
          "name": "urn:ogc:def:crs:OGC:1.3:CRS84" 
      } 
  },
  "features": 
     [
       {
         "type": "Feature", 
         "properties": {
             "DAMAGE_AREA_ID": "{f8432ec2-49d5-42c8-8065-4c8d8b54e773}", 
             "CREATED_DATE": "2020-10-30T21:17:14+00:00", 
             "MODIFIED_DATE": "2020-11-19T20:03:11+00:00", 
             "FEATURE_USER_ID": "smckelvey_partner", 
             "OBSERVATION_USER_ID": "smckelvey_partner", 
             "REGION_ID": 5, 
             "LABEL": "RF-1", 
             "HOST_CODE": 20.0, 
             "HOST": "California red fir", 
             "HOST_GROUP_CODE": null, 
             "HOST_GROUP": null, 
             "DCA_CODE": 11050, 
             "DCA_COMMON_NAME": "fir engraver", 
             "DAMAGE_TYPE_CODE": 2, 
             "DAMAGE_TYPE": "Mortality", 
             "PERCENT_AFFECTED_CODE": 1, 
             "PERCENT_AFFECTED": "Very Light (1-3%)", 
             "PERCENT_MIN": 1.0, 
             "PERCENT_MAX": 3.0, 
             "PERCENT_MID": 2.0, 
             "NOTES": "NA20; 20200730; 035H", 
             "OBSERVATION_COUNT": "SINGLE", 
             "COLLECTION_MODE": "DesktopGIS", 
             "SNAPGRID_ROW": null, 
             "SNAPGRID_COLUMN": null, 
             "AREA_TYPE": "POLYGON", 
             "ACRES": 5.09, 
             "STATUS": 1, 
             "SURVEY_YEAR": 2020, 
             "LEGACY_SEVERITY_CODE": null, 
             "LEGACY_SEVERITY": null, 
             "LEGACY_PATTERN_CODE": null, 
             "LEGACY_PATTERN": null, 
             "LEGACY_TPA": null, 
             "LEGACY_NO_TREES": null, 
             "LEGACY_FOREST_TYPE_CODE": null, 
             "LEGACY_FOREST_TYPE": null, 
             "GRP": null, 
             "OBSERVATION_ID": "{333ed692-0a37-4990-a368-4d503b791621}", 
             "IDS_DATA_SOURCE": "91", 
             "DATA_SOURCE_NAME": "REMOTE_SENSING", 
             "US_AREA": "CONUS", 
             "SHAPE_Length": 595.21228600142638, 
             "SHAPE_Area": 20579.801704105612 
         }, 
         "geometry": { 
             "type": "MultiPolygon", 
             "coordinates": [ 
                 [ 
                     [ 
                         [ -120.20806708986845, 38.604999600002188 ], 
                         [ -120.210326122885249, 38.605113969931573 ], 
                         [ -120.210303609363322, 38.605562702761141 ], 
                         [ -120.208604606082048, 38.606291053520479 ], 
                         [ -120.207699241299181, 38.605333861676037 ], 
                         [ -120.20806708986845, 38.604999600002188 ] 
                     ] 
                 ] 
             ] 
         } 
     }
   ]
}
```
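
As a minimal sketch, a feature collection like the one above can be read with Python's standard `json` module. The inline sample is a trimmed stand-in for the record above (a few properties, rounded coordinates), and the helper name `summarize_features` is an illustrative assumption, not part of any library.

```python
import json

# Trimmed stand-in for the record above: a few properties, rounded coordinates.
SAMPLE = """
{
  "type": "FeatureCollection",
  "features": [
    {
      "type": "Feature",
      "properties": {"HOST": "California red fir", "DAMAGE_TYPE": "Mortality", "ACRES": 5.09},
      "geometry": {
        "type": "MultiPolygon",
        "coordinates": [[[
          [-120.2081, 38.6050],
          [-120.2103, 38.6051],
          [-120.2103, 38.6056],
          [-120.2086, 38.6063],
          [-120.2077, 38.6053],
          [-120.2081, 38.6050]
        ]]]
      }
    }
  ]
}
"""

def summarize_features(collection):
    """Yield (host, damage_type, acres, ring_count, rings_closed) per feature."""
    for feature in collection["features"]:
        props = feature["properties"]
        geom = feature["geometry"]
        # A MultiPolygon is a list of polygons; each polygon is a list of rings.
        if geom["type"] == "MultiPolygon":
            polygons = geom["coordinates"]
        elif geom["type"] == "Polygon":
            polygons = [geom["coordinates"]]
        else:
            polygons = []
        rings = [ring for polygon in polygons for ring in polygon]
        # GeoJSON rings are closed: the first position equals the last one.
        closed = all(ring[0] == ring[-1] for ring in rings)
        yield (props.get("HOST"), props.get("DAMAGE_TYPE"),
               props.get("ACRES"), len(rings), closed)

collection = json.loads(SAMPLE)
summary = list(summarize_features(collection))
for row in summary:
    print(row)
```

Custom properties such as `HOST` or `ACRES` are plain JSON values, so no geospatial library is needed just to inspect them.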

### [GeoTIFF](https://en.wikipedia.org/wiki/GeoTIFF)

Geographic Tagged Image File Format (GeoTIFF) organizes and stores geospatial raster image datasets such as satellite imagery and terrain models.

GeoTIFF stores spatial information and coordinates within the image file including the raster’s cell size and coordinate reference system (CRS).
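
To illustrate why the cell size matters, here is a minimal sketch of how an origin plus a cell size turns a pixel position into a map coordinate. It assumes a north-up raster with no rotation; the function name `pixel_to_coords` and all numeric values are made up for illustration and are not read from a real file.

```python
def pixel_to_coords(row, col, origin_x, origin_y, cell_width, cell_height):
    """Return the (x, y) map coordinate of the center of pixel (row, col).

    Assumes a north-up raster: the origin is the upper-left corner,
    x grows with the column index, and y shrinks with the row index.
    """
    x = origin_x + (col + 0.5) * cell_width
    y = origin_y - (row + 0.5) * cell_height
    return x, y

# Illustrative values only: a 30 m grid in a projected CRS measured in meters.
x, y = pixel_to_coords(row=0, col=0,
                       origin_x=500000.0, origin_y=4200000.0,
                       cell_width=30.0, cell_height=30.0)
print(x, y)
```

The CRS is what gives these numbers meaning: the same `(x, y)` pair refers to different places under different coordinate reference systems.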


### [Cloud-Optimized GeoTIFF (COG)](https://www.cogeo.org/)

A Cloud Optimized GeoTIFF (COG) is a regular GeoTIFF file whose pixels are organized in a way that enables `HTTP GET` range requests.  Clients can then stream only the parts of the GeoTIFF they need rather than downloading the whole file.
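
A range request is an ordinary `GET` with a `Range` header. The sketch below builds one with Python's standard library; the URL is a placeholder, and the helper names `build_range_request` and `fetch_bytes` are assumptions made for illustration. Nothing is fetched unless `fetch_bytes` is called.

```python
from urllib.request import Request, urlopen

def build_range_request(url, start, length):
    """Build a GET request for `length` bytes beginning at byte offset `start`."""
    end = start + length - 1  # HTTP byte ranges are inclusive on both ends
    return Request(url, headers={"Range": f"bytes={start}-{end}"})

def fetch_bytes(url, start, length):
    """Fetch one byte range; a server that honors it replies 206 Partial Content."""
    with urlopen(build_range_request(url, start, length)) as response:
        return response.status, response.read()

# Placeholder URL (assumption): substitute the location of a real COG.
request = build_range_request("https://example.com/scene.tif", 0, 16384)
print(request.get_header("Range"))
```

In practice a reader would first request the start of the file to obtain the header, then request only the byte ranges that cover the area of interest.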

### [AWS Open Data Registry - Disaster Response](https://registry.opendata.aws/tag/disaster-response/)
This registry exists to help people discover and share datasets that are available via AWS resources.

### [Forestry Insect-and-Disease Detection Survey](https://www.fs.fed.us/foresthealth/applied-sciences/mapping-reporting/detection-surveys.shtml)

Collects data on the health of trees in areas affected by insects and diseases throughout U.S. forests.

### [Landsat Satellite Imagery 30m](https://earthexplorer.usgs.gov/)

Space-based images of the Earth’s land surface at 30m resolution.  These images serve as valuable resources for land-use and land-change research over time.

### [Copernicus Digital Elevation Model (DEM)](https://registry.opendata.aws/copernicus-dem/)
Multi-purpose dataset that represents the Earth's surface including buildings, infrastructure, and vegetation.

### [Moderate Resolution Imaging Spectroradiometer (MODIS) 500m](https://registry.opendata.aws/modis/)

Views the entire surface of the Earth every 1-2 days to describe land, oceans, and atmosphere at 500m resolution.  This dataset is used to study trends across the globe.

### [European Space Agency (ESA) WorldCover 10m](https://registry.opendata.aws/esa-worldcover/)
European Space Agency's global land cover map at 10m resolution, delivered in 3x3 degree tiles and classified into the following 11 categories:
* Tree cover 
* Shrubland
* Grassland
* Cropland
* Built-up
* Bare/sparse vegetation
* Snow and ice
* Permanent water bodies
* Herbaceous wetland
* Mangroves
* Moss and lichen
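
Assuming the 3x3 tiles are 3 degree by 3 degree squares aligned to multiples of 3 in latitude and longitude, the tile covering a point can be found by snapping its coordinates down to that grid. This is only a sketch: the helper names and the label format are hypothetical and do not necessarily match the dataset's own file naming.

```python
import math

TILE_SIZE_DEG = 3  # assumption: tiles are 3 x 3 degrees, aligned to multiples of 3

def tile_corner(lat, lon):
    """Return the lower-left (lat, lon) corner of the tile containing the point."""
    low_lat = math.floor(lat / TILE_SIZE_DEG) * TILE_SIZE_DEG
    low_lon = math.floor(lon / TILE_SIZE_DEG) * TILE_SIZE_DEG
    return low_lat, low_lon

def tile_label(lat, lon):
    """Hypothetical label for the tile, built from its lower-left corner."""
    low_lat, low_lon = tile_corner(lat, lon)
    ns = "N" if low_lat >= 0 else "S"
    ew = "E" if low_lon >= 0 else "W"
    return f"{ns}{abs(low_lat):02d}{ew}{abs(low_lon):03d}"

# A point in California's Sierra Nevada.
corner = tile_corner(38.605, -120.208)
label = tile_label(38.605, -120.208)
print(corner, label)
```

`math.floor` is used instead of `int()` so that negative coordinates snap toward the south-west corner rather than toward zero.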

### [ECMWF ERA5 Reanalysis](https://registry.opendata.aws/ecmwf-era5/)
Provides essential atmospheric meteorological characteristics for the following:
* air temperature
* pressure and wind at different altitudes
* rainfall
* soil moisture content
* sea-surface temperature
* wave height

### [Fire Information for Resource Management System (FIRMS)](https://firms.modaps.eosdis.nasa.gov/)
Near-real-time monitoring and detection of active fires and thermal anomalies within 3 hours of satellite observation.  Supports active-fire alerts as well as historical thermal data.

### [SpaceNet2 Building Detection]()
Provides labeled data for roads, building footprints, and other points of interest.  Building counts and coverage areas for the 4 cities:
* 151K buildings and 216 sq km in Las Vegas
* 23K buildings and 1,030 sq km in Paris
* 92K buildings and 1,000 sq km in Shanghai
* 35K buildings and 765 sq km in Khartoum

### [OpenStreetMap](https://www.openstreetmap.org/#map=4/38.01/-95.84)
Collaborative, crowd-sourced geographic database of the world, built by over 2 million registered users using GPS devices, aerial photography, and local knowledge of the area.

### [xBD Dataset](https://xview2.org/dataset)
Annotated, high-resolution satellite imagery for building-damage assessment.

* 850,000+ buildings labeled with 4 levels of damage:  none, minor, major, destroyed
* 3,700+ pairs of pre- and post-disaster satellite images
* 6 disaster types:  earthquake, fire, flooding, tsunami, volcano, wind


### [Maxar Open Data Program](https://www.maxar.com/open-data)
Open datasets for sudden-onset major crisis events such as the following:
* Hurricane Michael Oct 7-16, 2018
* Hurricane Florence Sep 10-19, 2018
* Hurricane Harvey Aug 17 - Sep 2, 2017
* Hurricane Matthew Sep 28 - Oct 10, 2016
* Monsoon in Nepal, India, Bangladesh Jul - Sep, 2017
* Moore, OK Tornado May 20, 2013
* Tuscaloosa, AL Tornado Apr 27, 2011
* Joplin, MO Tornado May 22, 2011

### [Sentinel-2](https://sentinel.esa.int/web/sentinel/missions/sentinel-2)

Satellites that monitor variability in Earth's surface conditions to help study the following:
* Climate change 
* Emergency management
* Security
* Border protection
* Maritime surveillance

### [ImageNet](https://www.image-net.org)
ImageNet contains more than 14 million images across 20,000 categories and includes over 1 million images with bounding boxes.

# Models and Libraries

### [ResUNet](https://arxiv.org/abs/1711.10684)

Semantic-segmentation neural network that extracts roads from aerial images, used for disaster-response research.

### [GeoPandas](https://github.com/geopandas)

Adds support for geographic data to Pandas.  Implements `GeoSeries` and `GeoDataFrame` as subclasses of `pandas.Series` and `pandas.DataFrame`.

### [EarthPy](https://earthpy.readthedocs.io/) 
Python package devoted to working with spatial raster and vector data, including remote-sensing imagery.

### [Rasterio](https://rasterio.readthedocs.io/)
Reads and writes GeoTIFF files and provides a Python API based on NumPy N-dimensional arrays.

### [Scikit-Learn](https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html)
* K-Nearest Neighbor (KNN)
* PCA
* SVD
* t-SNE
* Binary Classification
* Multi-class Classification
* Logistic Regression
* Decision Tree
* Random Forest

### [Numba](https://numba.pydata.org/)
Just-in-time compiler that accelerates Python and NumPy code, with support for running on GPUs through Nvidia's CUDA library.