# Industry Accelerators - Insurance Loss Estimation Using Remote Sensing Data

## Introduction

With the increase in the number of satellite launches and the dramatic improvements in sensing technology (e.g., better cameras with hyper-spectral imaging, synthetic aperture radar, and lidar), remote sensing data now has much better quality in resolution and spatio-temporal coverage. There is demand for using this data to solve business issues across a wide range of industries, including insurance, urban planning, agriculture, climate change, and flood prevention. This creates a need for ML/AI technology solutions for remote sensing data. <br>

The Insurance Loss Estimation Using Remote Sensing Data industry accelerator shows how to derive insights from remote sensing data, using the study of flood events in support of insurance claims as an example. The idea is to compare satellite images taken before and after an event and, with the help of spatiotemporal analysis and ML/AI techniques, determine which regions are affected and to what extent. Such insights give insurance companies valuable information on property damage, allowing claims to be processed more efficiently. <br>
For this use case, we picked a Category 4 storm, Hurricane Laura, which caused damage to many areas of Louisiana, United States, in August 2020. Insurance companies can use this accelerator to understand how remote sensing can be used to study a flood event, gain insights into the impacted regions, and then predict estimated claims for individual properties.

This accelerator includes a structured glossary of more than 70 business terms and a set of sample data science assets. The glossary provides the information architecture that you need to understand how to estimate property insurance loss using remote sensing. Your data scientists can use the sample notebooks, remote sensing analysis, predictive model, and dashboard to accelerate data preparation, machine learning modeling, and data reporting. All the functionality in this accelerator can be generalized and applied to other use cases for remote sensing data.

![image-2.png](attachment:image-2.png)
![image.png](attachment:image.png)

## Inventory of Artifacts provided
### Knowledge Catalog
Described in the Knowledge Center https://www.ibm.com/support/producthub/icpdata/docs/content/SSQNUZ_latest/svc-nav/head/industry-accel-svc.html

### Sample Datasets
We provide the following raw data files for the accelerator: <br>
**Remote Sensing Input.csv:** Property Id and property coordinates in Cameron town, Louisiana. <br>
**Insurance Loss Claims.csv:** Customer property information (property value, construction details, size, property damage amount, etc.) and insurance information (sum insured limit, previous claims, estimated insurance claim, etc.). <br>
**b03_before.tif**:  Green band satellite image of Cameron town captured before Hurricane Laura. <br>
**b03_after.tif**: Green band satellite image of Cameron town captured after Hurricane Laura. <br>
**b08_before.tif**: Near infrared band satellite image of Cameron town captured before Hurricane Laura. <br>
**b08_after.tif**: Near infrared band satellite image of Cameron town captured after Hurricane Laura.<br>
 
Additionally, two other datasets are created by the analytics project: <br>
**Remote Sensing Output.csv:** Flood impact details for the properties listed in Remote Sensing Input.csv. This dataset is generated by the notebook **1-Impact_Region_Analysis_with_Remote_Sensing**.<br>
**model_output_summary.csv:** Consolidated, prepared data that combines the remote sensing and insurance loss claims datasets for exploratory data analysis and data visualization in the R Shiny dashboard. <br>
These files can be found in the **data_asset** folder.
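
This README does not describe the notebook's exact method. As one illustration of how green and near-infrared bands can be compared before and after an event, the sketch below uses the Normalized Difference Water Index (NDWI), a common water indicator computed as (green - NIR) / (green + NIR). The function names, the threshold of 0, and the small synthetic arrays are assumptions made for illustration. Loading the `.tif` files (for example with a raster library such as rasterio) is left out.

```python
import numpy as np


def ndwi(green, nir):
    """NDWI = (green - nir) / (green + nir); higher values suggest open water."""
    green = np.asarray(green, dtype=np.float64)
    nir = np.asarray(nir, dtype=np.float64)
    total = green + nir
    out = np.zeros_like(total)
    # Pixels with no signal in either band stay at 0 instead of dividing by zero.
    np.divide(green - nir, total, out=out, where=total != 0)
    return out


def new_water_mask(green_before, nir_before, green_after, nir_after, threshold=0.0):
    """Flag pixels that look like water after the event but not before it."""
    before = ndwi(green_before, nir_before) > threshold
    after = ndwi(green_after, nir_after) > threshold
    return after & ~before


# Synthetic 2x2 reflectance values standing in for the b03/b08 before/after images.
green_before = np.array([[100, 100], [300, 100]])
nir_before = np.array([[400, 400], [100, 400]])
green_after = np.array([[300, 100], [300, 300]])
nir_after = np.array([[100, 400], [100, 100]])

flooded = new_water_mask(green_before, nir_before, green_after, nir_after)
print(flooded)
```

In this toy example the bottom-left pixel is water in both images, so it is not flagged; only pixels that change from non-water to water are marked.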

### Notebooks
Run the notebooks in the sequence shown below.

**1-Impact_Region_Analysis_with_Remote_Sensing**: Study satellite images before and after a certain event (e.g., Hurricane Laura) with the help of spatiotemporal analysis and ML/AI techniques; Get insights into which regions are affected and to what extent; Read in property coordinates to calculate an impact score for each property and save the result.
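
The last step, turning property coordinates into a per-property impact score, could be sketched as follows. This is a minimal illustration, not the notebook's scoring method: it assumes a north-up raster with square pixels, a boolean mask of affected pixels, and a score defined as the fraction of affected pixels in a small window around the property. The names `to_pixel`, `impact_score`, and `radius` are hypothetical.

```python
import numpy as np


def to_pixel(x, y, x_origin, y_origin, pixel_size):
    """Map map coordinates to (row, col) for a north-up raster.

    (x_origin, y_origin) is the top-left corner of the raster (assumption).
    """
    col = int((x - x_origin) // pixel_size)
    row = int((y_origin - y) // pixel_size)
    return row, col


def impact_score(mask, row, col, radius=1):
    """Fraction of affected pixels in a square window centred on (row, col).

    Returns None when the property falls outside the raster.
    """
    rows, cols = mask.shape
    if not (0 <= row < rows and 0 <= col < cols):
        return None
    # Clip the window at the raster edges.
    r0, r1 = max(row - radius, 0), min(row + radius + 1, rows)
    c0, c1 = max(col - radius, 0), min(col + radius + 1, cols)
    return float(mask[r0:r1, c0:c1].mean())


# Synthetic 4x4 mask: the left half of the scene is affected.
mask = np.zeros((4, 4), dtype=bool)
mask[:, :2] = True

# Hypothetical property at x=15, y=25 on a raster with origin (0, 40) and 10-unit pixels.
row, col = to_pixel(15, 25, x_origin=0, y_origin=40, pixel_size=10)
score = impact_score(mask, row, col)
print(row, col, round(score, 3))
```

A windowed fraction is only one possible choice; distance to the nearest affected pixel or the mean index change around the property would be equally reasonable definitions.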

**2-model_training_and_deployment** : Load Remote Sensing Output and Insurance Loss Claims; Prepare and clean data for model training; Analyze correlations; Build ML models; Analyze and visualize the data; Select the best performing ML model; Create the final pipeline and save it to Cloud Pak for Data; Create a Watson Machine Learning based deployment space; Store the pipeline in the space and deploy the model; Store and deploy the R Shiny app; Generate a URL to view the app.
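
The correlation and model selection steps can be pictured with a small sketch. Everything here is an assumption for illustration: the data is synthetic, the feature names (`impact`, `value`) are hypothetical, and the two candidates (a mean baseline and an ordinary least squares fit) merely stand in for whatever models the notebook builds. The Cloud Pak for Data and Watson Machine Learning deployment steps are not shown.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
impact = rng.uniform(0, 1, n)               # hypothetical impact score per property
value = rng.uniform(100_000, 500_000, n)    # hypothetical property value
claim = 20_000 * impact + 0.1 * value + rng.normal(0, 1_000, n)  # synthetic target

# Correlation between one feature and the target.
corr = float(np.corrcoef(impact, claim)[0, 1])

X = np.column_stack([impact, value])
train, test = slice(0, 150), slice(150, n)  # simple holdout split


def fit_baseline(X, y):
    """Always predict the training mean."""
    mean = y.mean()
    return lambda X_new: np.full(len(X_new), mean)


def fit_linear(X, y):
    """Ordinary least squares with an intercept column."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return lambda X_new: np.column_stack([np.ones(len(X_new)), X_new]) @ coef


def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


candidates = {"baseline": fit_baseline, "linear": fit_linear}
scores = {
    name: rmse(claim[test], fit(X[train], claim[train])(X[test]))
    for name, fit in candidates.items()
}
best = min(scores, key=scores.get)  # lowest holdout error wins
print(corr, scores, best)
```

Selecting on a held-out split rather than on training error keeps the comparison honest when candidates differ in flexibility.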

Optional: **0-Map Business Terms to data headers** : This notebook publishes the data input files into a specified catalog. It reads mappings from `insurance-loss-estimation-using-remote-sensing-data-map-terms.csv` and creates mappings from the data file headers to the business terms, for use in governing the data using Watson Knowledge Catalog.

### R Shiny Dashboard
The Shiny dashboard displays flood impact insights for Cameron town, shows property details, and scores new data. The dashboard has the following tabs:

**Remote Sensing** : Displays the impact zones and properties of Cameron town. Clicking an impact zone on the map displays the properties and insurance details specific to that zone. The tab also shows the satellite images before and after the storm event.

**Property View** : Focuses on an individual property and its insurance information, and displays the impact score calculated by the remote sensing analysis. It provides the option to call the model web service, which predicts the estimated claim amount for the property.

**Simulation Tool** : This tab contains a form with all model inputs. The user can change any of these inputs and see how the change affects the predicted claim amount.
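
The what-if idea behind the Simulation Tool can be sketched in a few lines. This is an illustration only: `toy_predict` is a stand-in for the deployed model, not the accelerator's model or its web service, and the input names are hypothetical.

```python
def what_if(predict, inputs, **changes):
    """Return (baseline, scenario, difference) after applying changes to a copy of inputs."""
    unknown = set(changes) - set(inputs)
    if unknown:
        raise KeyError(f"not a model input: {sorted(unknown)}")
    scenario = {**inputs, **changes}  # the original inputs stay untouched
    base, new = predict(inputs), predict(scenario)
    return base, new, new - base


def toy_predict(row):
    # Stand-in predictor (assumption): damage scales with impact, capped at the sum insured.
    return min(row["impact_score"] * row["property_value"], row["sum_insured"])


inputs = {"impact_score": 0.25, "property_value": 200_000, "sum_insured": 100_000}
base, new, delta = what_if(toy_predict, inputs, impact_score=0.75)
print(base, new, delta)
```

Rejecting unknown input names catches typos in a scenario before they silently produce an unchanged prediction.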

The dashboard app is deployed in the `2-Model_training_and_Deployment` notebook, which also includes instructions on how to access the app.

Alternatively, you can run the dashboard from the RStudio console using the steps below:

- Download the **Insurance-Loss-Estimation-Using-Remote-Sensing-Dashboard** zip file from the project data_assets.
- From the project homepage, launch the RStudio IDE.
- Upload the **Insurance-Loss-Estimation-Using-Remote-Sensing-Dashboard** zip file into RStudio.
- Navigate to the app.R file and click the Run App option to launch the dashboard.


### Sequence of steps to run this accelerator
- Open the **1-Impact_Region_Analysis_with_Remote_Sensing** notebook and execute it step by step.
- Open the **2-Model_training_and_Deployment** notebook and execute it step by step.
- Open the dashboard from the URL generated in the **2-Model_training_and_Deployment** notebook.<br>
Alternatively, open the dashboard using the RStudio IDE.

**This project contains Sample Materials, provided under license.** <br>
**Licensed Materials - Property of IBM.**<br>
**© Copyright IBM Corp. 2020, 2021. All Rights Reserved.**<br>
**US Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.**

**The satellite image data in this accelerator: Contains modified Copernicus Sentinel data 2020**