[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/forestdatapartnership/whisp/blob/main/notebooks/Colab_whisp_geojson_to_csv.ipynb)

# Whisp a geojson

Python Notebook pathway for [Whisp](https://openforis.org/solutions/whisp/) running in the cloud via [Google Colab](https://colab.google/).

**To open:** click the badge at the top.

**To run:** click the play buttons (or press Shift + Enter).

**Requirements:** Google Earth Engine (GEE) account and registered cloud project.



- **Aim:** support compliance with zero-deforestation regulations
- **Input:** GeoJSON file of plot boundaries or points
- **Output:** CSV table and GeoJSON file containing statistics and risk indicators
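Before uploading, it can help to confirm that the input really is a GeoJSON FeatureCollection of plot boundaries or points. The following is a minimal sketch using only the standard library; the helper name `summarize_geojson` and the inline sample are hypothetical and are not part of Whisp.

```python
import json
from collections import Counter


def summarize_geojson(text):
    """Return a count of geometry types in a GeoJSON FeatureCollection string."""
    data = json.loads(text)
    if data.get("type") != "FeatureCollection":
        raise ValueError("expected a GeoJSON FeatureCollection")
    # Features with a null geometry are counted under "None"
    return Counter(
        (feature.get("geometry") or {}).get("type", "None")
        for feature in data.get("features", [])
    )


# Hypothetical two-feature sample: one plot boundary and one point
sample = json.dumps({
    "type": "FeatureCollection",
    "features": [
        {"type": "Feature", "properties": {},
         "geometry": {"type": "Polygon",
                      "coordinates": [[[0, 0], [0, 1], [1, 1], [0, 0]]]}},
        {"type": "Feature", "properties": {},
         "geometry": {"type": "Point", "coordinates": [0.5, 0.5]}},
    ],
})
geometry_counts = summarize_geojson(sample)
print(dict(geometry_counts))
```

For a real file, the same function could be called on the text read from disk, for example `summarize_geojson(open(path).read())`.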

### Setup Google Earth Engine

In [60]:
import ee

# Google Earth Engine project name: change to your own project.
# If unsure, see https://developers.google.com/earth-engine/cloud/assets
gee_project_name = "ee-dnsalazar10"

# Opens a browser window to authorize access
ee.Authenticate()

# initialize with chosen project
ee.Initialize(project=gee_project_name)

### Install and import packages

In [61]:
# Install openforis-whisp (if not already installed)
!pip install --pre openforis-whisp



In [62]:
import openforis_whisp as whisp

### Get a geojson

- Files are stored temporarily and can be viewed in the panel on the left (click the Folder icon).
- Press refresh if updates are not showing.
- Alternatively, you can work with files in your Google Drive: drive.mount('/content/drive') (after running: from google.colab import drive)
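As a rough sketch of the Google Drive option, the snippet below wraps the mount call and builds a path under the mounted drive. The helper names and the example file name are hypothetical; `/content/drive/MyDrive` is the usual location of "My Drive" after mounting in Colab.

```python
from pathlib import PurePosixPath

DRIVE_MOUNT_POINT = "/content/drive"  # conventional Colab mount point


def mount_drive():
    """Mount Google Drive; this only works inside a Colab runtime."""
    from google.colab import drive  # imported lazily so the helper loads elsewhere
    drive.mount(DRIVE_MOUNT_POINT)


def drive_path(relative_path):
    """Return the path of a file stored under 'My Drive' once mounted."""
    return str(PurePosixPath(DRIVE_MOUNT_POINT) / "MyDrive" / relative_path)


# Hypothetical file location; in Colab, call mount_drive() before reading it
example_path = drive_path("whisp/test1_poly.geojson")
print(example_path)
```

Files read from or written to Drive persist after the Colab session ends, unlike files in temporary storage.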

In [63]:
# Function to upload a geojson file. Download an example here: https://github.com/andyarnell/whisp/tree/package-test-new-structure/tests/fixtures
def import_geojson():
    from google.colab import files  # Colab-only upload helper

    # files.upload() returns {filename: bytes}; take the first uploaded file
    fn, content = next(iter(files.upload().items()))
    path = f'/content/{fn}'
    with open(path, 'wb') as f:
        f.write(content)
    return path

In [64]:
GEOJSON_EXAMPLE_FILEPATH = import_geojson()
print(f"GEOJSON_EXAMPLE_FILEPATH: {GEOJSON_EXAMPLE_FILEPATH}")

Saving test1_poly.geojson to test1_poly.geojson
GEOJSON_EXAMPLE_FILEPATH: /content/test1_poly.geojson


### Whisp it

In [65]:
# Choose countries to process (currently three countries: 'co', 'ci', 'br')
iso2_codes_list = ['co', 'ci', 'br']  # Example ISO2 codes for including country specific data
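Since only three country codes are listed as currently supported, a small check can catch typos before any processing starts. This is a sketch under that assumption; `SUPPORTED_ISO2` and `clean_iso2_codes` are hypothetical names, and the supported set may differ between package versions.

```python
# Codes named in this notebook as currently supported (may change between releases)
SUPPORTED_ISO2 = {"co", "ci", "br"}


def clean_iso2_codes(codes):
    """Lower-case, strip and de-duplicate codes, rejecting unsupported ones."""
    cleaned = []
    for code in codes:
        normalized = code.strip().lower()
        if normalized not in SUPPORTED_ISO2:
            raise ValueError(f"unsupported country code: {code!r}")
        if normalized not in cleaned:
            cleaned.append(normalized)
    return cleaned


checked_codes = clean_iso2_codes(["CO", " ci", "br", "co"])
print(checked_codes)
```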

In [78]:
import pandas as pd
import geopandas as gpd

# Read the geojson file directly into a GeoDataFrame
gdf = gpd.read_file(GEOJSON_EXAMPLE_FILEPATH)

# Convert any datetime columns to strings; datetime values would otherwise fail to serialize when sent to Earth Engine
for col in gdf.columns:
    if pd.api.types.is_datetime64_any_dtype(gdf[col]):
        gdf[col] = gdf[col].astype(str)

# Convert the GeoDataFrame to an Earth Engine FeatureCollection
ee_feature_collection = ee.FeatureCollection(gdf.__geo_interface__)

# Process the Earth Engine FeatureCollection with whisp
df_stats = whisp.whisp_formatted_stats_ee_to_df(
    ee_feature_collection,
    # external_id_column="user_id",# optional - specify which input column/property to map to the external ID.
    national_codes=iso2_codes_list,
    # unit_type='percent', # optional - to change unit type. Default is 'ha'.
    )

Whisp multiband image compiled
Creating schema for national_codes: ['co', 'ci', 'br']
external_id
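The datetime-to-string step in the cell above matters because feature properties have to be encoded as JSON-like values, and JSON has no date type. The standard-library sketch below illustrates the same idea on a single record; the property names are hypothetical, and it does not reproduce how geopandas or Earth Engine handle the conversion internally.

```python
import json
from datetime import datetime

# Hypothetical feature properties; "survey_date" stands in for a datetime column
properties = {"plot_name": "A1", "survey_date": datetime(2020, 12, 31)}

try:
    json.dumps(properties)
    serializable = True
except TypeError:
    # datetime objects have no JSON representation
    serializable = False

# Convert datetimes to ISO 8601 strings, leaving other values unchanged
converted = {
    key: value.isoformat() if isinstance(value, datetime) else value
    for key, value in properties.items()
}
encoded = json.dumps(converted)
print(serializable, encoded)
```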


### Display results

In [79]:
df_stats

Unnamed: 0,plotId,external_id,Area,Geometry_type,Country,ProducerCountry,Admin_Level_1,Centroid_lon,Centroid_lat,Unit,...,nBR_MapBiomas_col9_palmoil_2020,nBR_MapBiomas_col9_pc_2020,nBR_INPE_TCamz_cer_annual_2020,nBR_MapBiomas_col9_soy_2020,nBR_MapBiomas_col9_annual_crops_2020,nBR_INPE_TCamz_pasture_2020,nBR_INPE_TCcer_pasture_2020,nBR_MapBiomas_col9_pasture_2020,nCI_Cocoa_bnetd,geo
0,1,,154.604996,Polygon,COL,CO,Quindío,-75.783744,4.419095,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7912..."
1,2,,27.325001,MultiPolygon,COL,CO,Quindío,-75.780787,4.421473,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7816..."
2,3,,7.255,Polygon,COL,CO,Quindío,-75.779172,4.419302,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7809..."
3,4,,4.249,Polygon,COL,CO,Quindío,-75.784319,4.41691,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7853..."
4,5,,6.043,Polygon,COL,CO,Quindío,-75.787801,4.420453,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7892..."
5,6,,0.431,Polygon,COL,CO,Quindío,-75.782231,4.418663,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7826..."
6,7,,0.289,Polygon,COL,CO,Quindío,-75.78736,4.41889,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7877..."
7,8,,10.654,Polygon,COL,CO,Quindío,-75.788942,4.421013,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7904..."
8,9,,0.43,Polygon,COL,CO,Quindío,-75.78375,4.422371,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7841..."
9,10,,19.841,Polygon,COL,CO,Quindío,-75.786466,4.418606,ha,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,"{'type': 'Polygon', 'coordinates': [[[-75.7893..."


### Add risk category columns

In [80]:
# Adds risk columns to the end of the dataframe
df_w_risk = whisp.whisp_risk(df=df_stats, national_codes=iso2_codes_list)

Using unit type: ha


### Display updated table
- Scroll to far right to see additions

In [81]:
df_w_risk

Unnamed: 0,plotId,external_id,Area,Geometry_type,Country,ProducerCountry,Admin_Level_1,Centroid_lon,Centroid_lat,Unit,...,Ind_05_primary_2020,Ind_06_nat_reg_forest_2020,Ind_07_planted_plantations_2020,Ind_08_planted_plantations_after_2020,Ind_09_treecover_after_2020,Ind_10_agri_after_2020,Ind_11_logging_concession_before_2020,risk_pcrop,risk_acrop,risk_timber
0,1,,154.604996,Polygon,COL,CO,Quindío,-75.783744,4.419095,ha,...,no,yes,no,no,yes,yes,no,low,low,low
1,2,,27.325001,MultiPolygon,COL,CO,Quindío,-75.780787,4.421473,ha,...,no,yes,no,no,yes,yes,no,low,low,low
2,3,,7.255,Polygon,COL,CO,Quindío,-75.779172,4.419302,ha,...,no,yes,no,no,yes,yes,no,low,low,low
3,4,,4.249,Polygon,COL,CO,Quindío,-75.784319,4.41691,ha,...,no,yes,no,no,yes,yes,no,low,low,low
4,5,,6.043,Polygon,COL,CO,Quindío,-75.787801,4.420453,ha,...,no,yes,no,no,yes,yes,no,low,low,low
5,6,,0.431,Polygon,COL,CO,Quindío,-75.782231,4.418663,ha,...,no,yes,no,no,yes,yes,no,more_info_needed,more_info_needed,high
6,7,,0.289,Polygon,COL,CO,Quindío,-75.78736,4.41889,ha,...,no,yes,no,no,no,yes,no,low,low,low
7,8,,10.654,Polygon,COL,CO,Quindío,-75.788942,4.421013,ha,...,no,yes,no,no,yes,yes,no,low,low,low
8,9,,0.43,Polygon,COL,CO,Quindío,-75.78375,4.422371,ha,...,no,yes,no,no,yes,yes,no,more_info_needed,more_info_needed,high
9,10,,19.841,Polygon,COL,CO,Quindío,-75.786466,4.418606,ha,...,no,yes,no,no,yes,yes,no,low,low,low


### Export table with risk columns to CSV (temporary storage)

In [82]:
df_w_risk.to_csv("whisp_output_table_w_risk.csv", index=False)
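Once the table is exported, a quick tally of the risk categories gives an overview without opening the file. The sketch below uses only the standard library; `count_risk_levels` is a hypothetical helper, and the sample rows are a small subset copied from the table displayed earlier (plots 1, 5 and 6).

```python
import csv
import io
from collections import Counter


def count_risk_levels(csv_file, column="risk_pcrop"):
    """Count how many rows fall into each category of a risk column."""
    reader = csv.DictReader(csv_file)
    return Counter(row[column] for row in reader)


# Illustrative subset of columns and rows from the results table
sample_csv = io.StringIO(
    "plotId,risk_pcrop,risk_acrop,risk_timber\n"
    "1,low,low,low\n"
    "5,low,low,low\n"
    "6,more_info_needed,more_info_needed,high\n"
)
risk_counts = count_risk_levels(sample_csv)
print(dict(risk_counts))
```

To run it on the exported file, open it with `open("whisp_output_table_w_risk.csv", newline="")` and pass the file object to `count_risk_levels`.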

### Export table with risk columns to geojson (temporary storage)

In [83]:
# Builds a geojson file containing the Whisp columns.
# Uses the geometry column "geo" to create the spatial features.
whisp.convert_df_to_geojson(df_w_risk, "whisp_output_table_w_risk.geojson")

GeoJSON saved to whisp_output_table_w_risk.geojson
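Conceptually, an export of this kind pairs each row's geometry with its remaining columns as feature properties. The sketch below shows that idea with plain dictionaries; it is not the implementation of `whisp.convert_df_to_geojson`, it assumes the "geo" value is already a geometry dictionary, and the row shown uses made-up coordinates.

```python
import json


def rows_to_feature_collection(rows, geometry_key="geo"):
    """Build a GeoJSON FeatureCollection from row dictionaries."""
    features = []
    for row in rows:
        # Everything except the geometry becomes a feature property
        properties = {k: v for k, v in row.items() if k != geometry_key}
        features.append({
            "type": "Feature",
            "geometry": row[geometry_key],
            "properties": properties,
        })
    return {"type": "FeatureCollection", "features": features}


# Hypothetical row; the coordinates are made up for illustration
rows = [{
    "plotId": 1,
    "risk_pcrop": "low",
    "geo": {"type": "Point", "coordinates": [-75.78, 4.42]},
}]
collection = rows_to_feature_collection(rows)
print(json.dumps(collection, indent=2))
```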


### Download outputs to local storage
- Saves files in the "Downloads" folder on your machine.
- If you see a "Downloads blocked" button at the top of the browser, click it to allow file downloads.
- Alternatively, right-click on the file in the folder (in the panel on your left) and choose 'Download'.

In [84]:
from google.colab import files
files.download('whisp_output_table_w_risk.csv')


In [None]:
files.download('whisp_output_table_w_risk.geojson') # spatial output