The provided scripts aim to evaluate the surface available on rooftops by detecting the objects located on them.
Table of contents
- Requirements
- Classification of occupancy
- LiDAR segmentation
- Image segmentation
- Combination of segmentation results
Image processing was run on a machine with 32 GiB of RAM and a 16 GiB, CUDA-compatible GPU (NVIDIA Tesla T4).
There are no specific hardware requirements for processing the LiDAR point clouds.
The scripts were developed with Python 3.8 on Ubuntu 20.04 OS.
All the dependencies required for the project are listed in requirements.in and requirements.txt. To install them:

- Create a Python virtual environment:

    python3 -m venv <dir_path>/<name of the virtual environment>
    source <dir_path>/<name of the virtual environment>/bin/activate

- Install the dependencies with pip >= 20.3:

    pip install -r requirements.txt

- requirements.txt was obtained by compiling requirements.in. Recompiling the file might lead to changes in the library versions:

    pip-compile requirements.in

- Requirements for the LiDAR workflows only:

    pip install -r requirements_lidar.txt
Specific libraries were used for image processing:
- PyTorch version 1.10
- CUDA version 11.3
- segment-geospatial version 0.10.2: the library was forked and adapted to our needs. Our fork is available on the STDL's git repository: https://github.com/swiss-territorial-data-lab/segment-geospatial/releases/tag/v1.0.0.
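A quick way to check that the installed versions match the ones listed above is a short sanity check with PyTorch (a minimal sketch):

```python
import torch

# Expected here: PyTorch 1.10.x built against CUDA 11.3 and a visible GPU.
print("PyTorch:", torch.__version__)
print("CUDA (build):", torch.version.cuda)
print("GPU available:", torch.cuda.is_available())
```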
Disclaimer: We do not guarantee that the scripts in the sandbox folder and the scripts not included in the workflows can be executed with the provided requirements.
The datasets used by all the workflows are described hereafter:
- Roof delimitation: vector shapefile CAD_BATIMENT_HORSOL_TOIT providing the roof planes by EGID;
- Ground truth of the roof objects: vector shapefile of the labels produced for this project and used for the assessments of the segmentation workflows:
- version 2023-11-10 LiDAR: ground truth used for the optimization and the assessment of the LiDAR segmentation. Most objects in the low height classes, such as lawn and terraces, have been removed from the dataset;
- version 2023-11-13: ground truth used for the optimization and the assessment of the image segmentation. It corresponds to the complete ground truth;
- EGID lists: selected buildings for the ground truth, identified by their federal number (EGID). The complete list of buildings is divided between the training and test datasets:
- EGID_GT_test.csv: 17 EGIDs selected to control the performance of the algorithm on a test dataset. It is provided here as an example to run the code with.
In addition, the workflows working with LiDAR need:
- LiDAR point clouds: the tiles of the 2019 flight over the Geneva canton were used. They are automatically downloaded through a script;
- Emprises tiles LiDAR 2019.shp: shapes corresponding to the LiDAR tiles.
The workflow for roof classification needs:
- Ground truth of the roof occupancy: vector shapefile of the roof delimitation with an attribute "class" giving the real roof occupancy.
    - Disclaimer: a subset of the ground truth used to train a random forest is provided here. It does not correspond to the data used to train the random forest during the project.
The workflow for the image segmentation needs:
- True orthophotos of the canton of Geneva: processed from aerial images acquired by the State of Geneva in 2019. RGB TIFF images with a spatial resolution of about 7 cm/px. The images are available on request from SITG.
- Image tile shapes: vector shapefile of the true orthophoto tiles available on request from SITG.
In this repository, only test data is supplied, along with a subset of the roof delimitations, to enable the user to run an example. The full datasets can be requested by contacting the STDL.
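As an illustration of how the supplied test data fit together, the roof planes can be restricted to the test EGIDs with GeoPandas; the paths and the column name "EGID" are assumptions to adapt to the local setup:

```python
import geopandas as gpd
import pandas as pd

# Placeholder paths: adapt them to where the test data were unzipped.
roofs = gpd.read_file("data/CAD_BATIMENT_HORSOL_TOIT.shp")
egids = pd.read_csv("data/EGID_GT_test.csv")

# Keep only the roof planes of the test buildings.
# The attribute name "EGID" is an assumption about the layer schema.
test_roofs = roofs[roofs["EGID"].isin(egids["EGID"])]
print(f"{len(test_roofs)} roof planes kept for {egids['EGID'].nunique()} EGIDs")
```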
This set of scripts classifies the roof planes as "occupied" or "potentially free" based on their roughness and intensity.
The script get_lidar_infos.py retrieves some characteristics of the point clouds.
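The sketch below only illustrates the kind of characteristics that can be read from a LiDAR tile; it assumes the laspy library and a placeholder file name, and is not a substitute for get_lidar_infos.py:

```python
import laspy
import numpy as np

# Placeholder file name: use one of the downloaded LiDAR tiles.
las = laspy.read("data/lidar_tile.las")

print("Number of points:", las.header.point_count)
print("Bounding box:", las.header.mins, las.header.maxs)
print("Classes present:", np.unique(np.asarray(las.classification)))
print("Intensity range:", int(las.intensity.min()), "-", int(las.intensity.max()))
```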
The following scripts are used to classify roof planes by occupancy:
- retrieve_point_clouds.py: downloads the tiles intersecting a vector layer. This script is borrowed from the LiDAR segmentation workflow.
- rasterize_intensity.py: creates an intensity raster for each LiDAR point cloud in the input directory.
    - The function used to produce the intensity rasters is LidarIdwInterpolation - WhiteboxTools.
- rasterize_roughness.py: creates a DEM, saves it as a raster, then estimates the multi-scale roughness from the DEM.
    - The function used to produce the DEM is LidarDigitalSurfaceModel - WhiteboxTools.
    - The function used to calculate the multi-scale roughness is MultiscaleRoughness - WhiteboxTools.
- get_zonal_stats.py: computes zonal statistics of intensity and roughness for each roof plane.
    - Roof planes smaller than 2 m² are classified as "occupied" and no zonal statistics are calculated; they are too small for a solar or vegetated installation.
    - If the LiDAR point cloud overlapping the roof plane is not classified as building, the plane is classified as "undefined". The presence of a roof at this location should be checked.
- Two methods were developed for the classification:
    - Use of manual thresholds without ground truth:
        - manual_thresholds.py: classifies the roofs using thresholds defined in the config file.
        - assess_classif_surfaces.py: if ground truth is provided later on, or if an expert assesses the result, calculates the precision of the classification, also called the "satisfaction rate".
    - Use of a random forest with ground truth:
        - random_forest.py (a minimal sketch of the underlying logic is given after this list):
            - train mode: if the "train" parameter is set to True, trains one model per office, saves the models as pickle files and assesses the quality of the classifications.
            - inference mode: if the "train" parameter is set to False, uses the trained models to make inferences about the roof planes.
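The sketch announced above illustrates the train/inference logic of random_forest.py with scikit-learn; the feature and class column names, file paths and model settings are assumptions, not the project's actual configuration:

```python
import pickle

import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

# Hypothetical table of zonal statistics with a ground-truth column "class".
stats = pd.read_csv("outputs/zonal_stats.csv")
features = ["intensity_mean", "intensity_std", "roughness_mean", "roughness_std"]

train = True  # mirrors the "train" parameter of random_forest.py
if train:
    # Train mode: fit a model, report its quality and save it as a pickle file.
    x_trn, x_tst, y_trn, y_tst = train_test_split(
        stats[features], stats["class"], test_size=0.3, random_state=42)
    model = RandomForestClassifier(n_estimators=100, random_state=42)
    model.fit(x_trn, y_trn)
    print(classification_report(y_tst, model.predict(x_tst)))
    with open("model.pkl", "wb") as f:
        pickle.dump(model, f)
else:
    # Inference mode: reload the trained model and classify the roof planes.
    with open("model.pkl", "rb") as f:
        model = pickle.load(f)
    stats["pred_class"] = model.predict(stats[features])
```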
Some tiles corresponding to the provided test data can be downloaded with the command below and unzipped by the user.
python scripts/pcd_segmentation/retrieve_point_clouds.py config/config_pcdseg_all_roofs.yaml
The command lines for the workflow are provided below.
python scripts/occupation_classification/rasterize_intensity.py config/config_occupation_classification.yaml
python scripts/occupation_classification/rasterize_roughness.py config/config_occupation_classification.yaml
python scripts/occupation_classification/get_zonal_stats.py config/config_occupation_classification.yaml
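As an illustration of what get_zonal_stats.py computes, zonal statistics per roof plane can be obtained along the following lines; this is a sketch assuming the rasterstats library, with placeholder paths and a projected CRS shared by all layers:

```python
import geopandas as gpd
from rasterstats import zonal_stats

# Placeholder inputs: roof planes and the rasters produced by the previous steps.
roofs = gpd.read_file("data/CAD_BATIMENT_HORSOL_TOIT.shp")

# Roof planes smaller than 2 m2 are set aside and directly classified as "occupied".
roofs = roofs[roofs.geometry.area >= 2].copy()

# Mean, standard deviation and median of intensity and roughness per roof plane.
for name, raster in [("intensity", "outputs/intensity.tif"),
                     ("roughness", "outputs/roughness.tif")]:
    stats = zonal_stats(roofs.geometry, raster, stats=["mean", "std", "median"])
    for key in ["mean", "std", "median"]:
        roofs[f"{name}_{key}"] = [s[key] for s in stats]

roofs.to_file("outputs/roofs_with_stats.gpkg", driver="GPKG")
```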
When no ground truth is available, the classification can be performed with the script manual_thresholds.py, using thresholds calibrated manually by an operator. The results can then, if needed, be assessed by experts and used as ground truth.
python scripts/occupation_classification/manual_thresholds.py config/config_occupation_classification.yaml
python scripts/assessment/assess_classif_surfaces.py config/config_occupation_classification.yaml
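Classification by manual thresholds boils down to simple rules on the zonal statistics; the snippet below is purely illustrative, with hypothetical column names and threshold values (the real ones are set in the configuration file):

```python
import geopandas as gpd

roofs = gpd.read_file("outputs/roofs_with_stats.gpkg")

# Illustrative rule: smooth and homogeneous planes are "potentially free".
free = (roofs["roughness_mean"] < 2.5) & (roofs["intensity_std"] < 1000)
roofs["occupation"] = "occupied"
roofs.loc[free, "occupation"] = "potentially free"
roofs.to_file("outputs/roofs_classified.gpkg", driver="GPKG")
```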
When ground truth is available, the classification can be performed and assessed with the script random_forest.py.
python scripts/occupation_classification/random_forest.py config/config_occupation_classification.yaml
Other scripts can be found in the folder scripts/occupation_classification. Their goal is to detect objects based on intensity. As the results were not satisfactory, they were not implemented in the final workflow.
This set of scripts is dedicated to the segmentation of rooftop objects in the LiDAR point cloud. The workflow is based on Open3D. It assumes that roofs are composed of flat planes and that obstacles protrude from them.
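The core idea, fitting the main roof plane with RANSAC and clustering the points that stick out, can be sketched with Open3D as follows; the file name and parameter values are illustrative, not those used by the workflow:

```python
import numpy as np
import open3d as o3d

# Placeholder file: a point cloud already cropped to a single roof.
pcd = o3d.io.read_point_cloud("outputs/roof_egid_123.ply")

# Fit the main (flat) roof plane with RANSAC.
plane_model, inliers = pcd.segment_plane(distance_threshold=0.1,
                                         ransac_n=3,
                                         num_iterations=1000)

# Points off the plane are potential obstacles; group them with DBSCAN.
obstacles = pcd.select_by_index(inliers, invert=True)
labels = np.array(obstacles.cluster_dbscan(eps=0.5, min_points=10))
n_clusters = labels.max() + 1 if labels.size else 0
print(f"{n_clusters} obstacle clusters found")
```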
The following scripts are used to segment the LiDAR point cloud:
- retrieve_point_clouds.py: downloads the point clouds;
- prepare_data.py: reads and filters the roofs of the selected EGIDs in the point clouds;
- pcd_segmentation.py: segments the point clouds into planes and clusters;
- vectorization.py: creates 2D polygons from the segmented point clouds;
- post_processing.py: merges the results for the pitched and other roofs and simplifies the geometry of the detections;
- assess_results.py: evaluates the results by comparing them with the ground truth, calculates metrics and tags the detections;
- assess_area.py: calculates the free and occupied surface of each EGID and compares it with the ground truth.
An additional script is available:
- optimize_hyperparam_LiDAR.py: optimizes the hyperparameters to maximize the f1 score and the median IoU. Based on the Optuna framework.
The command lines for the workflow are provided below.
First, the LiDAR point cloud tiles have to be downloaded with the command below and unzipped by the user.
python scripts/pcd_segmentation/retrieve_point_clouds.py config/config_pcdseg_all_roofs.yaml
After that, the point cloud segmentation is performed. Specific parameters are used for pitched roofs.
python scripts/pcd_segmentation/prepare_data.py config/config_pcdseg_all_roofs.yaml
python scripts/pcd_segmentation/pcd_segmentation.py config/config_pcdseg_all_roofs.yaml
python scripts/pcd_segmentation/vectorization.py config/config_pcdseg_all_roofs.yaml
python scripts/pcd_segmentation/prepare_data.py config/config_pcdseg_pitched_roofs.yaml
python scripts/pcd_segmentation/pcd_segmentation.py config/config_pcdseg_pitched_roofs.yaml
python scripts/pcd_segmentation/vectorization.py config/config_pcdseg_pitched_roofs.yaml
Then, the results for the pitched and other roofs are merged. Their geometry is simplified with buffering and cropping operations, as well as with the Visvalingam-Wyatt algorithm. The different obstacles are merged together to form the occupied surfaces.
python scripts/pcd_segmentation/post_processing.py config/config_pcdseg_all_roofs.yaml
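The kind of geometric clean-up described above can be sketched as follows; it assumes the simplification package for the Visvalingam-Wyatt step, and the paths, buffer size and tolerance are illustrative only:

```python
import geopandas as gpd
import numpy as np
from shapely.geometry import Polygon
from simplification.cutil import simplify_coords_vw

detections = gpd.read_file("outputs/detections.gpkg")

# Closing-like operation: a positive then negative buffer merges nearby obstacles
# and removes small spikes in the outlines.
cleaned = detections.buffer(0.5).buffer(-0.5)

# Visvalingam-Wyatt simplification of the exterior ring of each (single-part) polygon.
simplified = [Polygon(simplify_coords_vw(np.array(geom.exterior.coords), 0.1))
              for geom in cleaned
              if geom.geom_type == "Polygon" and not geom.is_empty]

gpd.GeoDataFrame(geometry=simplified, crs=detections.crs).to_file(
    "outputs/detections_simplified.gpkg", driver="GPKG")
```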
Finally, the results are assessed:
python scripts/assessment/assess_results.py config/config_pcdseg_all_roofs.yaml
python scripts/assessment/assess_area.py config/config_pcdseg_all_roofs.yaml
The workflow described here works with the training subset of the ground truth, which was used for the optimization of the hyperparameters. The configuration file config_pcdseg_test.yaml works with the test subset of the ground truth, allowing a test on buildings not considered in the optimization.
The optimization of the hyperparameters can be performed as follows:
python scripts/pcd_segmentation/optimize_hyperparam_LiDAR.py config/config_pcdseg_all_roofs.yaml
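The optimization follows the usual Optuna pattern of an objective function returning the metrics to maximize; in the sketch below, run_segmentation_and_assessment is a hypothetical stand-in for running the actual segmentation and assessment scripts, and the search space is illustrative:

```python
import optuna

def run_segmentation_and_assessment(params):
    # Hypothetical stand-in: the actual script runs the segmentation and the
    # assessment with these parameters and returns the resulting metrics.
    return 0.0, 0.0

def objective(trial):
    # Illustrative search space, not the workflow's actual parameters.
    params = {
        "distance_threshold": trial.suggest_float("distance_threshold", 0.02, 0.3),
        "eps": trial.suggest_float("eps", 0.2, 2.0),
        "min_points": trial.suggest_int("min_points", 5, 50),
    }
    f1, median_iou = run_segmentation_and_assessment(params)
    return f1, median_iou

# Both metrics (f1 score and median IoU) are maximized jointly.
study = optuna.create_study(directions=["maximize", "maximize"])
study.optimize(objective, n_trials=100)
print(study.best_trials)  # Pareto-optimal trials of the multi-objective study
```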
This set of scripts is dedicated to the segmentation of objects in images. The segmentation is based on a deep learning method using SAM (Segment Anything Model). The final product is a vector layer of the objects detected on the selected roofs.
- generate_tiles.py: generates custom tiles of the roof extent;
- segment_images.py: creates detection masks and vectorizes them. Images are segmented with segment-geospatial, which provides a practical framework for using SAM (Segment Anything Model) with georeferenced data (a minimal sketch of this step is given after this list);
- produce_vector_layer.py: filters the vector layer for each building and aggregates all the layers into a single one (detected objects);
- assess_results.py: evaluates the results by comparing them with the ground truth, calculates metrics and tags the detections;
- assess_area.py: calculates the free and occupied surface of each EGID and compares it with the ground truth;
- optimize_hyperparameters.py: optimizes the SAM hyperparameters to maximize the desired metrics (f1 score, median IoU, precision, recall, ...). Based on the Optuna framework.
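As an illustration of the segmentation step, automatic mask generation with segment-geospatial can be used roughly as follows; this is a sketch assuming the upstream samgeo API around version 0.10, with placeholder paths, an illustrative SAM parameterization and a generic vectorization with rasterio (our fork and the actual script may differ):

```python
import geopandas as gpd
import rasterio
from rasterio import features
from samgeo import SamGeo
from shapely.geometry import shape

# Placeholder checkpoint and tile; the SAM parameters are illustrative only.
sam = SamGeo(model_type="vit_h",
             checkpoint="sam_vit_h_4b8939.pth",
             sam_kwargs={"points_per_side": 32, "pred_iou_thresh": 0.88})

# Automatic mask generation on a georeferenced tile, written as a mask raster.
sam.generate(source="outputs/tiles/tile_123.tif", output="outputs/masks/tile_123.tif")

# Vectorize the mask raster into polygons.
with rasterio.open("outputs/masks/tile_123.tif") as src:
    mask = src.read(1)
    crs, transform = src.crs, src.transform

polygons = [shape(geom) for geom, value in features.shapes(mask, transform=transform)
            if value > 0]
gpd.GeoDataFrame(geometry=polygons, crs=crs).to_file(
    "outputs/vectors/tile_123.gpkg", driver="GPKG")
```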
The workflow can be run by issuing the following list of commands:
python scripts/image_segmentation/generate_tiles.py config/config_imgseg.yaml
python scripts/image_segmentation/segment_images.py config/config_imgseg.yaml
python scripts/image_segmentation/filter_merge_detections.py config/config_imgseg.yaml
python scripts/assessment/assess_results.py config/config_imgseg.yaml
python scripts/assessment/assess_area.py config/config_imgseg.yaml
The optimization of the hyperparameters can be performed as follows:
python scripts/image_segmentation/optimize_hyperparameters.py config/config_imgseg.yaml
LiDAR and image segmentation results can be combined. Two methods are used (both are sketched after the list below):
- Polygon concatenation: the detection polygons obtained from LiDAR segmentation and image segmentation are concatenated.
- Polygon filtering with spatial join: the detection polygons obtained from image segmentation are filtered, retaining only polygons that overlap those obtained from LiDAR segmentation.
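Both combination strategies map to standard GeoPandas operations, as sketched below with placeholder layer paths:

```python
import geopandas as gpd
import pandas as pd

lidar_dets = gpd.read_file("outputs/lidar_detections.gpkg")
image_dets = gpd.read_file("outputs/image_detections.gpkg")

# Method 1: concatenation of both sets of detection polygons.
concatenated = pd.concat([lidar_dets, image_dets], ignore_index=True)

# Method 2: keep only the image detections intersecting a LiDAR detection.
joined = gpd.sjoin(image_dets, lidar_dets[["geometry"]],
                   how="inner", predicate="intersects")
filtered = joined[~joined.index.duplicated(keep="first")]
```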
- combine_results_seg.py: combines the results of the LiDAR segmentation and the image segmentation using concatenation ("concatenation") and spatial join ("sjoin") of the polygons;
- assess_results.py: evaluates the results by comparing them with the ground truth, calculates metrics and tags the detections. The combination method to assess is specified in the configuration file;
- assess_area.py: calculates the free and occupied surface of each EGID and compares it with the ground truth. The combination method to assess is specified in the configuration file.
The workflow can be run by issuing the following list of commands:
python scripts/assessment/combine_results_seg.py config/config_combine_seg.yaml
python scripts/assessment/assess_results.py config/config_combine_seg.yaml
python scripts/assessment/assess_area.py config/config_combine_seg.yaml
The scripts written for additional developments and not retained in the final workflow can be found in the sandbox
folder. We do not provide the environment and the necessary files to test those scripts.
The suitability of a roof to host a solar or vegetated installation can be estimated based on the roof slope and area. The selection of roofs based on these approximated parameters was tested. It was not integrated into this workflow, as other teams are already working on a more sophisticated filtering procedure.
Data: This workflow is based on the following layers, available in the SITG catalog.
- CAD_BATIMENT_HORSOL_TOIT.shp: roof planes of above-ground buildings.
- OCEN_SOLAIRE_ID_SURFACE_BASE.shp: roofs, sheds and parking areas.
- FTI_PERIMETRE.shp: perimeters of the industrial zones managed by the Foundation for Industrial Lands of Geneva.
- DPS_ENSEMBLE.shp & DPS_CLASSEMENT.shp: architectural and landscape surveys of the canton, archaeological and archival research sites, and scientific inventories. Listed buildings in accordance with the cantonal law on the protection of monuments and sites.
Requirements
- There are no hardware or software requirements.
- All the scripts were developed with Python 3.11.
Workflow
A single script applies all the filters.
python scripts/sandbox/filter_by_attributes.py config/config_sandbox.yaml
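The filtering itself amounts to attribute queries on the layers listed above; the snippet below is only a schematic example, with a hypothetical slope attribute and illustrative thresholds (the real criteria are set in the configuration file):

```python
import geopandas as gpd

roofs = gpd.read_file("data/CAD_BATIMENT_HORSOL_TOIT.shp")

# Illustrative suitability filter on the approximated slope and area.
# The attribute name "PENTE_MOYENNE" and the thresholds are assumptions.
suitable = roofs[(roofs["PENTE_MOYENNE"] <= 30) & (roofs.geometry.area >= 10)]
suitable.to_file("outputs/suitable_roofs.gpkg", driver="GPKG")
```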
We tested the deep learning algorithm developed by flai to classify LiDAR point clouds. flai applied its algorithm to the LiDAR data we provided and vectorized the clusters of the class "roof objects". A script was written to assess the results by comparison with the ground truth.
python scripts/sandbox/assess_flai.py config/config_sandbox.yaml