A Deep Gravity model for mobility flows generation

Citing

If you use the code in this repository, please cite our paper:

F. Simini, G. Barlacchi, M. Luca, L. Pappalardo, A Deep Gravity model for mobility flows generation, Nature Communications 12, 6576 (2021). https://doi.org/10.1038/s41467-021-26752-4

@article{Simini2021,
author = {Simini, Filippo and Barlacchi, Gianni and Luca, Massimiliano and Pappalardo, Luca},
doi = {10.1038/s41467-021-26752-4},
issn = {2041-1723},
journal = {Nature Communications},
number = {1},
pages = {6576},
title = {{A Deep Gravity model for mobility flows generation}},
url = {https://doi.org/10.1038/s41467-021-26752-4},
volume = {12},
year = {2021}}

and the official code repository:

Abstract

The movements of individuals within and among cities influence critical aspects of our society, such as well-being, the spreading of epidemics, and the quality of the environment. When information about mobility flows is not available for a particular region of interest, we must rely on mathematical models to generate them. We propose Deep Gravity, an effective model to generate flow probabilities that exploits many features (e.g., land use, road network, transport, food, health facilities) extracted from voluntary geographic data, and uses deep neural networks to discover non-linear relationships between those features and mobility flows. Our experiments, conducted on mobility flows in England, Italy, and New York State, show that Deep Gravity achieves a significant increase in performance, especially in densely populated regions of interest, with respect to the classic gravity model and models that do not use deep neural networks or geographic data. Deep Gravity has good generalization capability, generating realistic flows also for geographic areas for which there is no data availability for training. Finally, we show how flows generated by Deep Gravity may be explained in terms of the geographic features and highlight crucial differences among the three considered countries interpreting the model’s prediction with explainable AI techniques.

Figure 1. Performance, in terms of Common Part of Commuters (CPC), of Deep Gravity (DG) vs. the gravity model (G) in a highly populated area in England

Architecture of Deep Gravity

To generate the flows from a given origin location (e.g., $l_i$ ), Deep Gravity uses a number of input features to compute the probability $p_{i,j}$ that any of the $n$ locations in the region of interest (e.g., $l_j$ ) is the destination of a trip from $l_i$ . Specifically, the model output is an $n$-dimensional vector of probabilities $p_{i,j}$ for $j = 1, \dots, n$ . These probabilities are computed in three steps (see figure below).

Figure 2. Architecture of Deep Gravity

The input vectors $x(l_i, l_j) = concat[x_i, x_j, r_{i,j}]$ for $j = 1, \dots, n$ are obtained by concatenating the following input features: $x_i$ , the feature vector of the origin location $l_i$ ; $x_j$ , the feature vector of the destination location $l_j$ ; and $r_{i, j}$ , the distance between origin and destination. For each origin location (e.g., $l_i$ ), $n$ input vectors $x(l_i, l_j)$ with $j = 1, \dots, n$ are created, one for each location in the region of interest that could be a potential destination.
The input vectors $x(l_i, l_j)$ are fed in parallel to the same feed-forward neural network. The network has 15 hidden layers of dimensions 256 (the bottom six layers) and 128 (the other layers) with LeakyReLU activation function, $a$ . Specifically, the output of hidden layer $h$ is given by the vector $z^{(0)}(l_i, l_j) = a(W^{(0)} \cdot x(l_i, l_j))$ for the first layer ( $h=0$ ) and $z^{(h)}(l_i, l_j) = a(W^{(h)} \cdot z^{(h - 1)}(l_i, l_j))$ for $h>0$ , where $W^{(h)}$ are matrices whose entries are parameters learned during training.
The output of the last layer is a scalar $s(l_i, l_j) \in (-\infty, \infty)$ called the score: the higher the score for a pair of locations $(l_i, l_j)$ , the higher the probability of observing a trip from $l_i$ to $l_j$ according to the model. Finally, the scores are transformed into probabilities using a softmax function, $p_{i,j} = e^{s(l_i, l_j)} / \sum_{k} e^{s(l_i, l_k)}$ , which transforms all scores into positive numbers that sum up to one. The generated flow between two locations is then obtained by multiplying the probability (i.e., the model's output) by the origin's total outflow.
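The second and third steps can be illustrated with a toy forward pass. The following is a minimal sketch, not the repository's implementation: it uses plain Python instead of PyTorch, a tiny network (3 inputs, two hidden layers of width 4) instead of the 39 inputs and 15 hidden layers described above, random untrained weights, and made-up input values. The LeakyReLU negative slope of 0.01 and the linear (activation-free) output layer are assumptions of the sketch.

```python
import math
import random

def leaky_relu(vector, negative_slope=0.01):
    """LeakyReLU applied element-wise (the slope value is an assumption)."""
    return [v if v > 0 else negative_slope * v for v in vector]

def matvec(matrix, vector):
    """Matrix-vector product W . z."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]

def score(x, hidden_weights, output_weights):
    """z^(0) = a(W^(0) x), z^(h) = a(W^(h) z^(h-1)), then a scalar score."""
    z = x
    for weights in hidden_weights:
        z = leaky_relu(matvec(weights, z))
    # Linear output layer without activation (an assumption of this sketch).
    return sum(w * v for w, v in zip(output_weights, z))

def softmax(scores):
    """Turn real-valued scores into positive numbers that sum to one."""
    top = max(scores)  # subtracting the maximum avoids overflow in exp
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

rng = random.Random(0)
dims = [3, 4, 4]  # toy sizes: input dimension followed by hidden widths
hidden_weights = [
    [[rng.uniform(-0.5, 0.5) for _ in range(dims[k])] for _ in range(dims[k + 1])]
    for k in range(len(dims) - 1)
]
output_weights = [rng.uniform(-0.5, 0.5) for _ in range(dims[-1])]

# One made-up input vector x(l_i, l_j) per candidate destination of origin l_i.
inputs = [[0.2, 0.7, 1.5], [0.9, 0.1, 3.0], [0.4, 0.4, 0.5]]
scores = [score(x, hidden_weights, output_weights) for x in inputs]
probabilities = softmax(scores)

total_outflow = 1000  # hypothetical total outflow of the origin
flows = [p * total_outflow for p in probabilities]
```

Because the same weights are applied to every input vector, the destinations of one origin can be scored in parallel, and the softmax guarantees that the generated flows add up to the origin's total outflow.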

The location feature vector $x_i$ provides a spatial representation of an area, and it contains features describing some properties of location $l_i$ , e.g., the total length of residential roads or the number of restaurants therein. Its dimension, $d$ , is equal to the total number of features considered. The location features we use include the population size of each location and geographical features extracted from OpenStreetMap belonging to the following categories:

Land use areas (5 features): total area (in square km) for each possible land use class, i.e., residential, commercial, industrial, retail and natural;
Road network (3 features): total length (in km) for each type of road, i.e., residential, main and other;
Transport facilities (2 features): total count of Points Of Interest (POIs) and buildings related to each possible transport facility, e.g., bus/train station, bus stop, car parking;
Food facilities (2 features): total count of POIs and buildings related to food facilities, e.g., bar, cafe, restaurant;
Health facilities (2 features): total count of POIs and buildings related to health facilities, e.g., clinic, hospital, pharmacy;
Education facilities (2 features): total count of POIs and buildings related to education facilities, e.g., school, college, kindergarten;
Retail facilities (2 features): total count of POIs and buildings related to retail facilities, e.g., supermarket, department store, mall.

In addition, Deep Gravity includes as a feature the geographic distance, $r_{i, j}$ , between two locations $l_i$ and $l_j$ , which is defined as the distance measured along the surface of the earth between the centroids of the two polygons representing the locations. All feature values for a given location (excluding distance) are normalized by dividing them by the location's area.
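As an illustration of these two operations, here is a minimal sketch under stated assumptions: the distance along the earth's surface is approximated with the haversine formula on a sphere of radius 6371 km (the repository may compute it differently), centroids are given as latitude/longitude pairs, and the function names, feature names and coordinates are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; a spherical Earth is an assumption

def great_circle_km(lat1, lon1, lat2, lon2):
    """Haversine distance in km between two (latitude, longitude) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # min() guards against rounding pushing the value slightly above 1.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))

def normalize_by_area(features, area_km2):
    """Divide every feature value of a location by the location's area."""
    return {name: value / area_km2 for name, value in features.items()}

# Hypothetical centroids of two locations (roughly London and Manchester).
r_ij = great_circle_km(51.5074, -0.1278, 53.4808, -2.2426)

# Hypothetical raw features of a location covering 2 square km.
x_raw = {"restaurants": 10, "residential_road_km": 5.0}
x_normalized = normalize_by_area(x_raw, area_km2=2.0)
```

Dividing by the area turns raw counts and lengths into densities, so that locations of different sizes become comparable.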

Each flow in Deep Gravity is hence described by 39 features (18 geographic features of the origin and 18 of the destination, distance between origin and destination, and their populations).
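The count of 39 can be checked with a small sketch of the concatenation $concat[x_i, x_j, r_{i,j}]$ . The helper names and all values below are hypothetical, and placing the population after the 18 geographic features is an assumption about ordering, not something the text specifies.

```python
N_GEOGRAPHIC_FEATURES = 18  # 5 land use + 3 road network + 5 facility categories x 2

def location_vector(geographic_features, population):
    """Feature vector of one location: 18 geographic features plus its population."""
    if len(geographic_features) != N_GEOGRAPHIC_FEATURES:
        raise ValueError("expected 18 geographic features")
    return list(geographic_features) + [population]

def flow_input(x_origin, x_destination, distance_km):
    """Input vector of one origin-destination pair: concat[x_i, x_j, r_ij]."""
    return list(x_origin) + list(x_destination) + [distance_km]

# Made-up values, only meant to show the shapes involved.
x_i = location_vector([0.1] * N_GEOGRAPHIC_FEATURES, population=1500.0)
x_j = location_vector([0.2] * N_GEOGRAPHIC_FEATURES, population=800.0)
x_ij = flow_input(x_i, x_j, distance_km=12.3)
```

Each location vector has 19 entries, so the concatenated input has 19 + 19 + 1 = 39 entries.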

The loss function of Deep Gravity is the cross-entropy:

$H = - \sum_{i} \sum_j \frac{y(l_i, l_j)}{O_i} \ln p_{i,j}$

where $y(l_i, l_j) / O_i$ is the fraction of observed flows from $l_i$ that go to $l_j$ and $p_{i, j}$ is the model's probability of a unit flow from $l_i$ to $l_j$ . Note that the sum over $i$ of the cross-entropies of different origin locations follows from the assumption that flows from different locations are independent events, which allows us to apply the additive property of the cross-entropy for independent random variables.
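The loss can be written out directly from the formula. This is a minimal plain-Python sketch rather than the repository's training code; it takes $O_i$ to be the sum of the observed flows leaving $l_i$ and skips terms with zero observed flow (the usual convention that $0 \ln 0 = 0$ ). The example numbers are made up.

```python
import math

def cross_entropy(observed_flows, probabilities):
    """H = - sum_i sum_j (y_ij / O_i) * ln(p_ij), one row per origin location."""
    loss = 0.0
    for y_row, p_row in zip(observed_flows, probabilities):
        outflow = sum(y_row)  # O_i: total observed outflow of this origin
        for y, p in zip(y_row, p_row):
            if y > 0:  # terms with no observed flow contribute nothing
                loss -= (y / outflow) * math.log(p)
    return loss

# One origin with two destinations: 30 and 10 observed trips.
observed = [[30, 10]]
loss_matching = cross_entropy(observed, [[0.75, 0.25]])  # model matches the fractions
loss_uniform = cross_entropy(observed, [[0.5, 0.5]])     # model ignores the data
```

The loss is smallest when the predicted probabilities equal the observed fractions, which is why `loss_matching` is lower than `loss_uniform`.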

The network is trained for 20 epochs with the RMSprop optimizer (momentum 0.9, learning rate $5 \cdot 10^{-6}$ ) using batches of 64 origin locations. To reduce the training time, we use negative sampling and consider up to 512 randomly selected destinations for each origin location.
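The batching and destination sampling can be sketched as follows. This is an assumption-laden illustration, not the repository's code: destinations are drawn uniformly at random without replacement, and the text does not say whether destinations with observed flows are always kept, so the sketch makes no such guarantee. Function names are hypothetical.

```python
import random

def sample_destinations(locations, max_destinations=512, rng=None):
    """Return at most max_destinations candidate destinations for one origin."""
    rng = rng or random.Random()
    locations = list(locations)
    if len(locations) <= max_destinations:
        return locations  # small regions: keep every location
    return rng.sample(locations, max_destinations)  # uniform, without replacement

def batches(origins, batch_size=64):
    """Yield consecutive groups of origin locations of size batch_size."""
    for start in range(0, len(origins), batch_size):
        yield origins[start:start + batch_size]

rng = random.Random(0)
location_ids = list(range(1000))  # hypothetical location identifiers
destinations = sample_destinations(location_ids, rng=rng)
origin_batches = list(batches(location_ids))
```

Capping the number of destinations bounds the cost per origin at 512 forward passes instead of one per location in the region.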

Running Deep Gravity

Setup

Make sure you have the following dependencies installed:

pytorch 1.7.1
numpy 1.19.2
pandas 1.2.4
geopandas 0.9.0
scikit-mobility 1.1.0
area

Experiments

Once you have installed all the packages correctly, you can run the experiments.

We expect to find the datasets in a path named data/<country_name>, where <country_name> is a parameter that can be passed to the model. In particular, we expect to find:

tessellation.geojson or tessellation.shp. The tessellation can also be generated by using the parameters tessellation-area and tessellation-size when the model is called.
output_areas.geojson or output_areas.shp. A file containing the location code and the geometry of the output areas. The column containing the location code can be specified using the parameter oa-id-column when calling the model.
flows.csv containing three columns indicating the origin, the destination and the actual flow of people. The column names can be specified with the parameters flow-origin-column, flow-destination-column and flow-flows-column. Due to GitHub policy, the file containing the flows for the running example of New York has to be downloaded from here. The data are derived from the GeoDS COVID-19 project.
features.csv containing at least a column whose name matches oa-id-column and a set of other columns representing the features of the model.
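Before launching a run, it can be useful to check that these files are in place. The helper below is a hypothetical convenience sketch, not part of the repository; it only looks for the file names listed above and demonstrates itself on a temporary directory.

```python
import tempfile
from pathlib import Path

# File names taken from the list above; any one alternative per entry is enough.
EXPECTED_FILES = {
    "tessellation": ["tessellation.geojson", "tessellation.shp"],
    "output areas": ["output_areas.geojson", "output_areas.shp"],
    "flows": ["flows.csv"],
    "features": ["features.csv"],
}

def missing_inputs(data_dir):
    """Return the names of expected inputs that are absent from data_dir."""
    data_dir = Path(data_dir)
    return [
        name
        for name, alternatives in EXPECTED_FILES.items()
        if not any((data_dir / filename).exists() for filename in alternatives)
    ]

# Demonstration on a throwaway directory holding only two of the files.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "flows.csv").touch()
    (Path(tmp) / "features.csv").touch()
    missing = missing_inputs(tmp)
```

A missing tessellation is not necessarily an error, since it can be generated with the tessellation-area and tessellation-size parameters.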

An example dataset for New York is already included in the repository, and the following examples are based on it. Note that when main.py is launched for the first time, a set of additional files is generated in a folder called processed. These files should not be removed.

The model can be run with the following command:

python main.py --dataset new_york --oa-id-column GEOID --flow-origin-column geoid_o --flow-destination-column geoid_d --flow-flows-column pop_flows --epochs 1 --device cpu --mode train

You can also include some parameters related to the model:

batch-size to specify the input batch size for training. Default is 1
test-batch-size to specify the batch size at test time. Default is 1
epochs default is 10
lr that is the learning rate. Default is 5e-6
momentum default is 0.9
seed
device can be cpu or gpu
mode that can be train or test
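For readers who want to see how such options fit together, here is a hypothetical `argparse` sketch that mirrors only the flags and defaults documented above. It is not a copy of main.py, whose actual parser may differ; flags without a documented default are left as `None`.

```python
import argparse

def build_parser():
    """Hypothetical parser mirroring the options documented in this README."""
    parser = argparse.ArgumentParser(description="Deep Gravity options (sketch)")
    # Dataset-related options (no defaults are documented, so none are set here).
    parser.add_argument("--dataset")
    parser.add_argument("--oa-id-column")
    parser.add_argument("--flow-origin-column")
    parser.add_argument("--flow-destination-column")
    parser.add_argument("--flow-flows-column")
    # Model-related options with the documented defaults.
    parser.add_argument("--batch-size", type=int, default=1)
    parser.add_argument("--test-batch-size", type=int, default=1)
    parser.add_argument("--epochs", type=int, default=10)
    parser.add_argument("--lr", type=float, default=5e-6)
    parser.add_argument("--momentum", type=float, default=0.9)
    parser.add_argument("--seed", type=int, default=None)  # default not documented
    parser.add_argument("--device", choices=["cpu", "gpu"])
    parser.add_argument("--mode", choices=["train", "test"])
    return parser

# Parse the example command shown earlier in this section.
command = (
    "--dataset new_york --oa-id-column GEOID --flow-origin-column geoid_o "
    "--flow-destination-column geoid_d --flow-flows-column pop_flows "
    "--epochs 1 --device cpu --mode train"
)
args = build_parser().parse_args(command.split())
```

Options that are not passed on the command line keep their defaults, so in this example `args.lr` stays at 5e-6 while `args.epochs` is overridden to 1.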

There are also some parameters related to the

Once your model is trained, you will find the results of the test phase in a file in the results directory. The file will be named tile2cpc_<model-type>_<country>_<no-round>.csv. In the same folder, you will also find the trained model named model_<model-type>_<country>_<no-round>.pt
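The tile2cpc file name suggests that the results are reported as Common Part of Commuters (CPC), the metric shown in Figure 1. As a reminder of what that metric measures, here is a minimal sketch of the usual CPC definition; the exact columns of the results file are not described here, and the flow values below are made up.

```python
def common_part_of_commuters(generated, observed):
    """CPC = 2 * sum(min(g, o)) / (sum(g) + sum(o)).

    Returns 1.0 when generated and observed flows coincide and 0.0 when
    they have no overlap at all.
    """
    overlap = sum(min(g, o) for g, o in zip(generated, observed))
    total = sum(generated) + sum(observed)
    return 2.0 * overlap / total if total > 0 else 0.0

# Flows from one origin to two destinations (made-up numbers).
cpc_identical = common_part_of_commuters([6, 4], [6, 4])
cpc_partial = common_part_of_commuters([6, 4], [5, 5])
cpc_disjoint = common_part_of_commuters([10, 0], [0, 10])
```

In the partial case the overlap is 5 + 4 = 9 out of 20 total trips, giving a CPC of 0.9.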

Plot of the results

Once you have the results for all four models for at least one country and at least one no-round, you can reproduce Figure 3 and Table 1 of the paper using the notebook plot_results.ipynb

Additional Data

The datasets used in the experiments can be found at:

Data related to POIs should be retrieved from appropriate services. Examples are Overpass API, HOTosm or, as we suggest, a local copy of the OSM database loaded into a PostgreSQL instance and queried appropriately. The query we used to retrieve POI information is available in osm_query.yaml
