This repository contains supply and demand data in zones of the Baltic Sea as a result of historical vessel movements from 2020 based on automatic identification system (AIS) data provided by the Danish Maritime Authority (DMA).
The processed data is located within the data folder of this repository. The folder contains four different files.
- The
supply_in_zone.csv
file contains supply in tons on a weekly basis per zone. The zone column corresponds to the respective zone, the timestamp column to the calendar week and the supply_tons column to amount of supply in metric tons (generalized over all different types of cargo). - The
demand_in_zone.csv
file contains demand in tons on a weekly basis per zone. The zone column corresponds to the respective zone, the timestamp column to the calendar week and the demand_tons column to amount of demand in metric tons (generalized over all different types of cargo). - The
zones.csv
file contains a list of all zones. - The
polygones.pkl
file is a pickle file of a dataframe containing definitions of polygons for the zones found in filezones.csv
.
The supply and demand data is derived from AIS signals provided by the Danish Maritime Authority. Since the AIS signals, besides some specifications about the vessel itself, only contain information about the current geo-position of same, further preprocessing steps are necessary to calculate the supply and demand in the different zones of the Baltic Sea. The following parameters, given in AIS signals, are used for the calculations: timestamp
, latitude
, longitude
, imo
, mmsi
, draught
, destination
. Below screenshot shows the zones, in which we geographically have split up the Baltic Sea. To get the polygon definition, please load the polygones.pkl
in Python using pickle.
To calculate supply and demand in zones based on AIS, following steps have been performed:
- Get all AIS signals related to one vessel. The
imo
ormmsi
is a unique identifier for a vessel and can be used to consider every vessel one by one. - The time wise ordered AIS data (use
timestamp
), more precisely, the geo-position based onlatitude
andlongitude
illustrate the route a vessel is taking. - To derive supply and demand in zones, the zone in which the vessel is loading (supply zone) and discharging (demand zone) have to be determined. To do so, the vessels
draft
anddestination
give an indication. The point in time, i.e., the AIS signal, in which thedestination
changed compared to the previous signal, denotes the start of a new voyage. Further, if the vesselsdraft
has increased, the vessel was loading cargo, thus, the vessel is in a supply zone and travels towards a demand zone. - Using a vessel database, e.g. Vesselfinder (see below), we determine the vessels maximum capacity in metric tons, its so called deadweight (dwt).
- Since the dwt denotes the capacity of a vessel in metric tons, we take the dwt of the vessel as the amount of cargo, the vessel was loading. Note, it may not reflect one hundred percent of reality.
- Performing such steps for each vessel, yields voyages between supply and demand zones including the amount of cargo transported.
- Aggregating the data on a weekly and zone basis, gives data as provided in the
supply_in_zones.csv
anddemand_in_zones.csv
file.
- Raw AIS Data: https://www.dma.dk/SikkerhedTilSoes/Sejladsinformation/AIS/
- Vessel Database: https://www.vesselfinder.com/de