# Data Report: Correlation between the share of electric vehicles among all registrations and GDP

This report uses open data from Mobilithek (https://mobilithek.info/offers/573357280039153664) on vehicle registrations (VR) in Germany, provided by the "Kraftfahrtbundesamt", and combines it with data from the "Statistikportal" (https://www.statistikportal.de/de/veroeffentlichungen/bruttoinlandsprodukt-bruttowertschoepfung) on the GDP of all German federal states.  

The question that interests us is: Does the share of electric vehicles (EVs) among all registrations correlate with GDP?

## Install dependencies
First, install the required dependencies listed in `requirements.txt`.

In [37]:
%%capture
%pip install -r requirements.txt

## Load data
Create two pandas dataframes from the local SQLite file.

In [35]:
import pandas as pd

# the full dataframe contains the unaggregated data for every month and every state
full_df = pd.read_sql_table('vr_2023', 'sqlite:///data/clean/evs_per_capita.sqlite')
# the aggregated dataframe combines the data of all months
aggr_df = pd.read_sql_table('evs_per_capita', 'sqlite:///data/clean/evs_per_capita.sqlite')

## Is there a correlation between share of EVs and GDP?
To answer our initial question, we use plotly to draw a scatter plot of all federal states, with GDP on the x-axis and the share of EVs among all vehicle registrations on the y-axis.  

The states are colored by their total number of vehicle registrations, which additionally shows where the most vehicles are registered.  

To better visualize a potential correlation between the share of EVs and GDP, a trend line is computed by linear regression using the "ordinary least squares" method.
The trend line is displayed as an orange straight line.
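
As a rough illustration of what an ordinary least squares fit with a single predictor computes, the following sketch derives the slope and intercept of the trend line in plain Python. The helper `ols_fit` and the sample values are hypothetical and are not taken from the dataset; plotly performs the actual fit itself when `trendline='ols'` is set.

```python
def ols_fit(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = covariance(x, y) / variance(x); the 1/n factors cancel out
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # the fitted line always passes through the point (mean_x, mean_y)
    intercept = mean_y - slope * mean_x
    return slope, intercept


# hypothetical values for illustration only (not real GDP or EV data)
gdp = [30000, 40000, 50000, 60000]
share = [10.0, 12.0, 14.0, 16.0]

slope, intercept = ols_fit(gdp, share)
print(f"slope={slope:.6f}, intercept={intercept:.2f}")
```

A positive slope means the fitted share of EVs rises with GDP; the slope alone says nothing about how closely the points follow the line.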

In [36]:
import plotly.express as px

fig = px.scatter(aggr_df,
                 x='gdp_per_capita',
                 y='share_electric',
                 color='total',
                 hover_data=['federal_state'],
                 trendline='ols',
                 title="Correlation between share of EVs and GDP",
                 labels={
                     'share_electric': "Share of EVs (in %)",
                     'gdp_per_capita': "GDP per capita (in EUR)",
                     'total': "Total # of VRs",
                     'federal_state': 'State'
                 }, width=700)


fig.show()


## Conclusion

The trend line shows a positive correlation between the share of EVs among all VRs and a state's GDP. States with a higher GDP therefore tend to have a higher share of EVs among their vehicle registrations. One possible explanation for this could be the higher prices of EVs.
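
A trend line indicates the direction of a relationship but not its strength. One way to quantify the strength is the Pearson correlation coefficient, which pandas computes with `Series.corr`. The sketch below uses a small hypothetical dataframe `demo_df` that only mimics the column names of `aggr_df`; its values are made up for illustration.

```python
import pandas as pd

# hypothetical stand-in for aggr_df; the values are invented, not real data
demo_df = pd.DataFrame({
    'gdp_per_capita': [30000, 40000, 50000, 60000],
    'share_electric': [10.0, 13.0, 12.0, 16.0],
})

# Series.corr uses the Pearson method by default; the result lies in [-1, 1]
r = demo_df['gdp_per_capita'].corr(demo_df['share_electric'])
print(f"Pearson r = {r:.3f}")
```

Applied to the real data, the same call would be `aggr_df['gdp_per_capita'].corr(aggr_df['share_electric'])`. With only 16 federal states the sample is small, so the coefficient should be interpreted with caution.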