# Geospatial Map of Germany

This Jupyter Notebook demonstrates a possible visualization (a geospatial choropleth map) of standardized inpatient treatment costs for a selected ICD-10 diagnosis across the German federal states ("Bundesländer").

A similar map at other administrative levels (e.g. German districts) can be made by changing a few lines of the code.

I identified the following necessary tasks:

1. find and load an adequate GeoFile (preferably a shapefile, .shp) --> I used an official shapefile from the German Federal Agency for Cartography and Geodesy (BKG)

2. create random but reproducible data to stand in for standardized inpatient treatment costs (e.g. costs per inhabitant)
--> data on inpatient treatment costs in Germany can be requested from "Destatis", the Federal Statistical Office of Germany ("DRG-Statistik"). However, these data are not free of charge. Therefore, I decided to generate my own random data.

3. create a pleasant and informative graph with Matplotlib (other good options would have been Plotly or Seaborn)
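
Step 2 above (random but reproducible data) can be sketched as follows. This is only a minimal illustration, not necessarily the approach used later in this notebook: the function name `make_fake_costs`, the column names, the seed and the value range of 50 to 150 are all assumptions chosen for the example.

```python
import numpy as np
import pandas as pd

# The 16 German federal states ("Bundesländer")
STATES = [
    "Baden-Württemberg", "Bayern", "Berlin", "Brandenburg", "Bremen",
    "Hamburg", "Hessen", "Mecklenburg-Vorpommern", "Niedersachsen",
    "Nordrhein-Westfalen", "Rheinland-Pfalz", "Saarland", "Sachsen",
    "Sachsen-Anhalt", "Schleswig-Holstein", "Thüringen",
]


def make_fake_costs(seed=42, low=50.0, high=150.0):
    """Return a DataFrame with one simulated cost value per federal state.

    The seed, range and column names are hypothetical choices for this sketch.
    """
    # A fixed seed makes the generator produce the same values on every run
    rng = np.random.default_rng(seed)
    costs = rng.uniform(low, high, size=len(STATES))
    return pd.DataFrame(
        {"state": STATES, "cost_per_inhabitant": costs.round(2)}
    )


fake_costs = make_fake_costs()
print(fake_costs.head())
```

Calling `make_fake_costs()` twice with the same seed yields identical tables, which is what makes the resulting map reproducible.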


## Setup:

In [None]:
# import necessary modules
import pandas as pd
import numpy as np
import geopandas as gpd
import matplotlib.pyplot as plt

__1.) find and load an adequate GeoFile__

First, an adequate GeoFile had to be found and downloaded. I decided to look for official GeoFiles and found the ones provided by the German Federal Agency for Cartography and Geodesy (BKG). GeoFiles for different administrative levels and at different scales are available at the following link:

https://gdz.bkg.bund.de/index.php/default/digitale-geodaten/verwaltungsgebiete.html

I used the data at the 1:2 500 000 scale, which can be found at this link:

https://gdz.bkg.bund.de/index.php/default/digitale-geodaten/verwaltungsgebiete/verwaltungsgebiete-1-2-500-000-stand-31-12-vg2500-12-31.html

Furthermore, I chose the GeoFile with the coordinate reference system (CRS) "3-degree Gauss-Kruger zone 3", which corresponds to EPSG:31467 (for an introduction to coordinate reference systems and the use of geospatial data in Python, see the corresponding course at https://kaggle.com/learn/geospatial-analysis).
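
To illustrate what the CRS choice means in practice, here is a small sketch that builds a one-row GeoDataFrame in EPSG:31467 and reprojects it to geographic coordinates (EPSG:4326). The point coordinates and the column name are made up for the example and are not taken from the BKG file.

```python
import geopandas as gpd
from shapely.geometry import Point

# Hypothetical point in Gauss-Kruger zone 3 coordinates
# (easting, northing in metres); not part of the BKG data
gdf = gpd.GeoDataFrame(
    {"name": ["example point"]},
    geometry=[Point(3500000, 5700000)],
    crs="EPSG:31467",
)

# Inspect the EPSG code attached to the GeoDataFrame
print(gdf.crs.to_epsg())

# Reproject to WGS 84 longitude/latitude, e.g. for overlaying on web maps
gdf_wgs84 = gdf.to_crs(epsg=4326)
print(gdf_wgs84.geometry.iloc[0])
```

The same `.crs` attribute and `.to_crs()` method are available on any GeoDataFrame loaded with GeoPandas, so the CRS of a downloaded file can be checked in the same way.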

Last but not least, I decided to use the file for the administrative level of the German federal states ("Bundesländer"), which is marked by the abbreviation "LAN".

In [None]:
# read in GeoFile
