---
title: Research on candidate countries for a geoscore
date: now
author: Jan Cap
---

Before starting my diploma thesis, I need to choose a country for which I will create a geoscore. The natural choices are Slovakia, Germany, Poland, or Austria, as they are the countries neighboring Czechia. The chosen country should not have a geoscore yet, because that would make the thesis redundant. It should also have a dataset of geographical locations (such as buildings, or at least voting districts), data that could serve as the target (insolvency rate, unemployment rate, etc.), and data that could serve as features (voting results, population data, crime rate, etc.).
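
To make the three data requirements concrete, here is a minimal sketch of how locations, target data, and features could be combined into one modelling table. All column names (`district_id`, `population`, `insolvencies`, `turnout`) and all values are hypothetical toy data, not taken from any of the sources listed below.

```python
import pandas as pd

# Hypothetical toy data: one row per geographic unit (e.g. a voting district).
locations = pd.DataFrame(
    {"district_id": ["A", "B", "C"], "population": [1200, 800, 1500]}
)
target_raw = pd.DataFrame(
    {"district_id": ["A", "B", "C"], "insolvencies": [12, 4, 30]}
)
features = pd.DataFrame(
    {"district_id": ["A", "B", "C"], "turnout": [0.61, 0.72, 0.48]}
)

# Join everything on the shared district key; validate= raises an error
# if a key is duplicated on either side of a merge.
dataset = (
    locations
    .merge(target_raw, on="district_id", how="left", validate="one_to_one")
    .merge(features, on="district_id", how="left", validate="one_to_one")
)

# Example target: insolvencies per 1,000 inhabitants.
dataset["insolvency_rate"] = dataset["insolvencies"] * 1000 / dataset["population"]
```

The key point is that every source must be expressible at the same geographic level, otherwise the join on `district_id` is not possible without aggregation.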

## Slovakia

### Geoscore Existence



I wasn't able to find any geoscore for Slovakia.

### Geographical Locations

According to [this](https://e-justice.europa.eu/topics/registers-business-insolvency-land/land-registers-eu-countries/sk_cs) page, Slovakia has a land register.
[This](https://kataster.skgeodesy.sk/eskn-portal/registre) page provides code lists for the land register (the finest granularity is the city level).


### Data for Target Variable

Slovakia has a counterpart to the Czech ISIR called [Register úpadcov](https://replik.justice.sk/ru-verejnost-web/). The system has an API, which is described on [this](https://www.justice.gov.sk/sluzby/register-predinsolvencnych-likvidacnych-a-insolvencnych-konani/prirucky-a-manualy-k-is-replik/) page.

### Data for Features

Voting data are available on [this](https://slovak.statistics.sk/) page of the Statistical Office of the Slovak Republic. The data are granular down to the voting district level.

- Presidential elections 2024: [data](https://volby.statistics.sk/prez/prez2024/en/subory_na_stiahnutie.html)
- Parliamentary elections 2024: [data](https://volbysr.sk/en/subory_na_stiahnutie.html)

```python
import pandas as pd

pd.read_excel("../data/sk/SLO2021II-III-IV.xlsx", sheet_name="KAP_IV_ZÚJ_ČO_ZSJ")
```

## Germany

A list of official databases can be found [here](https://www.destatis.de/DE/Home/_neue_startseite/_documents/_daten-und-analyse/_datenbbank-box.html).

GENESIS-Online also has an API: https://www.destatis.de/EN/Service/OpenData/api-webservice.html

Geodata portal: https://www.destatis.de/EN/Service/OpenData/maps-geodata.html

### Geoscore Existence

I found [this](https://www.schufa.de/en/newsroom/creditworthiness/geoscoring-place-residence-affect-credit-rating/) article from SCHUFA, which already provides a geoscore for Germany. It looks like they sell it to banks and other institutions.

*To calculate the individual Regioscore, SCHUFA analyzed all homes in Germany and assigned them to one of 5.4 million so-called Regioclusters. A regiocluster combines the houses in the immediate neighborhood and must consist of at least six households and six people. To calculate a regioscore, the system checks which cluster the person in question belongs to.*

*We also do not use any additional geographical data such as the unemployment rate in a municipality, average income or the number of ongoing insolvency proceedings in the municipality.*

They calculate the score based only on the people living in the same neighborhood, not on income, unemployment rate, insolvency rate, etc. Our approach to the geoscore will be different.

### Geographical Locations

The administrative structure of Germany is visualized on [this](https://learn.opengeoedu.de/en/opendata/vorlesung/open-government-data/verwaltungsdaten-in-dach-und-eu/adm_de) page.

- Bund - federal level
- Bundesländer (Länder) - federal states (16 in total)
- Regierungsbezirke - governmental district level (only in some federal states)
- Kreise - county level
- Gemeinden - municipality level
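
This hierarchy is also encoded in the official municipality key (Amtlicher Gemeindeschlüssel, AGS), an 8-digit code in which digits 1-2 identify the Land, digit 3 the Regierungsbezirk (0 where none exists), digits 4-5 the Kreis, and digits 6-8 the Gemeinde. The sketch below assumes that the datasets used later carry such a key. The helper name `split_ags` is my own and not part of any library.

```python
from typing import NamedTuple


class AgsParts(NamedTuple):
    """Cumulative key prefixes, usable as join keys at each level."""
    land: str              # 2 digits
    regierungsbezirk: str  # 3 digits (Land + Regierungsbezirk)
    kreis: str             # 5 digits (… + Kreis)
    gemeinde: str          # full 8-digit AGS


def split_ags(ags: str) -> AgsParts:
    """Split an 8-digit AGS string into the keys of its parent levels."""
    if len(ags) != 8 or not (ags.isascii() and ags.isdigit()):
        raise ValueError(f"expected an 8-digit AGS, got {ags!r}")
    return AgsParts(
        land=ags[:2],
        regierungsbezirk=ags[:3],
        kreis=ags[:5],
        gemeinde=ags,
    )


# Munich: Land 09 (Bayern), Kreis 09162 (kreisfreie Stadt München)
munich = split_ags("09162000")
```

Keeping the key as a string matters, because leading zeros (as in `09`) would be lost if it were read as an integer.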


Data on administrative districts are available on [this](https://www-genesis.destatis.de/datenbank/online/statistic/11111/details) page, with granularity at the administrative district level.

### Data for Target Variable

Insolvency portal: https://neu.insolvenzbekanntmachungen.de/ap/

### Data for Features

Germany's official data portal: https://www.govdata.de/suche/daten/bundestagswahl-2025-stadtergebnis

Various data at the municipality level can be found on [this](https://www.destatis.de/DE/Themen/Laender-Regionen/Regionales/_inhalt.html) page.


#### Election data
2025 German federal election
- https://www.bundeswahlleiterin.de/en/bundestagswahlen/2025/ergebnisse/weitere-ergebnisse.html
- granularity at the voting district level

2021 German federal election
- https://www.bundeswahlleiterin.de/en/bundestagswahlen/2021/ergebnisse/weitere-ergebnisse.html
- granularity at the voting district level

2024 European Parliament election
- https://www.bundeswahlleiterin.de/en/europawahlen/2024/ergebnisse/weitere-ergebnisse.html
- granularity at the ballot box district level

The voting data are of good quality and fine granularity.
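
As a rough illustration of how such election results could be turned into features, the sketch below converts raw vote counts into per-district party shares. The long layout (one row per district and party) and the column names `district_id`, `party`, and `votes` are assumptions for the example. The actual files from the Bundeswahlleiterin may be structured differently and would need reshaping first.

```python
import pandas as pd

# Hypothetical long-format results: one row per (district, party).
votes = pd.DataFrame(
    {
        "district_id": ["D1", "D1", "D1", "D2", "D2", "D2"],
        "party": ["X", "Y", "Z", "X", "Y", "Z"],
        "votes": [50, 30, 20, 10, 60, 30],
    }
)

# Wide table of raw counts: one row per district, one column per party.
counts = votes.pivot_table(
    index="district_id",
    columns="party",
    values="votes",
    aggfunc="sum",
    fill_value=0,
)

# Divide each row by its total so the shares of a district sum to 1.
shares = counts.div(counts.sum(axis=1), axis=0).add_prefix("share_")
```

Using shares instead of raw counts makes districts of different sizes comparable, which is what a model at district level would need.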

#### Demographic data
https://www.destatis.de/EN/Themes/Society-Environment/Population/Current-Population/_node.html#sprg478526

All data about population, households, families, education, buildings etc. can be found here: https://ergebnisse.zensus2022.de/datenbank/online
