Skip to content

nate-downer/foreign-country-viewer

Repository files navigation

foreign-country-viewer

Repository for deploying the "How Foreign will New Countries Feel?" viewer app. Exploratory data analysis resources have been moved to their own sub-folder.

Because python notebooks contain visualizations of the world map generated with the Folium library, they may be too graphically intense for GitHub to render an output for all notebooks.

The App Itself:

The app is a Jupyter Notebook written in python. It utilizes Mercury to build the UI and Heroku to deploy it as a web app.

It can be viewed here!

Limitations:

It is inherently difficult to quantify how foreign a country will "feel", nevertheless that is this project's lofty goal. To try and achieve this, users are able to customize the results by adjusting the relative weights of six different "axis" (Demographics, Health, Climate, Economy, Language, and Religion). This way the algorithm can put a greater or lesser emphasis on each of these factors depending on how much the user believes they contribute to the "feel" of a place for them. For a granular view of what data is used to build the groups for each axis, check out the data sources.

The algorithm works by using different clustering algorithms to group countries that share similarities. Countries can be similar in many ways, and there are varying degrees to which countries can be similar, so this analysis uses 36 different clustering algorithms that group countries in 36 different ways. The exact way in which each of these clusters works was the subject of a great deal of tuning. To compute how similar any two countries are, all the viewer app has to do is find how many times a given country appears in the same group as another.

Each clustering algorithm only sees data relating to one of the six axis. This allows the relative weights of each axis to be factored into the algorithm after all of the computationally intensive clustering has been done. This, in turn, greatly increases the performance of the wed app (which is already very slow due to the time it takes to render the map). For each of the six axis, six sets of groups are generated, which is why there is a total of 36 sets of groups.

Data Sources:

Yes... the data was all scrubbed from Wikipedia. It may not be the most reliable, but of all the different data sources I could find, it did have the most consistent naming conventions, and when you are trying to aggregate data about this many different kinds of factors, that ends up being the most important thing. Below are links to the specific data sources.

Demographics Data

Health Data

Climate Data

Economy Data

Language Data

Religion Data

About

Repository for deploying the "How Foreign will New Countries Feel?" viewer app.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published