OWID_WRL calculations #127

Closed
stark71 opened this issue Aug 19, 2020 · 1 comment

stark71 commented Aug 19, 2020

CSV and JSON from 2020-08-19:
When I try to calculate OWID_WRL myself, I can't reproduce your figures.

Date: 2020-08-18

total_cases:

  • Your calculation=21_901_632.
  • When I sum total_cases over all the countries (HK is missing, by the way), I get 21_900_936.
  • 696 are missing!

population: 44_315_622 missing!

Regarding the *_per_million fields (total_deaths_per_million, new_deaths_per_million, etc.):
How do you calculate them for OWID_WRL? Is it one of these?

  • Averaging by number of countries
  • sum_for_each_country(country_value * (country_population / 1_000_000) )/ cardinal(countries)

What about demographic data (median age, population density, GDP per capita, etc.)?

Could you document your calculations?

Thank you for your work.

Collaborator

edomt commented Aug 20, 2020

Hi @stark71

The reasons for those discrepancies are different, so I'll reply for each.

696 missing for total cases

These 696 cases belong to the "International" group, i.e. they're cases that weren't attributed to a particular country. I've fixed our code so that these totals are now carried forward from the last date they were counted (March 10), which should solve the issue.

Note that Hong Kong is included by the European CDC as part of the figures for China.
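
To make the fix concrete, here is a minimal sketch of carrying a total forward across dates where it is no longer reported. It is not the actual OWID code: the function name `carry_forward` and the shape of the `international` series are assumptions for illustration. Only the figures 696, 21_900_936 and 21_901_632 come from this thread.

```python
from typing import List, Optional


def carry_forward(values: List[Optional[int]]) -> List[Optional[int]]:
    """Replace each missing value with the most recent known value before it."""
    filled: List[Optional[int]] = []
    last: Optional[int] = None
    for value in values:
        if value is not None:
            last = value
        filled.append(last)
    return filled


# Hypothetical "International" series: one reported total, then missing dates.
international = carry_forward([696, None, None])

# Sum of total_cases over all countries on 2020-08-18, as reported in the issue.
countries_total = 21_900_936

# World total = countries + carried-forward "International" cases.
world_total = countries_total + international[-1]
print(world_total)  # 21901632
```

With the "International" total carried forward, the sum matches the published OWID_WRL value of 21_901_632.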

World population

The total world population isn't calculated but rather taken directly from the UN estimate available here: https://github.com/owid/covid-19-data/blob/master/scripts/input/un/population_2020.csv
The difference of 44 million arises because not every country in the world has reported COVID-19 case and death figures, so the populations of the countries in the dataset add up to less than the UN world total.
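
A small sketch of where such a gap comes from, using made-up numbers (the names `population_gap`, `reporting` and all figures below are hypothetical, not taken from the UN file):

```python
from typing import Dict


def population_gap(world_population: int, reporting: Dict[str, int]) -> int:
    """World estimate minus the summed populations of countries present in the data."""
    return world_population - sum(reporting.values())


# Hypothetical figures: the world estimate also covers a country that reports nothing.
reporting = {"A": 600, "B": 350}
gap = population_gap(1_000, reporting)
print(gap)  # 50
```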

World figures per capita

These are calculated by adding up the metric (e.g. total_deaths) across all countries, then dividing the result by the world population (UN estimate).
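
As a rough sketch of that rule (the function name `world_per_million` and the example numbers are assumptions for illustration, not the project's code): the world value is a ratio of totals, not an average of the per-country rates.

```python
from typing import Dict, Optional


def world_per_million(country_values: Dict[str, Optional[float]],
                      world_population: int) -> float:
    """Sum a metric over all countries, then scale by the world population."""
    total = sum(v for v in country_values.values() if v is not None)
    return total * 1_000_000 / world_population


# Hypothetical figures: two reporting countries, world population of 4 million.
total_deaths = {"A": 100, "B": 300}
rate = world_per_million(total_deaths, 4_000_000)
print(rate)  # 100.0
```

Because the denominator is the UN world estimate, countries with no reported figures still count toward the population, which slightly lowers the world rate compared with dividing by the reporting countries alone.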

Demographic data

The demographic data is imported from various sources. See here for the complete list in our codebook: https://github.com/owid/covid-19-data/blob/master/public/data/owid-covid-codebook.csv

Edouard

edomt closed this as completed Aug 20, 2020