source: https://www.abc.net.au/news/2020-08-05/donald-trump-axios-interview-jonathan-swan/12524552
# ABC article: Trump and Swan talk about Covid19 deaths
This notebook was inspired partly by an Australian ABC article and mostly by a family member who posted the article to a family WhatsApp group chat. The interview sparked my curiosity, not because I am a Trump supporter (quite the contrary), but because the interviewer did not make much sense to me from a statistical viewpoint when he says **"death as a proportion of population"**. The questions I initially had were:
- do we really need to know the number of people tested for Covid19, as in Trump's perspective?
    - if so, how do we use this statistic to influence a perspective?
- given the overall population size of two countries and a death rate from covid19, can we make assumptions about the death toll of covid19 for a given country?
    - if so, how do we interpret deaths?
        - do we consider all deaths (by all causes, including covid19) relative to the total overall population?
        - do we consider the testing rate for a given country?
            - **this is interesting!** When we say *death by covid19*, we infer that the person first tested positive for covid19 and, on that basis, we conclude that this person died of covid19
                - another interesting avenue to consider here would be to look at cases of comorbidity among those counted in the statistic of *death by covid19*. Comorbidity in this case means the co-occurrence of other existing medical conditions in addition to covid19.
    - if so, how do we influence an audience to consider well-known political issues that exist today? Such as healthcare access in the USA, political structures for each country, etc.

### Analysis
From [this ABC article](https://www.abc.net.au/news/2020-08-05/donald-trump-axios-interview-jonathan-swan/12524552), we can distinguish two perspectives from a statistical viewpoint:
- death as a proportion of cases (Trump)
- death as a proportion of population (interviewer)

In [1]:
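# define a function to handle statistical output
def stats(country, pop_all, pop_covid19, covid19_deaths):
    """Print covid19 deaths as a share of each denominator in turn:
    first the total population, then the confirmed covid19 cases."""
    print(f'stats printed for {country}')
    # the same labels are printed on both passes; on the second pass
    # "total pop" refers to the population of confirmed cases
    for p in [pop_all, pop_covid19]:
        print(f'\n\n{p}\tstart with a total pop')
        res = covid19_deaths/p
        print(f'{res}\tdivide num of covid19 deaths by total pop')
        print("%.02f%%\tmake into percentage" % (res * 100))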

In [2]:
# get stats for south korea
# population size based on World Bank population API
# remaining arguments: confirmed covid19 cases, covid19 deaths
stats('south korea', 51709098, 14499, 302)

stats printed for south korea


51709098	start with a total pop
5.840364881243916e-06	divide num of covid19 deaths by total pop
0.00%	make into percentage


14499	start with a total pop
0.020829022691220084	divide num of covid19 deaths by total pop
2.08%	make into percentage


In [3]:
# get stats for usa
# population size based on World Bank population API
# remaining arguments: confirmed covid19 cases, covid19 deaths
stats('usa', 328239523, 5729795, 515681)

stats printed for usa


328239523	start with a total pop
0.0015710509060177984	divide num of covid19 deaths by total pop
0.16%	make into percentage


5729795	start with a total pop
0.08999990401052743	divide num of covid19 deaths by total pop
9.00%	make into percentage
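
With two decimal places, South Korea's population-level figure prints as 0.00%, which hides the comparison. A minimal sketch of the same two perspectives on friendlier scales is given here, using the exact inputs passed to `stats` above. The helper names `case_fatality_pct` and `deaths_per_100k` are hypothetical and not part of the original notebook.

```python
# Hypothetical helpers; the inputs are the same figures passed to stats() above.
def case_fatality_pct(deaths, cases):
    """Deaths as a percentage of confirmed cases (Trump's perspective)."""
    return 100 * deaths / cases


def deaths_per_100k(deaths, population):
    """Deaths per 100,000 residents (the interviewer's perspective)."""
    return 100_000 * deaths / population


figures = {
    # country: (total population, confirmed cases, covid19 deaths)
    "south korea": (51709098, 14499, 302),
    "usa": (328239523, 5729795, 515681),
}

for country, (population, cases, deaths) in figures.items():
    print(
        f"{country}: {case_fatality_pct(deaths, cases):.2f}% of cases, "
        f"{deaths_per_100k(deaths, population):.2f} deaths per 100,000 people"
    )
```

Scaling by 100,000 instead of 100 changes nothing about the underlying ratio; it only keeps small proportions from rounding to zero.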


Let's say we are interested in the total overall population, as we only want to know who died as a result of covid19. To get a covid19 death count, we assume those persons first tested positive for covid19. This raises an interesting question: if a person never tested positive for covid19, then that person's death would not count as a covid19 death, right? This brings us to the next question: should we consider the rate of testing in each country? Consider the case where $x$ people who died were never tested for covid19. Those $x$ deaths would be recorded as non-covid19 deaths, i.e., they would not be included in our death count.
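
The undercounting argument above can be sketched in a few lines. The numbers below are hypothetical and chosen only to show the direction of the effect; `counted_rate` is an illustrative helper, not something from the original notebook.

```python
def counted_rate(true_deaths, untested_deaths, population):
    """Share of the population recorded as covid19 deaths when
    `untested_deaths` of the true covid19 deaths were never tested."""
    if not 0 <= untested_deaths <= true_deaths:
        raise ValueError("untested_deaths must be between 0 and true_deaths")
    return (true_deaths - untested_deaths) / population


# Hypothetical numbers, for illustration only.
population = 1_000_000
true_deaths = 500

for x in (0, 100, 250):
    rate = counted_rate(true_deaths, x, population)
    print(f"x={x}: {100_000 * rate:.1f} recorded deaths per 100,000")
```

The larger $x$ is, the lower the recorded death rate, even though the true number of deaths is unchanged. This is why the testing rate matters when comparing countries.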

Let's look at the number of individuals tested for covid19 as a share of each country's total overall population.

In [4]:
# define a function to handle rate of testing percentage
def rate_of_testing(country, total_tested, pop_all):
    print(f"{100 * total_tested / pop_all:.2f}% rate of testing in {country}")

In [5]:
# http://ncov.mohw.go.kr/en/ - total tested: 1,606,487 
rate_of_testing('south korea', 1606487, 51600000)

3.11% rate of testing in south korea


In [6]:
# source: https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/testing-in-us.html - total tested: 63,537,614
rate_of_testing('usa', 63537614, 328200000)

19.36% rate of testing in usa
