# map_rsv.py

map_rsv.py plots the global distribution of all available RSV sequences according to the collection location of each sequence. It looks in rsview/data/ for the output files of seq_download.py. If these files are not present, it raises an error telling the user to run seq_download.py before map_rsv.py.
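
The check described above can be sketched as follows. This is a minimal illustration, not the code in map_rsv.py: the function name `find_downloaded_files` and the `*.csv` pattern are assumptions, and the real output file names of seq_download.py may differ.

```python
from pathlib import Path


def find_downloaded_files(datadir, pattern="*.csv"):
    """Return files in datadir that match pattern, or raise if there are none.

    Both the function name and the "*.csv" pattern are placeholders chosen
    for this sketch; they are not taken from rsview itself.
    """
    files = sorted(Path(datadir).glob(pattern))
    if not files:
        raise FileNotFoundError(
            f"No files matching {pattern!r} found in {datadir}. "
            "Run seq_download.py before map_rsv.py."
        )
    return files
```

Failing early with a message that names the missing step is friendlier than letting a later file read fail with a bare path error.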

map_rsv.py aggregates all downloaded sequences by their collection date and location (at country resolution) and displays these counts on a global map. The user must specify whether map_rsv.py should display viral sequences grouped by genotype or subtype. Optional arguments allow the user to choose a temporal range to display and whether genotypes should be further grouped into clades.
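
As a rough illustration of this aggregation step, the sketch below counts sequences per country and subtype within a year range. The record layout, the function name `count_by_country`, and the sample rows are all made up for the example; they are not the data structures map_rsv.py uses.

```python
from collections import Counter

# Made-up records for illustration: (country, collection year, subtype)
records = [
    ("Kenya", 2005, "A"),
    ("Kenya", 2005, "B"),
    ("Kenya", 2010, "A"),
    ("USA", 1995, "A"),
    ("USA", 2012, "B"),
]


def count_by_country(records, years=(1990, 2018)):
    """Count sequences per (country, subtype) within an inclusive year range.

    Pass years=None to skip the date filter entirely.
    """
    counts = Counter()
    for country, year, subtype in records:
        if years is not None and not (years[0] <= year <= years[1]):
            continue
        counts[(country, subtype)] += 1
    return counts


counts = count_by_country(records)
```

Grouping by genotype instead of subtype would only change which field is used as the second element of the key.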

#### For help specifying arguments, use "-h"

In [1]:
! map_rsv.py -h

usage: map_rsv.py [-h] [--genotype-level {collapse,all}] [--years YEARS]
                  {subtype,genotype} datadir

Plot global distribution of RSV

positional arguments:
  {subtype,genotype}    Specify whether the subtype or genotype of RSV
                        sequences should be plotted
  datadir               Specify the directory that contains seq_download.py
                        output

optional arguments:
  -h, --help            show this help message and exit
  --genotype-level {collapse,all}
                        Specify whether to plot all genotypes of RSV or
                        collapse them into major clades
  --years YEARS         Specify a range of years to plot. Example: [1990,
                        2018]. If 'all'is specified, all years for which there
                        are data points will be plotted


#### To map available RSV sequences grouped by subtype (A vs. B) from the data in the directory `data`

In [2]:
! map_rsv.py subtype ../rsview/data

#### To map available RSV sequences by genotype group. Note: this is equivalent to specifying the optional argument `--genotype-level collapse`

In [4]:
! map_rsv.py genotype ../rsview/data

#### To map available RSV sequences by genotype

In [5]:
! map_rsv.py genotype ../rsview/data --genotype-level all 

#### To map available RSV sequences that were collected during a specific time range. Note: the default range is [1990,2018]


In [6]:
! map_rsv.py subtype ../rsview/data --years [1960,2000]

Traceback (most recent call last):
  File "/Users/jilliane/miniconda3/envs/rsview/bin/map_rsv.py", line 4, in <module>
    __import__('pkg_resources').run_script('rsview==0.dev1', 'map_rsv.py')
  File "/Users/jilliane/miniconda3/envs/rsview/lib/python3.6/site-packages/pkg_resources/__init__.py", line 664, in run_script
    self.require(requires)[0].run_script(script_name, ns)
  File "/Users/jilliane/miniconda3/envs/rsview/lib/python3.6/site-packages/pkg_resources/__init__.py", line 1451, in run_script
    exec(script_code, namespace, namespace)
  File "/Users/jilliane/miniconda3/envs/rsview/lib/python3.6/site-packages/rsview-0.dev1-py3.6.egg/EGG-INFO/scripts/map_rsv.py", line 288, in <module>
  File "/Users/jilliane/miniconda3/envs/rsview/lib/python3.6/site-packages/rsview-0.dev1-py3.6.egg/EGG-INFO/scripts/map_rsv.py", line 281, in main
  File "/Users/jilliane/miniconda3/envs/rsview/lib/python3.6/site-packages/rsview-0.dev1-py3.6.egg/EGG-INFO/scripts/map_rsv.py", line 161, in 

#### To map available RSV sequences with no filter on the collection time


In [7]:
! map_rsv.py genotype ../rsview/data --years all
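
The two `--years` forms used above (a bracketed range and the word `all`) could be interpreted along the following lines. This is a sketch under assumptions: `parse_years` is a hypothetical helper, not the parser inside map_rsv.py, and it makes no claim about why the earlier call ended in a traceback.

```python
import ast


def parse_years(value):
    """Turn a --years string into None (no filter) or a (start, end) tuple.

    Hypothetical helper written for this example only.
    """
    if value.strip().lower() == "all":
        return None
    try:
        parsed = ast.literal_eval(value)
    except (ValueError, SyntaxError):
        raise ValueError(f"expected 'all' or [start, end], got {value!r}")
    if (
        not isinstance(parsed, (list, tuple))
        or len(parsed) != 2
        or not all(isinstance(year, int) for year in parsed)
    ):
        raise ValueError(f"expected 'all' or [start, end], got {value!r}")
    start, end = parsed
    if start > end:
        raise ValueError(f"start year {start} is after end year {end}")
    return start, end
```

Some shells treat square brackets as glob characters, so quoting the value (for example `--years '[1960,2000]'`) may be needed for the script to receive it unchanged.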

# plot_rsv.py

plot_rsv.py generates graphs for viewing and analyzing the dataset on child death rates from acute respiratory infection, which we use as a proxy for RSV disease severity.

#### For help determining which arguments to use to plot the appropriate health data, use "-h"

In [6]:
! plot_rsv.py -h

usage: plot_rsv.py [-h] [--country COUNTRY]
                   [--highlight_country HIGHLIGHT_COUNTRY]
                   {all,country}
                   {nnd,pnd,neo9,post9,ufive9,rneo9,rpost9,rufive9,fneo9,fpost9,fufive9}
                   datadir

Plot data on child death rates from acute respiratory infection

positional arguments:
  {all,country}         Specify whether to plot data for all countries or for a specific country
  {nnd,pnd,neo9,post9,ufive9,rneo9,rpost9,rufive9,fneo9,fpost9,fufive9}
                        Specify which category of data to plot:
                         nnd: Total Neonatal Deaths
                         pnd: Total Post-Neonatal Deaths
                         neo9: Neonatal deaths due to Acute Respiratory Infection
                         post9: Post-neonatal deaths due to Acute Respiratory Infection
                         ufive9: Underfive deaths due to Acute Respiratory Infection
                         rneo9: Neonatal death

#### To plot health data for each country, averaged from 2000-2016:

In [9]:
! plot_rsv.py all rufive9 ../rsview/data
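
The per-country averaging over 2000-2016 mentioned above can be sketched as below. The row layout, the function name `average_by_country`, and the numbers are placeholders invented for the example; they are not values from the health dataset or code from plot_rsv.py.

```python
from collections import defaultdict
from statistics import mean

# Placeholder rows, not real data: (country, year, metric value)
rows = [
    ("Kenya", 2000, 12.0),
    ("Kenya", 2016, 8.0),
    ("Peru", 1999, 50.0),  # outside the range, so it is ignored
    ("Peru", 2000, 6.0),
    ("Peru", 2016, 4.0),
]


def average_by_country(rows, first_year=2000, last_year=2016):
    """Average the metric per country over an inclusive range of years."""
    by_country = defaultdict(list)
    for country, year, value in rows:
        if first_year <= year <= last_year:
            by_country[country].append(value)
    return {country: mean(values) for country, values in by_country.items()}


averages = average_by_country(rows)
```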

#### Use the optional argument "--highlight_country" to emphasize the country of interest in a full plot

In [11]:
! plot_rsv.py all rufive9 ../rsview/data --highlight_country='Kenya'

#### To plot health data by year for a single country, use level='country' and the optional "--country" argument

In [10]:
! plot_rsv.py country rufive9 ../rsview/data --country='Kenya'

#### If no country is specified, plot_rsv.py plots global results over time

In [12]:
! plot_rsv.py country rufive9 ../rsview/data

 
# plot_correlation.py

plot_correlation.py calculates the prevalence of different RSV subtypes in the RSV sequence dataset and plots it against the dataset on child death rates from acute respiratory infection. This can be used to check whether subtype prevalence, or switching between subtypes, correlates with the severity of the disease.
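
To make the idea concrete, the sketch below computes a subtype A to subtype B ratio per country and pairs it with a health metric, which is the kind of (x, y) data a scatterplot would need. The names `subtype_ratio`, `records`, and `health`, along with the sample values, are assumptions for illustration and not the implementation in plot_correlation.py.

```python
from collections import Counter


def subtype_ratio(records):
    """Return {country: count(A) / count(B)}, skipping countries with no B."""
    counts = Counter(records)
    countries = sorted({country for country, _ in counts})
    ratios = {}
    for country in countries:
        a = counts[(country, "A")]
        b = counts[(country, "B")]
        if b:  # avoid dividing by zero when a country has no subtype B
            ratios[country] = a / b
    return ratios


# Made-up inputs: (country, subtype) per sequence, and one metric per country
records = [
    ("Kenya", "A"), ("Kenya", "A"), ("Kenya", "B"),
    ("Peru", "A"), ("Peru", "B"),
    ("Chile", "A"),
]
health = {"Kenya": 10.0, "Peru": 5.0}

ratios = subtype_ratio(records)
# One (ratio, metric) point per country present in both datasets
points = [(ratios[c], health[c]) for c in ratios if c in health]
```

Countries with no subtype B sequences are dropped here for simplicity; how such cases should be treated is a design decision this sketch does not settle.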

#### Use plot_correlation.py with level='all' to generate a scatterplot of a health metric vs the ratio of subtype A to subtype B in each country

In [13]:
! plot_correlation.py all rufive9 ../rsview/data

#### To break the data for each country into per-year subtype data (colored by year), use level='year'

In [14]:
! plot_correlation.py year rufive9 ../rsview/data