Skip to content

hringbauer/covid19_data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

@author Harald Ringbauer, March 20th 2020

This is a project to analyze data from publicly available COVID19 viral data.

The goal is to use genetic data to learn about key parameters and whether they vary across strains (e.g. virality), 2) To learn about the history of the outbreak and 3) to develop a realtime analyis tool.

To align sequences:

  1. Download the fasta from gisaid Sometimes they have blank lines in the beginning. Remove these

Downoad meta data from nextstrain git.

  1. Copy these two files into ./data

  2. run notebooks/process_data/align_fasta follow instructions there, top to bottom

  3. run notebooks/create_h5.ipynb follow instructions there. Creates h5 and also tables and .csvs of interesting loci and MAFs

About

Analysis of publicly available COVID19 data from nextstrain

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors