Skip to content

omicsEye/covid-19

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 

Repository files navigation

Steps for the analyses that we will do:

0) Download metadata and MSA from GISAID a) First [register for an account](https://platform.gisaid.org/epi3/cfrontend#335368). This may take several days.

b) Once you have an account, sign in here with your username and password.

c) From the EpiCov tab, click on Downloads and select the down arrow for the multiple sequence alignment (ex. MSA_0728), and the metadata (nextmeta)

d) Extract from tar.xz file with tar -xf file.tar.xz


1) Download NCBI fasta files for each outgroup

a) Bat Coronavirus

b) MERS

c) SARS-related Coronavirus

d) Outgroups


  1. Change names of GISAID fasta files to EPI IDs and format metadata from GISAID

  2. Retrieve and format metadata from NCBI, combine files from NCBI and GISAID

  3. Align GISAID and NCBI sequences using mafft

  4. separate both alignments by locus.

  5. make trees using raxml for

a) GISAID + NCBI (whole genome)

b) GISAID + NCBI (each gene)
  1. Visualize trees and metadata with iTOL

  2. Make distance matrices for a) GISAID genome alignment, b) GISAID individual gene alignments, c) GISAID+NCBI genome alignment, d) GISAID+NCBI individual alignments

  3. Run m2clust on everything.

About

Scripts, data, and analysis of COVID-19

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •