Contains all code necessary to reproduce macrodomain alignment analysis using data pulled from the COVID-19 Data Portal and includes processed data files.
Pipeline order is:
- mine_data.R
- analyze_rerun.R
- analyze_conservation.R (protein mutation frequencies)
- analyze_seq_conservation.R (nucleotide mutation frequencies)