Summary

Here is the rule graph of the computational workflow:

Here is the Markdown output of each notebook in the workflow:

Get prior RBD DMS mutation-level binding and expression measurements and the barcode-variant lookup table from the SARS-CoV-2-RBD_DMS_Omicron repository and the original DMS library for SARS-CoV-2 (PCR-based mutagenesis) here.
Count variants and then aggregate counts for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2 to create variant counts files for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2.
Analyze the ratio of sequencing counts to sorted cells for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2. This prints a list of any samples where the ratio is too low. It also creates a CSV for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2 with the sequencing counts, number of sorted cells, and ratios for all samples.
Calculate escape scores from variant counts for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2.
Call sites of strong escape for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2.
Plot escape profiles for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2.
Map escape profiles to *.pdb files using notebooks here for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2.
Make supplementary data files for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2, which are here for Wuhan_Hu_1, Omicron_BA1, and Omicron_BA2. These include dms-view input files.
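
The counts-to-cells check in the workflow above can be illustrated with a minimal sketch. This is not the notebook's actual code: the record keys (`sample`, `counts`, `cells`), the sample names, the numbers, and the `min_ratio` threshold are all hypothetical and chosen only to show the idea of computing the ratio per sample, flagging low values, and writing a CSV.

```python
import csv
import io


def counts_to_cells(samples, min_ratio):
    """Compute sequencing-counts-to-sorted-cells ratios and flag low samples.

    `samples` is a list of dicts with hypothetical keys 'sample', 'counts',
    and 'cells'. Assumes every sample has cells > 0.
    """
    rows = []
    for s in samples:
        ratio = s["counts"] / s["cells"]
        rows.append({**s, "counts_to_cells_ratio": ratio})
    # Samples whose ratio falls below the (assumed) threshold.
    flagged = [r["sample"] for r in rows if r["counts_to_cells_ratio"] < min_ratio]
    return rows, flagged


def to_csv(rows):
    """Serialize the per-sample table to CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["sample", "counts", "cells", "counts_to_cells_ratio"]
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Illustrative, made-up inputs.
samples = [
    {"sample": "antibody_A", "counts": 500000, "cells": 100000},
    {"sample": "antibody_B", "counts": 150000, "cells": 100000},
]
rows, flagged = counts_to_cells(samples, min_ratio=2.0)  # threshold is an assumption
print("Samples with a low counts-to-cells ratio:", flagged)
print(to_csv(rows))
```

A sample flagged here has few sequencing counts relative to the number of cells sorted, so its variant counts may undersample the sorted population.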
