Skip to content
Linkage disequilibrium analysis of B cell sequences
Mathematica Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

Analysis of linkage disequilibrium for Dale et al 2019

These scripts and data produce Figure 2 K-P.

Citation: Dale GA, Wilkins DJ, Bohannon CD, Dilernia D, Hunter E, Bedford T, Antia R, Sanz I, Jacob J. 2019. Clustered mutations at the murine and human IgH locus exhibit significant linkage consistent with templated mutagenesis. J Immunol: ji1801615.

Generate LD statistics

Generate R^2 LD statistics for each dataset by running the script as:

python2 --dataset rPA_day_8
python2 --dataset rPA_day_16
python2 --dataset rHA_day_16
python2 --dataset tas2016
python2 --dataset rabbit
python2 --dataset chicken

This takes FASTA alignments from the fastas/ directory and populates TSV files into the rsq_tables/ directory. These have the headers isotype, gene, distance, rsq, where each row represents a specific allelic comparison. I've versioned rsq_tables/rPA_day_8.tsv, etc... for convenience.

This script is currently Python 2 compatible only.

Plot scatterplots

These TSV files are plotted using the supplied ld-plotting.nb Mathematica notebook resulting in figures/ld_plots.png.

You can’t perform that action at this time.