apobec
contains two in-house scripts that are used for NGS data analysis by HBV crispr/cas9 research group.
pip install apobec
apobec
is intended to be used in Jupyter Notebook. Create folder named input_data
and put your fastas into it. Navigate into the directory which contains the input_data
folder. Then import the package :
import apobec
and run :
%run -m apobec.count_snp_duplex
%run -m apobec.create_bars
%run -m apobec.snp_rate
The scripts take fasta alignment as an input. The input file is the result of deep sequencing reads mapping onto the reference sequence and is imported from the Geneious software.
count_snp_duplex.py
counts mutations in dinucleotide duplex context.
create_bars.py
outputs summary bar charts :
count_snp_duplex.py
outputs excel spreadsheets to further manipulate the data :
snp_rate.py
counts mutations in each read.
It outputs a distribution plot:
and also raw count and summary statistics in excel spreadsheets:
- Python 3
- biopython
- matplotlib
- numpy
- pandas
- seaborn