haplohelper

A collection of scripts for determining the haplotype composition of viral populations from sequence data collected from Pacific Biosciences and Illumina sequencing technologies.

Description of files

avg_pool_quality.py - Calculates and records the average pool quality at each position in each segment
- requires a directory with BAM files, the reference sequence, and a file containing pool info for each sample
avg_sample_quality.py - Calculates and records the average sample quality at each position in each segment
- requires a directory with BAM files and the reference sequence
config.py - File that assigns the parameters required by pacbio_haplotype.py
convert_consensus.py - converts consensus files so that each file is a sample with all segment consensus sequences
haplotype_visualization.R - script used to generate plots
- Must manually set the segment (global variable at the top of the script)
- requires the *tidy_haplotypes.csv and *tidier_haplotypes.csv files generated by pacbio_haplotype.py
illumina_linkage_plots.R - used to generate the linkage plots for illumina data. Each plot represents a single pacbio haplotype from one sample on one day. Top line represents the pacbio haplotype.
illumina_utility.py - contains functions for trying to do mutation linkage with illumina data
io_utility.py - helper functions for reading, processing, and writing data
pacbio_haplotype.py - main functions for haplotype determination
- requires config.py, io_utility.py, process_reads.py, illumina_utility.py
plot_transmission_pairs.py - script for automating the comparisons between F0 and F1 generations and generating the appropriate plots
process_reads.py - helper functions for dealing with pacbio or illumina read data

TODO

Clean/organize and document code better
Develop visualization tool

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
avg_pool_quality.py		avg_pool_quality.py
avg_sample_quality.py		avg_sample_quality.py
calc_shannon_entropy.R		calc_shannon_entropy.R
config.py		config.py
convert_consensus_files.py		convert_consensus_files.py
haplotype_visualization.R		haplotype_visualization.R
illumina_linkage_plots.R		illumina_linkage_plots.R
illumina_utility.py		illumina_utility.py
io_utility.py		io_utility.py
pacbio_haplotype.py		pacbio_haplotype.py
plot_transmission_pairs.py		plot_transmission_pairs.py
process_reads.py		process_reads.py

andrewwbutler/haplohelper

Folders and files

Latest commit

History

Repository files navigation

haplohelper

Description of files

TODO

About

Resources

Stars

Watchers

Forks

Languages