GitHub

certhia_phylogeography

Steps to analyze resequencing data for Certhia americana samples from 7 populations.

01_setup.sh -> make directories for organizing all files throughout analyses
02_align.sh -> filters, trims, and aligns data to reference genome
                -> requires input basenames.txt to run 
                -> all fastq files should be gzipped and in the formats: basename_R1.fastq.gz
                                                                         basename_R2.fastq.gz
02b_extract_filtering_info.sh -> pulls out filtering info from bbduk slurm.out files
03a_coverage.sh -> uses samtools to measure alignment coverage from bam files
03b_plot_coverage.r -> uses R to plot the output from 03a_coverage.sh
04_create_genotype_scripts.r -> R script that makes slurm array scripts and helper text files for genotyping in GATK
                -> requires popmap.txt and 06_certhia_reordered.fasta.fai files for job creation
05_concatenate_vcf_files.sh + 05b_concatenate_vcf_files.sh -> combine all vcfs from the same chromosome into single vcf files
06_filter_vcf.sh -> filter the whole-chromosome vcf files for downstream analyses
07_whole_genome_admixture.sh -> run whole-genome admixture analyses with thinned SNP datasets
08a_phylo_stats_50kbp.r + 08b_phylo_stats_100kbp.r -> R scripts to build SLURM array jobs to estimate phylogenies and popgen summary statistics in sliding windows
                -> subsequent phylogeny functions in SLURM jobs need the create_fasta_from_vcf.r and create_fasta.r R scripts
                -> subsequent popgen functions in SLURM jobs need the calculate_windows.r and window_stat_calculations.r R scripts
08c_combine_trees.r -> combine all raxml output .tre files into a single .trees file
08d_count_sites_in_tree_files.sh -> count invariable and variable sites per vcf
08d_species_trees.sh -> estimate species trees from gene trees
08e_gsi.r -> calculate GSI from each of the gene trees
09 sh and r scripts -> run LOSTRUCT
10a_50kbp_admixture.r + 10b_100kbp_admixture.r -> R scripts that make SLURM array scripts for sliding window ADMIXTURE analyses

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
01_setup.sh		01_setup.sh
02_align.sh		02_align.sh
02b_extract_filtering_info.sh		02b_extract_filtering_info.sh
03a_coverage.sh		03a_coverage.sh
03b_plot_coverage.r		03b_plot_coverage.r
04_create_genotype_scripts.r		04_create_genotype_scripts.r
05_concatenate_vcf_files.sh		05_concatenate_vcf_files.sh
05b_concatenate_vcf_files.sh		05b_concatenate_vcf_files.sh
06_certhia_reordered.fasta.fai		06_certhia_reordered.fasta.fai
06_filter_vcf.sh		06_filter_vcf.sh
06b_filter_vcf_for_msmc2.sh		06b_filter_vcf_for_msmc2.sh
07_whole_genome_admixture.sh		07_whole_genome_admixture.sh
08a_phylo_stats_50kbp.r		08a_phylo_stats_50kbp.r
08b_phylo_stats_100kbp.r		08b_phylo_stats_100kbp.r
08c_combine_trees.r		08c_combine_trees.r
08d_count_sites_in_tree_files.sh		08d_count_sites_in_tree_files.sh
08d_species_trees.sh		08d_species_trees.sh
08e_gsi.r		08e_gsi.r
09a_lostruct_setup.sh		09a_lostruct_setup.sh
09b_run_lostruct_50kbp.r		09b_run_lostruct_50kbp.r
09c_run_lostruct_100kbp.r		09c_run_lostruct_100kbp.r
09d_combine_lostruct_results.r		09d_combine_lostruct_results.r
09e_count_sites_lostruct_files.sh		09e_count_sites_lostruct_files.sh
10a_50kbp_admixture.r		10a_50kbp_admixture.r
10b_100kbp_admixture.r		10b_100kbp_admixture.r
11a_cut_simple_vcf_for_msmc.sh		11a_cut_simple_vcf_for_msmc.sh
11b_make_msmc_directories.sh		11b_make_msmc_directories.sh
11c_make_msmc_files.sh		11c_make_msmc_files.sh
11d_run_msmc.sh		11d_run_msmc.sh
11e_organize_msmc_output.sh		11e_organize_msmc_output.sh
11f_plot_msmc.r		11f_plot_msmc.r
12a_create_snpeff_database.sh		12a_create_snpeff_database.sh
12b_run_snpeff.sh		12b_run_snpeff.sh
12c_cat_snpeff_vcfs.sh		12c_cat_snpeff_vcfs.sh
LICENSE		LICENSE
README.md		README.md
basenames.txt		basenames.txt
calculate_windows.r		calculate_windows.r
create_fasta.r		create_fasta.r
create_fasta_from_vcf.r		create_fasta_from_vcf.r
cut_popmap.txt		cut_popmap.txt
gsi_popmap.txt		gsi_popmap.txt
make_MSMC_files.r		make_MSMC_files.r
popmap.txt		popmap.txt
popmap_phylo.txt		popmap_phylo.txt
vcf_cat.txt		vcf_cat.txt
vcf_list.txt		vcf_list.txt
window_stat_calculations.r		window_stat_calculations.r

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

certhia_phylogeography

About

Releases

Packages

Languages

License

jdmanthey/certhia_phylogeography

Folders and files

Latest commit

History

Repository files navigation

certhia_phylogeography

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages