A bioinformatic pipeline for inferring sequencing depth and evenness on complete plastid genomes
This pipeline illustrates the individual steps taken during the analysis of sequencing depth and evenness of a set of 194 genome records of complete plastid genomes. The pipeline comprises of four parts: (i) read mapping, (ii) metadata extraction, (iii) calculation of sequencing depth and evenness, and (iv) statistical analysis.
Input: samples_list.csv
#!/bin/bash
INPUT=/PATH/TO/samples_list.csv
OLDIFS=$IFS
IFS=','
[ ! -f $INPUT ] && { echo "$INPUT file not found"; exit 99; }
while read -r SRA SAMPLE ACCESSION;
do
bash ./read_mapping.sh $SAMPLE $ACCESSION $SRA </dev/null
done < $INPUT
IFS=$OLDIFS
Input: samples_list.csv
#!/bin/bash
INPUT=/PATH/TO/samples_list.csv
OLDIFS=$IFS
IFS=','
[ ! -f $INPUT ] && { echo "$INPUT file not found"; exit 99; }
while read -r SRA SAMPLE ACCESSION;
do
bash ./metadata_extraction.sh $SAMPLE $ACCESSION $SRA </dev/null
done < $INPUT
IFS=$OLDIFS
Input: Customize path to resources in code
Rscript depth_and_evenness.R
Input: stats_ready_metadata.csv
Rscript plastid_analysis.R