-
Notifications
You must be signed in to change notification settings - Fork 0
06_PROFILES
If you have started your analysis with Anvio7 you will not need to do this step. Since our analysis began with Anvio6-2 and we had to update to Anvio7 to continue with the pipeline, we must perform this step.
# cd into the 04_CONTIGS/ directory
# migrate the databases
anvi-migrate *.db --migrate-dbs-quickly
Perform a nested loop as was done for the mapping step. Do not try to parallelize this! Anvio-profile gets confused if you try to start multiple processes for this at once.
conda activate anvio_emily
while read line;
do
SET=$(echo $line | cut -d" " -f1); # Isolate Coassembly names
samples=$(echo $line | cut -d" " -f2); # Isolate Sample names
delimiter=",";
declare -a Smparray=($(echo $samples | tr "$delimiter" " ")); # Create new delimiter in sample list
for samp in "${Smparray[@]}";
do
anvi-profile -c 04_CONTIGS/$SET-CONTIGS.db \ # Do the profiling
-i 05_MAPPING/$samp.bam \
--skip-SNV-profiling \
--num-threads 40 \
-o 06_PROFILES/$samp
done;
done < samples_in_sets.txt
while read line;
do
SET=$(echo $line | cut -d" " -f1);
samples=$(echo $line | cut -d" " -f2);
delimiter=",";
declare -a Smparray=($(echo $samples | tr "$delimiter" " "));
prefix="05_MAPPING/"; suffix="/PROFILE.db"; # Need to add file names and path to each element
declare -a newarray=(); # Create an empty array
for samp in "${Smparray[@]}";
do
newarray+=($(echo ${prefix}${samp}${suffix})); # Add each full filepath to the empty array
done;
anvi-merge -c 04_CONTIGS/$SET-CONTIGS.db ${newarray[@]} \ # Call the array with the adjusted filepaths
-o MERGED_PROFILES/$SET-MERGED-PROFILE;
done < samples_in_sets.txt
In 2020 Dahle group sent 60 samples for sequencing from various chimneys across the AMOR. The wiki here is to share the pipeline I used to process this dataset. The intent is to be specific about all steps involved, and to provide other lab members with this information so that they do not have to repeat the same time-consuming processes. By using my Git page, there is an added benefit of accountability and having someone to email if something doesn't work for you. :)