-
Notifications
You must be signed in to change notification settings - Fork 0
06_PROFILES
eolesin edited this page Mar 4, 2021
·
12 revisions
# cd into the 04_CONTIGS/ directory
# migrate the databases
anvi-migrate *.db --migrate-dbs-quickly
Perform a nested loop as was done for the mapping step. Do not try to parallelize this! Anvio-profile gets confused if you try to start multiple processes for this at once.
conda activate anvio_emily
while read line;
do
SET=$(echo $line | cut -d" " -f1);
samples=$(echo $line | cut -d" " -f2);
delimiter=",";
declare -a Smparray=($(echo $samples | tr "$delimiter" " "));
for samp in "${Smparray[@]}";
do
anvi-profile -c 04_CONTIGS/$SET-CONTIGS.db \
-i 05_MAPPING/$samp.bam \
--skip-SNV-profiling \
--num-threads 40 \
-o 06_PROFILES/$samp
done;
done < samples_in_sets.txt
In 2020 Dahle group sent 60 samples for sequencing from various chimneys across the AMOR. The wiki here is to share the pipeline I used to process this dataset. The intent is to be specific about all steps involved, and to provide other lab members with this information so that they do not have to repeat the same time-consuming processes. By using my Git page, there is an added benefit of accountability and having someone to email if something doesn't work for you. :)