Skip to content

Releases: Hajibabaei-Lab/SCVUC_COI_metabarcode_pipeline

SCVUC v4.3.1

17 Jun 22:11
58d51f5
Compare
Choose a tag to compare

This version has been updated to use the VSEARCH --search_exact function instead of --usearch_global --id 1.0 because --search_exact is faster and optimized for finding short exact matches instead doing global alignments.

SCVUC v4.3.0

02 Mar 18:20
f407cf5
Compare
Choose a tag to compare

Update pipeline to translate ESVs then use HMMER profile hidden Markov model sequence analysis to remove bit score outliers (putative pseudogenes) instead of just translating and removing ORF length outliers (v4.2.0).

SCVUC v4.2.0

26 Feb 16:41
Compare
Choose a tag to compare

Streamline the conda environment to remove R packages not needed to run this pipeline. Added GLIBC_2.14 and instructions to set LD_LIBRARY_PATH to point to libc.so.6 needed to run ORFfinder.

SCVUC v4.1.0

10 Feb 21:33
49f65d4
Compare
Choose a tag to compare

Provides a snakemake pipeline to process reads and summarize taxonomic assignments for each ESV in each sample.

Also provides an updated snakemake pipeline that attempts to remove arthropod pseudogenes and produces a report that includes the sequence for the longest retained open reading frame. For pseudogene removal, this version is more sensitive than v4.0.0.

SCVUC v4.0.0

19 Dec 22:21
Compare
Choose a tag to compare

Updates the conda environment to myenv.3 .

Uses VSEARCH v2.14.1 to create dereplicate reads, produce denosised ESVs (instead of USEARCH), remove chimeric sequences, and create the ESV x sample table.

Consolidates several scripts that together parsed nucleotide open reading frame sequences and removed those with outlier lengths (ORFs that are too short or too long) that are putative pseudogenes.

Updates ORFfinder settings to retrieve ORFs for plus strand only, ignoring nested ORFs, minimum length 30bp.

Includes a config file for the COI-ml-jg amplicon.

SCVUC v3.3.2

07 Nov 18:09
Compare
Choose a tag to compare

Update config files, snakefile, and get_longest_orf_nt5.plx so that if multiple ORFs of equal length are found, the first one is consistently retained for further analysis.

SCVUC v3.3.1

30 Oct 17:09
Compare
Choose a tag to compare

Updates the conda environment.yml to use myenv.2 that correctly specifies CUTADAPT v2.6 as well as the snakefile and README files.

SCVUC v3.3.0

25 Oct 17:16
Compare
Choose a tag to compare

Fixes a bug leading to an incorrect number of reads in the ESV table that was first introduced with SCVUC v3.
Cleans up summary of statistical output.
Streamlines ORF filtering to remove outliers.
Updates CUTADAPT 2.4 to 2.5
Updates VSEARCH 2.13.6 to 2.14.1

SCVUC v3.2.0

27 Sep 20:10
Compare
Choose a tag to compare

Update stats output files.

SCVUC v3.1.1

06 Sep 19:51
409c0a8
Compare
Choose a tag to compare

Updated the README.