bsub: command not found #1

frederic-mahe · 2016-09-09T08:14:02Z

To speed up taxonomic assignment, the STAMPA pipeline described in this repository splits the input dataset into small chunks and spreads the computation load using the LSF scheduler (with the bsub command). If you don't have access to a computer cluster with LSF installed, you can run the analysis on a single machine (i.e. multithreaded, but not distributed across cluster jobs), using the commands below:

# variables
QUERY="representatives.fas"
DATABASE="V4_references.fas"
THREADS=8

# search for best hits
vsearch \
    --usearch_global ${QUERY} \
    --threads ${THREADS} \
    --dbmask none \
    --qmask none \
    --rowlen 0 \
    --notrunclabels \
    --userfields query+id1+target \
    --maxaccepts 0 \
    --maxrejects 32 \
    --top_hits_only \
    --output_no_hits \
    --db ${DATABASE} \
    --id 0.5 \
    --iddef 1 \
    --userout - | sed 's/;size=/_/ ; s/;//' > hits.representatives

# in case of multiple best hits, find the last common ancestor
python stampa_merge.py "$(pwd)"

# sort by decreasing abundance
sort -k2,2nr -k1,1d results.representatives > representatives.results
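
If a script has to work both on a cluster and on a single machine, it can test for bsub before deciding how to run. The sketch below is only an illustration and is not part of the STAMPA pipeline: the function names have_command and choose_mode, and the mode labels "lsf" and "local", are hypothetical. It relies on the POSIX `command -v` built-in.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the named command can be found.
have_command() {
    command -v "$1" >/dev/null 2>&1
}

# Hypothetical helper: print "lsf" when the scheduler command is available
# (bsub by default), otherwise print "local".
choose_mode() {
    if have_command "${1:-bsub}"; then
        echo "lsf"
    else
        echo "local"
    fi
}

echo "execution mode: $(choose_mode)"
```

When the reported mode is "local", the single-machine commands above apply; otherwise the bsub-based workflow can be used.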

frederic-mahe closed this as completed Mar 10, 2018
