Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error writing output: ValueError: max() arg is an empty sequence #6

Closed
Talitrus opened this issue Feb 12, 2018 · 2 comments
Closed

Comments

@Talitrus
Copy link

Hi Eddi,

I've been trying to get BLCA to run with a custom database. As far as I can tell, I have everything formatted as specified in the README, but BLCA errors out while trying to write the output files with the following messages:

blastdbcmd is located in your PATH!
muscle is located in your PATH!
>  > Fasta file read in!!
>  > Read in taxonomy information!
blastn is located in your PATH!
> > Running blast!!
> > Blastn Finished!!
>  > Read in blast output!
Traceback (most recent call last):
  File "/groups/cbi/shared/apps/BLCA/BLCA.git/2.blca_main.py", line 352, in <module>
    outout.write(le+":"+max(lexsum,key=lexsum.get)+";"+str(max(lexsum.values()))+";")
ValueError: max() arg is an empty sequence

I managed to get it run successfully once or twice with different sequence IDs, but haven't been able to replicate it. Even then, it kept identifying sequences as "Unclassified," though. This would probably be a separate issue, though.

I've uploaded my blastdb, reference FASTA used to generate the blastdb (midori_blca_dedup.fasta), reference taxonomy file (midori_blca_taxonomy.txt), and two test input FASTA files (uniques10_trim.fasta) and (other_BLCA_test_input.fasta) to a Dropbox link here: https://www.dropbox.com/sh/m3hie0b9o8ldc29/AABHeBAiS94lwl_skgtEY-qIa?dl=0

Thanks for your help!

Cheers,
Bryan

@yingeddi2008
Copy link
Collaborator

yingeddi2008 commented Feb 12, 2018 via email

@Talitrus
Copy link
Author

Talitrus commented Feb 12, 2018

Fantastic! Yes, the differing number of lines/reads is because I generated the taxonomy file before deduplication, but I was hoping it wouldn't make a difference (just being lazy). The Midori database has some duplicate sequence IDs, which I didn't realize until I tried to generate the BLAST db.

Thanks for the clarification. I'm looking forward to giving BLCA a try. The results you posted look promising.

Cheers,
Bryan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants