DnoisE does not complete: "KeyError: 'count'" #8

Closed
SanniH opened this issue Apr 16, 2021 · 3 comments

Comments

SanniH commented Apr 16, 2021

Hi!

I tried to run DnoisE.py on my own FASTA file of dereplicated, non-chimeric ESVs, but it failed with an error after about 45 min. I was hoping you might be able to help.

The error I get:

Traceback (most recent call last):
  File "DnoisE/src/DnoisE.py", line 467, in
    denoised_ratio_d[i]['sequence'].upper() + "\n")
KeyError: 'count'

I ran DnoisE on a remote server through a shell script. My FASTA is formatted as described in your paper and the repo, although neither mentions whether all bases must be uppercase, so my file has a mix of upper and lower case.

$ head UTILA_DSE.fasta

uniq1;size=62441;
TTATTCTACATACCCTGCTAGTGCTTATTTATCAACTGATTTAATAATCTTTTCATTACATTTAGCCGGTGCTAGTTCTATATTGTCTTCAATAAATTTTATTATAACAGTTTTTATGTTGCCTATAAattcttctttttctttttttcaatatcctttatttatagtagctcaaattactgtttcttttttATTATTAATATCTTTACCTGTTTTAGCCGCTGCTATTACTATGTTACTTTTTGATCGTAATTTCAACACTTCttttttttCCAATTATTTGGGTGGTGATGCTCTTCTTTATCAACATTTATTT
uniq2;size=24836;
TTTGAGTAGTGTTCAAGCTCATTCAGGTCCTTCTGTGGATTTGGCTATTTTTAGCCTTCATTTGTCCGGGGCAGCATCTATTATGGGTTCGATTAATTTCATTACAACAATTATTAATATGCGACCGGGAGGAATGGGAATGCATCGTTTGCCGCTATTTGTATGGGCAGTTTTGCTAACCGCAATTCTATTGTTGCTTTCTCTTCCTGTTTTGGCTGGGGGTATTACTATGTTGTTGACTGACCGAAATTTTAACACTACCTTTTTTGATCCCGCTGGAGGAGGAGACCCTGTTCTTTATCAACACCTATTT
...

My file was generated using PEAR for paired-end merging and VSEARCH for length filtering and dereplication. The resulting FASTA contains ~44K unique sequences, each on a single line, with lengths ranging from 303 to 323 bp (COI Leray fragment). I have attached the FASTA I used.
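For anyone wanting to sanity-check a similar input before a long run, here is a minimal sketch that summarizes a FASTA file: record count, length range, how many sequences contain lowercase bases, and how many headers lack a VSEARCH-style `;size=N;` annotation. It assumes standard FASTA headers starting with `>`; the function names `read_fasta` and `summarize` are made up for this illustration and are not part of DnoisE. It is not claimed that any of these checks relate to the cause of the KeyError.

```python
import re
from typing import Dict, Iterable, Iterator, Tuple

# VSEARCH-style abundance annotation, e.g. "uniq1;size=62441;"
SIZE_RE = re.compile(r";size=(\d+);?")


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs; multi-line sequences are joined."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize(lines: Iterable[str]) -> Dict[str, int]:
    """Collect basic statistics about the records in a FASTA stream."""
    lengths = []
    with_lowercase = 0
    missing_size = 0
    total_reads = 0
    for header, seq in read_fasta(lines):
        lengths.append(len(seq))
        if seq != seq.upper():
            with_lowercase += 1
        match = SIZE_RE.search(header)
        if match is None:
            missing_size += 1
        else:
            total_reads += int(match.group(1))
    return {
        "records": len(lengths),
        "min_len": min(lengths, default=0),
        "max_len": max(lengths, default=0),
        "with_lowercase": with_lowercase,
        "missing_size": missing_size,
        "total_reads": total_reads,
    }
```

It could be used as `with open("UTILA_DSE.fasta") as fh: print(summarize(fh))`. Reading line by line keeps memory use low even for files with tens of thousands of sequences.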

The command I used to run DnoisE was:
$ python3 DnoisE/src/DnoisE.py -i UTILA_DSE.fasta -o Utila -c 20

Any help would be appreciated, as I am very keen to see how this compares to the data I previously generated using DADA2!

UTILA_DSE.zip

Owner

adriantich commented Apr 16, 2021 via email

Hi SanniH,
The problem is fixed.
You can update using git pull. Please tell me if it works well!
A.

Author

SanniH commented Apr 16, 2021

Hi Adria,

I'll start the run now and report back if there are any more issues :)
Thanks for the quick response!

Sanni
