-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failure being caused by certain input sequences. #2
Comments
On further investigation, the failure-causing sequences are those which are the most divergent from the references. it seems that >20 base differences (from ~500) cause the above failure. Should these sequences not be going into the failure.txt rather than crashing the process though? |
Hello! Sorry for the late reply our notifications seem to be acting up, hopefully we can still help. We cannot fully test the issue you were having without the same protein reference file you used, would you be willing to share that with us? |
Would you mind sending us (rdpstaff@msu.edu) both the reference and query
sets to check? Thank you.
Benli
…On Sat, Mar 25, 2017 at 7:30 AM, zoubinok ***@***.***> wrote:
Hello. As I have a series of sequences to process, I try to use local
FrameBot. Unfortunately, I cannot use protein as the reference, which can
be sued on the web FrameBot. But the index works.
So I use the dataset in the RDP pipeline. It has the same problem. So how
can I deal with it? Thank you.
Error Message:
java -jar /home/server/RDPTools/FrameBot.jar framebot -o amoaaob_test
amoA_prot_ref.fasta /home/server/RDPTools/Xander_
assembler/gene_resource/amoA_AOB/originaldata/nucl.fa Exception in thread
"main" java.util.zip.ZipException: Not in GZIP format at
java.util.zip.GZIPInputStream.readHeader(GZIPInputStream.java:164) at
java.util.zip.GZIPInputStream.<init>(GZIPInputStream.java:78) at
java.util.zip.GZIPInputStream.<init>(GZIPInputStream.java:90) at
edu.msu.cme.rdp.framebot.index.FramebotIndex.readExternalIndex(FramebotIndex.java:186)
at edu.msu.cme.rdp.framebot.cli.FramebotMain.main(FramebotMain.java:474)
at edu.msu.cme.rdp.framebot.cli.Main.main(Main.java:50)
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKlEVq5cMHobdXujAv_ummFskOYAOvqGks5rpProgaJpZM4CAm9d>
.
--
RDP Staff
Ribosomal Database Project
Center for Microbial Ecology
Michigan State University
567 Wilson Rd. Room 2225 A
East Lansing, MI 48824
(517) 353-3842
|
Hi. I ran into a similar problem. One of my sequences would fail with: Turns out my framebot.fa seed file had a sequence with an illegal character in it ("O"). The solution was to fix the sequence, although, I suppose removing the sequence or just the illegal character would also work. It is interesting that this only caused a failure with one of many sequences tested. Cheers, Bill |
I am getting the following error being caused by certain sequences. When I remove the offending sequence the process continues fine, but I can't understand how the error is being caused as the input is exactly the same size and format as the other sequences. No sequences are put into the 'failed' output file.
My input is ~7000 rbcl gene nucleotide sequences being corrected against ~170 ncbi protein references as follows:
Sequence input example (failure causing):
Error Message:
The text was updated successfully, but these errors were encountered: