Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failure being caused by certain input sequences. #2

Open
passdan opened this issue Jun 3, 2014 · 4 comments
Open

Failure being caused by certain input sequences. #2

passdan opened this issue Jun 3, 2014 · 4 comments

Comments

@passdan
Copy link

passdan commented Jun 3, 2014

I am getting the following error being caused by certain sequences. When I remove the offending sequence the process continues fine, but I can't understand how the error is being caused as the input is exactly the same size and format as the other sequences. No sequences are put into the 'failed' output file.

My input is ~7000 rbcl gene nucleotide sequences being corrected against ~170 ncbi protein references as follows:

java -jar ~/programs/RDPTools/FrameBot.jar framebot  -N -o test bac_protein.fas rbcl_nucl.fas

Sequence input example (failure causing):

>DTM44_641
CGTTTTTTAAATTGTATGGAAGGTATTAACCGTGCTGCAGCTGCAACAGGTGAAGTTAAAGGTTCTTACTTAAACGTTACTGCAGCGACTATGGAAGAAGTACTTAAACGCTGTGAATATGCAAAAGAAGTCGGTTCTATTATTGTTATGATCGATTTAGTTATGGGTTATACAGCAATTCAAAGTGCTGCAATCTGGGCTCGTGACAACGATATGCTTTTACATTTACACCGTGCCGGTAACTCTACTTATGCACGTCAAAAAAGTCATGGTATTAATTTCCGTGTAATCTGTAAATGGATGCGTATGTCTGGTGTTGATCATATTCACGCTGGTACAGTTGTAGGTAAATTAGAAGGTGATCCTTTAATGATTAAAGGTTTCTATGATACTTTACGTTTAACAAAATTTAGAGGTTAATTTACCTTATGGTATTTTTCTTCGAAAGTGACATGGGCAAGTTTACGCCGTTGTATGCCTGTTGCATCTGGTGGTATTCATTGTGGTCAAATGCATCAATTAGTTCACTATTTAGGTGATGATGTAATAT

Error Message:

Exception in thread "main" java.lang.IllegalArgumentException: Cannot score R, ?
        at edu.msu.cme.rdp.alignment.pairwise.ScoringMatrix.score(ScoringMatrix.java:180)
        at edu.msu.cme.rdp.framebot.core.FramebotCore.computeMatrix(FramebotCore.java:81)
        at edu.msu.cme.rdp.framebot.core.FramebotCore.processSequence(FramebotCore.java:67)
        at edu.msu.cme.rdp.framebot.cli.FramebotMain.framebotItUp_prefilter(FramebotMain.java:136)
        at edu.msu.cme.rdp.framebot.cli.FramebotMain.main(FramebotMain.java:381)
        at edu.msu.cme.rdp.framebot.cli.Main.main(Main.java:48)
@passdan
Copy link
Author

passdan commented Jun 3, 2014

On further investigation, the failure-causing sequences are those which are the most divergent from the references. it seems that >20 base differences (from ~500) cause the above failure.

Should these sequences not be going into the failure.txt rather than crashing the process though?

@rdpstaffmsu
Copy link

Hello! Sorry for the late reply our notifications seem to be acting up, hopefully we can still help. We cannot fully test the issue you were having without the same protein reference file you used, would you be willing to share that with us?

@rdpstaffmsu
Copy link

rdpstaffmsu commented Mar 25, 2017 via email

@wichne
Copy link

wichne commented May 3, 2017

Hi. I ran into a similar problem. One of my sequences would fail with:
Exception in thread "main" java.lang.IllegalArgumentException: Cannot score V, O
at edu.msu.cme.rdp.alignment.pairwise.ScoringMatrix.score(ScoringMatrix.java:180)
at edu.msu.cme.rdp.framebot.core.FramebotCore.computeMatrix(FramebotCore.java:81)
at edu.msu.cme.rdp.framebot.core.FramebotCore.processSequence(FramebotCore.java:67)
at edu.msu.cme.rdp.framebot.cli.FramebotMain.framebotItUp_prefilter(FramebotMain.java:165)
at edu.msu.cme.rdp.framebot.cli.FramebotMain.main(FramebotMain.java:496)
at edu.msu.cme.rdp.framebot.cli.Main.main(Main.java:50)

Turns out my framebot.fa seed file had a sequence with an illegal character in it ("O"). The solution was to fix the sequence, although, I suppose removing the sequence or just the illegal character would also work. It is interesting that this only caused a failure with one of many sequences tested.

Cheers, Bill

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants