Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prepare bug with lowercase bases #413

Closed
swarbred opened this issue Jul 26, 2021 · 0 comments
Closed

Prepare bug with lowercase bases #413

swarbred opened this issue Jul 26, 2021 · 0 comments

Comments

@swarbred
Copy link
Collaborator

From memory we had a previous issue with lowercase bases in the input, if so then it has returned

The attached test.gtf has the following splice sites

NC_037283.1:200919-201095(-) gt ag
NC_037283.1:201282-201445(-) GT ag
NC_037283.1:201512-201775(-) gt ag
NC_037283.1:203421-203569(-) gt ag

i.e. all are canonical

however running mikado prepare on this gtf and attached NC_037283.1.fa file results in no transcripts in the prepare gtf, rerunning with --lenient and this model is output marked as having no canonical introns.

If the NC_037283.1.fa file is converted to uppercase then prepare functions correctly i.e. all introns are regarded as canonical so the issue is with having lowercase bases (commonly used for softmasking)

test.gtf.zip
NC_037283.1.fa.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant