[New pipeline] PASA #31

Open
Juke34 opened this issue May 18, 2020 · 0 comments

Juke34 (Collaborator) commented May 18, 2020

See #17 for the general picture.

The idea of this pipeline is to parallelise PASA so that it runs faster. Having PASA would be nice because it predicts genes from evidence, and those predictions can be used in several ways:

  • PASA probably produces a better purely evidence-based annotation than MAKER.
  • The PASA annotation can be used to improve or polish an annotation made with another tool.
  • The PASA annotation can be used to train ab initio tools.

PASA is already available on Conda. The difficulty is parallelising it. Marc has already implemented something in esga (see here).
Can we use it as it is?
It sounds like we could make it more general and better commented. For GFF-related tasks (e.g. GffToFasta) we can use AGAT's scripts (see the other NBIS pipelines for how they are used), and for fastaSplitSize we can use the GAAS script gaas_fasta_splitter.pl.
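To make the splitting step concrete, here is a rough Python sketch of what a size-based FASTA split could look like before chunks are handed to parallel jobs. It is only an illustration: `parse_fasta` and `split_by_size` are hypothetical names, the `max_bases` threshold is an assumption, and this does not claim to reproduce what gaas_fasta_splitter.pl or the esga implementation actually do.

```python
from typing import Iterable, Iterator, List, Tuple

# A record is (header without the leading '>', full sequence).
Record = Tuple[str, str]


def parse_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def split_by_size(records: Iterable[Record], max_bases: int) -> List[List[Record]]:
    """Group records into batches of at most max_bases bases each.

    Sequences are never cut: a single sequence longer than max_bases
    ends up alone in its own batch (hypothetical policy for this sketch).
    """
    batches: List[List[Record]] = []
    current: List[Record] = []
    current_size = 0
    for header, seq in records:
        # Start a new batch when adding this record would exceed the limit.
        if current and current_size + len(seq) > max_bases:
            batches.append(current)
            current = []
            current_size = 0
        current.append((header, seq))
        current_size += len(seq)
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    fasta = [">a", "ACGT", ">b", "AC", "GT", ">c", "ACGTACGT"]
    for i, batch in enumerate(split_by_size(parse_fasta(fasta), max_bases=8)):
        print(i, [header for header, _ in batch])
```

Each batch could then be written to its own file and processed as a separate job. Whether PASA can safely be run on chunks like this, and how the per-chunk results would be merged, is exactly what needs checking against the existing esga implementation.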
