Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keeping all full-length transposons? #21

Closed
ghost opened this issue Sep 20, 2019 · 5 comments
Closed

Keeping all full-length transposons? #21

ghost opened this issue Sep 20, 2019 · 5 comments
Labels
enhancement New feature or request

Comments

@ghost
Copy link

ghost commented Sep 20, 2019

Hello,
I might have a suggestion:
I was wondering if it wouldn't be useful for the final user to be able to get a file with the coordinates of the transposon, for example if one is interested to look at their position in the genome.

Thanks for EDTA, it's a cool pipeline.

@oushujun oushujun added the enhancement New feature or request label Sep 20, 2019
@oushujun
Copy link
Owner

Hi @aderzelle,

Thanks for your suggestion. I am working with a functionality to retain all intact transposons identified by EDTA, which could be useful for people who want to study the biology of transposons. I think this is essentially what you are proposing. Currently, EDTA produces a non-redundant library for the whole-genome annotation purpose.

Best,
Shujun

@oushujun oushujun changed the title Keeping mapping results? Keeping all full-length transposons? Sep 20, 2019
@oushujun
Copy link
Owner

Hi @aderzelle ,

Sorry for the delay response. I have updated EDTA, it now also outputs intact TEs. Please try it out and let me know if there is any issues.

Best,
Shujun

@oushujun
Copy link
Owner

The current version provides outputs for full-length LTR, TIR, and Helitrons in both fasta and gff formats. (*.intact.fa and *.intact.fa.gff). Feel free to reopen this issue if you identify new problems or have any suggestions.

@ghost
Copy link
Author

ghost commented Nov 19, 2019

Hello,
it works however LTR are in gff3 formats while the others are in gff2 format, which might create confusion in downstream analysis (though it's not a big deal of course but as a little improvement they could be in the same gff version).

@oushujun oushujun reopened this Nov 19, 2019
@oushujun
Copy link
Owner

Dear @aderzelle,

Thank you for your suggestions. I have incorporated the intact TE annotation to the whole-genome TE annotation. Also, gff format of the final annotation is more consistent with each other. Now the *EDTA.TEanno.gff file contains both structural-based and homology-based annotations. Please check out the new release or the latest repository for more details.

Happy New Year!

Best,
Shujun

@oushujun oushujun closed this as completed Jan 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant