-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PanEDTA Line Detection #424
Comments
Hi Scott, Thank you for reporting this! Can you please update EDTA to 2.2.0 and test panEDTA again? There are many big changes to the new version for improved SINE/LINE annotations. Thanks, |
Any luck? Shujun |
My apologies for the delayed response Shujun, I will update EDTA this weekend or early next week and follow-up |
Hi Shujun, I am actually having additional trouble with EDTA 2.2.0 now that I have updated. I am having a lot of error getting conda to resolve dependencies when installing, so I elected to use singularity. That installation worked, but now when I run the genomes I get 0 LINE result files and the TIR detection fails, but does not crash. Here is a sample output:
The test that you included in your README works mostly... The dependencies check out, but I get a similar set of warnings:
|
Currently re-trying with a fresh Anaconda installation and conda environment. |
The yml file should be helpful for conda installation. I don’t think the
singularity version is working at the moment
Shujun
…On Tue, Apr 16, 2024 at 3:36 PM Scott Teresi ***@***.***> wrote:
Currently re-trying with a fresh Anaconda installation and conda
environment.
—
Reply to this email directly, view it on GitHub
<#424 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNX4NDXPHCEGJVCJ6UNAWLY5V4UFAVCNFSM6AAAAABCRVERVWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANJZG44TOMJTGQ>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Hi Shujun, I got the latest version of EDTA to complete the system test, still running it on my genomes. I will report back. I had to install bedtools and samtools on top of the conda environment for this latest upgrade. I did not see those being specified in the yml file, and I was having trouble making the basic install work. Perhaps I am wrong and messed up the install, or maybe they were pre-loaded on your computing cluster system so they were missed. Either way, I hope this helps! |
Hello Shujun,
Hope you are doing well. I am writing to share that I had issues with LINE detection in PanEDTA. I am hoping that this will help anyone else who encounters this issue. It is not a bug, just something that I think folks could easily overlook. When I ran PanEDTA (v2.1.0) on its own without pre-calculating results with regular EDTA, it was not finding any LINE elements in my genomes.
After doing some testing, I think it is because the panEDTA script by default calls
EDTA.pl
without the--sensitive 1
option. The sensitive option calls RepeatModeler. I also observed that when I ran regularEDTA.pl
on a genome without the sensitive option, it did not recover any LINEs. So to summarize, it seems that RepeatModeler was doing the heavy lifting for LINE detection in my strawberry genomes, and without it, I wasn't detecting any LINEs. Jordan B, a post-doc in Pat's lab also had this same LINE issue with some Camelina genomes.In my case, I fixed the issue by running EDTA individually on each genome with the option, and completed the pangenome annotation with panEDTA. That approach worked fine, LINEs were indeed included in my final annotation.
This problem only arises if users decide to use panEDTA to perform all steps of their pangenome annotation. It can easily be sidestepped if user's create the individual annotations with the
--sensitive 1
option first.Sincerely,
Scott Teresi
The text was updated successfully, but these errors were encountered: