-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Pangenome: missing genes with repeat motifs #1955
Comments
Great point. I will add |
This is now solved :)
The parameters used are shown in the output message during runtime:
And additional parameters used stored in the
With this change there is no more |
Short description of the problem
In the anvi'o google group, Emily St John noticed a few genes were missing after
anvi-pan-genome
. I observed the same in a different pangenome analysis.The genes had a low complexity composition with repeats and were skipped by Diamond default masking parameter.
Maybe we should consider changing the default Diamond blastp parameters and remove the default masking with
--masking 0
?This issue echoes with #1922 and the masking option could also be available to the advanced user that want to use it.
anvi'o version
Development branch.
Detailed description of the issue
Here is a gene that was skipped:
To reproduce the error, save the above sequence as
test.fasta
.With the default Diamond command in
anvi-pan-genome
(no output):With the masking off:
The text was updated successfully, but these errors were encountered: