Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need help for assemblying a high heterozygous genome! #46

Closed
bioinformaticspcj opened this issue May 5, 2021 · 3 comments
Closed

Need help for assemblying a high heterozygous genome! #46

bioinformaticspcj opened this issue May 5, 2021 · 3 comments

Comments

@bioinformaticspcj
Copy link

Hi Alex,
I have tried to assembly the ~4G genome, but found it can not be assemblied well with N50 only 275,951 bp and 2.77G in size. I estimated the genome with 17-mers, the results show it is a high heterozygous genome with 0.9% of heterozygosity.

The distribution of the 17-mers is as follows:

image

Could you give me some advice for parameters setting ?

Thanks a lot.

Best,
Bob

@adigenova
Copy link
Owner

Hi bob,
Can you start the k-mer histogram from at least 10 of frequency? to see the peaks. A thing that you can do is to generate a short-read assembly with the Platanus assembler that is designed for highly heterozygous genomes (http://platanus.bio.titech.ac.jp/platanus/platanus-allee-2-0-was-released), then give that result as input to Wengan (-c option).

best,
Alex

@bioinformaticspcj
Copy link
Author

Hi Alex,
Thanks very much for your timely advice. Given platanus is a memory greedy assembler, I am afraid it could not deal with such a big genome using our limited memory (~1Tb). The followed file is the distribution of 17mers altered as you suggested.
kmer_hist.pdf

Do you have any other advice?

Best,
Bob

@adigenova
Copy link
Owner

Hi bob,

I think that if you feed only 50-60X coverage to Platanus the assembly might fit in 1TB machine.
There are other alternatives but are more complex to implement, so I suggest starting with Platanus first.

Best
Alex

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants