-
Notifications
You must be signed in to change notification settings - Fork 402
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
aligning HiFi data to ONT reads #801
Comments
If the HiFi coverage is high enough, the better approach is to map nanopore reads to HiFi unitigs. 200G vs 60G will take a very long time. Splitting files will complicate downstream analysis. |
good point! The bigger picture is: I am working with a highly heterozygous plant, and I want to:
Regarding splitting the "reference": where do you think the complicated part would be? |
If the heterozygosity is a few percent, most contigs will be haplotigs with HiFi reads only. HiFi+Hi-C is the best automated solution you can use now. HiFi+Nanopore should work better in theory but there are no good tools yet.
minimap2 does local alignment. It can find partial matches. You should add an option something like
Then you should map nanopore reads to unitigs because in comparison to HiFi reads, unitigs carry phasing information in longer range. If you correct Nanopore reads with HiFi reads, you will need to connect the phases of HiFi reads, which is effectively an assembly – it is hard.
First, the phasing issue above. Second, you need to merge results properly. When you split the reference, a HiFi read may hit to irrelevant Nanopore reads. You need to filter them out. Based on your description, I strongly recommend to map nanopore reads to unitigs. You may start with option |
Hello,
I would like to align PacBio HiFi reads to ONT reads with the goal of making a consensus of the alignment against the long read backbone (to correct the errors).
First, I wonder whether Winnowmap is more suitable than minimap2, or if any other aligner (lra?) will work best.
Then, in my case the reference is of lower quality (median Q score 11) and the query has much higher accuracy (Q 31): will this affect the choice of the alignment parameters? Can I use
-x map-ont
orasm20
as presets?Lastly, I have more than 200 Gb of raw ONT reads that I would like to error correct with ~60 Gb HiFi data: I thought of splitting the "reference" to have small jobs and shorter time to compute the index. Is this a good approach or will splitting affect the representativeness of the minimizers?
Thanks,
Dario
The text was updated successfully, but these errors were encountered: