-
Notifications
You must be signed in to change notification settings - Fork 334
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stitch together overlapping reads? #31
Comments
Yes, I can implement this feature. |
I think it will be used a lot, because
So there is a gap in the open source market for a overlapper tool It would be amazing to have a tool that does adapters, quality AND stitching! |
You'll need stitching support for that to be really true? :-) |
Haha, I've put this feature in fastp's roadmap. |
I'm also interested in this feature! |
me too! |
Okay, I will implement it soon, probably in 1 week. |
There's a lot of literature and existing tools for stitching together reads. It'd be nice to implement whichever is considered "the best", as in, the most accurate. Is there a review paper? Does the peanut gallery have any comments on which is perceived to be the best tool by the community? |
My old blog post is a start, but probably newer tools now: Please note that PEAR is no longer open source and should not be considered. |
ABySS has |
Our old Regards, the 🥜 gallery. |
Random bit trivia. ABySS discards unchaste reads when building the de Bruin graph, but uses unchaste reads when mapping back to the assembly. (if they map, may as well use them) |
Was this implemented? Aligners can penalize unpaired reads, so is it possible that the overlap can be 'clipped' from the read with lower base quality (or randomly if tied)? |
@tseemann Now even the STAR aligner is stitching together the overlapping reads before mapping them in order to get better alignments. |
I promise to implement this in 3 days |
Hi guys, this function is implemented, please have a try and help to update this thread with the results. merge paired-end readsFor paired-end (PE) input, fastp supports stiching them by specifying the In the output file, a tag like For the pairs of reads that cannot be merged successfully, they will be both included in the output by default. But you can specify the Same as the base correction feature, this function is also based on overlapping detection, which has adjustable parameters |
Thank you, Shifu! A couple of questions. Does it handle the case when the sequenced molecule is less than a read length? For example with 2x150 bp sequencing, a result of
Chastity refers to the Illumina chastity filter, which is a different thing, the |
@sjackman thanks for your reply.
Yes, it handles that case. As you suggested, I renamed Please try with the latest code. |
Thanks, Shifu! I'll give it a spin. |
Hi guys, this feature is revised and improved a lot in fastp v0.19.9 (will be released soon), see the update here: merge paired-end readsFor paired-end (PE) input, fastp supports stiching them by specifying the
In the output file, a tag like This function is also based on overlapping detection, which has adjustable parameters |
When the DNA library is overly short, Is it possible that most reads overlap.
Can
fastp
stitch these reads together (instead of just correcting errors) ?So input R1, R2 would produce output R1, R2 and SR (stitched, longer single end reads)
The text was updated successfully, but these errors were encountered: