Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merging Multiple Batches in DADA2 #175

Closed
nmshahir opened this issue Jan 30, 2017 · 5 comments
Closed

Merging Multiple Batches in DADA2 #175

nmshahir opened this issue Jan 30, 2017 · 5 comments

Comments

@nmshahir
Copy link

nmshahir commented Jan 30, 2017

Excellent tool. I have a question regarding merging multiple batches, you mention that the parameters need to be the same in order to merge? Does that mean that trimLeft and truncLen parameters need to be the same or do all the filtering parameters need to be the same?
Thank you in advance.
-Nur

@nmshahir nmshahir changed the title Merging Merging Multiple Batches in DADA2 Jan 30, 2017
@benjjneb
Copy link
Owner

Just the trimming parameters need to be the same (and I'm assuming your primers were the same across your batches) so that the same sequence region is covered by each batch.

Filtering parameters (eg. maxEE) can be different, and sometimes should be different if the quality was very different between your sequencing runs.

@nmshahir
Copy link
Author

Awesome! Extension of previous question, when combining multiple batches should you aim for trimming parameters that preserve the highest the quality or preserve the length or like a compromise?

Thank you.

@benjjneb
Copy link
Owner

No one answer for everyone, but probably a compromise. More length can give better taxonomic resolution, while tighter trimming will let more reads through.

Another thing to keep in mind, when using paired reads that overlap in the middle, only the trimLeft parameter really matters for merging different batches, as that determines the amplicon length.

@nmshahir
Copy link
Author

I'm a little confused now.

Another thing to keep in mind, when using paired reads that overlap in the middle, only the trimLeft parameter really matters for merging different batches, as that determines the amplicon length.

So, I'm using paired reads from Illumina MiSeq runs and all are from the V1V2 regions. Are you saying that I can have varying truncLen parameters for the different batches BUT trimLeft has to be the same across all batches?

@benjjneb
Copy link
Owner

Yes.

Because runs are merged after merging paired reads, and because truncLen does not affect the length of the merged reads (just the length of the overlap between them), then truncLen can vary between runs.

trimLeft must be the same between them all, because it affects the length of the merged reads.

  • caveats: truncLen must always be long enough for reads to overlap, and I'm assuming your reads don't extend past the end of the other primer *

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants