Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter FASTQ by length - report #35

Closed
hoelzer opened this issue Jan 26, 2021 · 3 comments
Closed

Filter FASTQ by length - report #35

hoelzer opened this issue Jan 26, 2021 · 3 comments
Labels
enhancement New feature or request wontfix This will not be worked on

Comments

@hoelzer
Copy link
Collaborator

hoelzer commented Jan 26, 2021

As an initial step, FASTQ files are filtered by length and if the file size is too small, the FASTQ is not processed any further. It would be good to have this somehow reported.

E.g. I just tested a FAST5 run (V3 primers) that resulted in 24 barcoded FASTQ and it seems 9 of them were sorted out and not processed any further.

It would be good to have a TSV with e.g. all IDs and a column that states which were sorted out due to low number of reads after filtering.

@hoelzer hoelzer added the enhancement New feature or request label Jan 26, 2021
@replikation
Copy link
Owner

the size is so small for "removal" it should usually only remove barcodes that were falsely assigned via "guppy demultiplex" with the "one barcode only" option. so not sure if this would confuse more?

@hoelzer
Copy link
Collaborator Author

hoelzer commented Jan 26, 2021

okay I see, let me do some checks and maybe you are right and we don't need this. It might be just confuse as well when people used 10 barcodes but only get 9 consensuses out (e.g. bc/ one barcode did not work well and produced only a handful reads).

But in such a case, one can also go back to the pycoQC and check the assigned barcode distribution, ...

@hoelzer hoelzer added the wontfix This will not be worked on label Jan 26, 2021
@hoelzer
Copy link
Collaborator Author

hoelzer commented Jan 26, 2021

I set this to wontfix for now and will check some more data to decide if we necessarily need such an output table

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

2 participants