Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changing Fastq header format #42

Open
vals opened this issue Jul 27, 2017 · 0 comments
Open

Changing Fastq header format #42

vals opened this issue Jul 27, 2017 · 0 comments

Comments

@vals
Copy link
Owner

vals commented Jul 27, 2017

I was speaking to Nuno at the Expression Atlas who said our format for Fastq headers is not compatible with CASAVA standard.

I think it would make sense to change the header format to one similar to described here: https://github.com/nunofonseca/fastq_utils

The biggest difference is that keeping the original read name at the end rather than beginning will make the read follow the CASAVA standard, but also some optimisations using the htslib API would be possible when parsing the header.

This would also allow us to use the fastq_utils script as an optional faster way to transform fastq files in protocols with simpler read topologies not requiring regular expressions (which are the majority of them).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant