Use --origfmt with fastq-dump #37

Open
arteymix opened this issue Jan 24, 2020 · 0 comments


Currently, we use the SRA format for FASTQ headers, which prefixes the SRR run accession to the original string from the sequencer. This format is not compatible with ArrayExpress and local sources, and it will pose a problem if we try to generalize batch information extraction to arbitrary FASTQs rather than just GEO series.

The solution is to add the --origfmt flag to fastq-dump so that the original header will be used instead.
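
As a rough sketch of what that could look like, assuming fastq-dump is launched from a Python wrapper: the helper name `build_fastq_dump_command` and its parameters are hypothetical and not taken from the existing code, and the only fastq-dump options used are `--origfmt`, `--gzip`, and `--outdir`.

```python
def build_fastq_dump_command(srr_accession, output_dir, use_original_header=True):
    """Build the argument list for a fastq-dump call (hypothetical helper)."""
    command = ["fastq-dump", "--gzip", "--outdir", output_dir]
    if use_original_header:
        # --origfmt: the defline contains only the original sequence name
        command.append("--origfmt")
    command.append(srr_accession)
    return command
```

The resulting list could be handed to `subprocess.run(command, check=True)`. Keeping the flag behind a parameter would make it easy to compare both header styles while the parsing is being adjusted.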

This might require some adjustment in how Gemma parses the batch information.
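
To illustrate the kind of adjustment involved, here is a minimal sketch of a parser that accepts both header styles. It is not Gemma's actual implementation: the function names are hypothetical, the SRA-style layout (`@SRR….N <original> length=NN`) is an assumption about the current output, and the field order assumes Illumina Casava 1.8+ headers (`instrument:run:flowcell:lane:tile:x:y`).

```python
import re

# Assumed SRA-style prefix, e.g. "SRR1234567.1 " (also ERR/DRR accessions).
SRA_PREFIX = re.compile(r"^[SED]RR\d+\.\d+\s+")
# Assumed trailing " length=NN" appended to SRA-style deflines.
LENGTH_SUFFIX = re.compile(r"\s+length=\d+$")


def original_header(defline):
    """Strip the leading '@' and any SRA-specific prefix/suffix."""
    header = defline.strip().lstrip("@")
    header = SRA_PREFIX.sub("", header)
    header = LENGTH_SUFFIX.sub("", header)
    return header


def batch_fields(defline):
    """Return instrument, run, flowcell and lane, or None if the header
    does not look like a seven-field Casava 1.8+ header."""
    tokens = original_header(defline).split()
    if not tokens:
        return None
    parts = tokens[0].split(":")
    if len(parts) != 7:
        return None
    instrument, run, flowcell, lane = parts[:4]
    return {
        "instrument": instrument,
        "run": run,
        "flowcell": flowcell,
        "lane": lane,
    }
```

With this approach, a header produced with `--origfmt` and the same read in the SRA-style format would yield identical batch fields, so headers from ArrayExpress or local FASTQs could go through the same code path. Headers in other layouts return `None` rather than being guessed at.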
