Channels not found with grep to $all_reads #1
and see here --> https://github.com/vsoch/carrierseq/blob/singularity/docs/singularity.md for the overall idea.
Hi @vsoch, it looks like the Sequence Read Archive (SRA) has replaced the original read headers. Normally, the sequence data would contain the output information from either the Albacore basecaller or a Poretools fastq conversion command (fast5 > fastq). For example, an Albacore header would look like [read ID, run ID, read, channel, start_time]:
And Poretools [read ID path/to/fast5]:
However, the header information has now been replaced with an SRA ID and only the read ID:
Thank you for the comment, I will investigate how to preserve the original metadata on NCBI. In the meantime, I have uploaded the original fastq file to Dropbox: https://www.dropbox.com/sh/vyor82ulzh7n9ke/AAC4W8rMe4z5hdb7j4QhF_IYa?dl=0
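A quick way to tell whether a fastq file still carries the channel metadata is to look for a channel field in each header. This is only a minimal sketch: it assumes the Albacore-style header stores the channel as a `ch=<number>` key/value field, and the function name `extract_channel` and both sample headers are made up for illustration (they are not taken from the actual data).

```python
import re

# Assumption: Albacore-style headers carry the channel as "ch=<number>".
CHANNEL_RE = re.compile(r"\bch=(\d+)")

def extract_channel(header):
    """Return the channel number from a FASTQ header line, or None if absent."""
    match = CHANNEL_RE.search(header)
    return int(match.group(1)) if match else None

# Hypothetical headers, for illustration only.
albacore_like = "@read-id-0001 runid=abc123 read=10 ch=42 start_time=2017-01-01T00:00:00Z"
sra_like = "@SRR0000000.1 read-id-0001 length=100"

print(extract_channel(albacore_like))  # channel field present
print(extract_channel(sra_like))       # no channel field
```

If every header in a file returns `None`, a grep for the channel field would also come back empty.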
Fantastic! Thanks for your quick response and for looking into this - I'll give it another try with the updated file, and will keep a lookout for updates from you here. A similar thing happened to me and a colleague with data URLs, and we ultimately opted to serve the data ourselves.
hey @amojarro ! I'm working on some Singularity images (like Docker but safe for HPC) to go along with a publication for an internal container organization format, and someone in the community recommended your pipeline (do you know Pim?). It's going well - I have two versions of the container:
https://github.com/vsoch/carrierseq/tree/singularity
but I am hitting a snag. This call:
returns nothing. I am using the data that you linked, so I think either the data changed or the grep call should be adjusted. When nothing is found, the Python script downstream fails because it is given 0 for the denominator.
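One way to make that failure easier to diagnose would be to check for an empty channel list before dividing. The sketch below is not the pipeline's actual script: `reads_per_channel` and its arguments are hypothetical names, and the calculation (reads divided by distinct channels) is only an assumed stand-in for whatever ratio the real script computes.

```python
def reads_per_channel(n_reads, channels):
    """Average reads per distinct channel, with a clear error when no channels were found.

    Hypothetical helper: stands in for any calculation that divides by the
    number of channels recovered from the read headers.
    """
    distinct = set(channels)
    if not distinct:
        # Fail with an explanation instead of a bare ZeroDivisionError.
        raise ValueError(
            "no channels found in the read headers; "
            "the headers may have lost their channel field"
        )
    return n_reads / len(distinct)

# Illustrative values only.
print(reads_per_channel(6, [1, 2, 2, 3]))
```

With an empty channel list, this raises a `ValueError` that points at the headers rather than at the arithmetic.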
Thanks for your help with this!