Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Meryl: wrong outputs with compressed FASTQ files #3801

Closed
gallardoalba opened this issue Jul 12, 2021 · 11 comments
Closed

Meryl: wrong outputs with compressed FASTQ files #3801

gallardoalba opened this issue Jul 12, 2021 · 11 comments

Comments

@gallardoalba
Copy link
Contributor

@gallardoalba gallardoalba commented Jul 12, 2021

A user reported an error while running Meryl (original post). The error seems to be related to compressed FASTQ files, however, according to the developer, Meryl should be able to process such files. When using compressed files, Meryl generates count files, but those seem to be corrupted.

The decompressed test worked fine.

@bernt-matthias
Copy link
Contributor

@bernt-matthias bernt-matthias commented Jul 13, 2021

Hi @gallardoalba the datasets in the first two histories are not accessible.

Loading

@gallardoalba
Copy link
Contributor Author

@gallardoalba gallardoalba commented Jul 13, 2021

Hi @gallardoalba the datasets in the first two histories are not accessible.

Hi, thanks. When I try to share the history, it triggers this error, but not sure what is the reason.

Screenshot from 2021-07-13 13-45-13

Loading

@bernt-matthias
Copy link
Contributor

@bernt-matthias bernt-matthias commented Jul 13, 2021

@bgruening can you help here?

Loading

@bgruening
Copy link
Member

@bgruening bgruening commented Jul 13, 2021

The fastq dataset is only shared with @gallardoalba. So he has not permissions to share it further.

Loading

@gallardoalba
Copy link
Contributor Author

@gallardoalba gallardoalba commented Jul 13, 2021

Thank!

Loading

@gallardoalba
Copy link
Contributor Author

@gallardoalba gallardoalba commented Jul 13, 2021

I fixed the links @bernt-matthias; I tested the problem by running Meryl in the command line, and both generated correct outputs.

Loading

@bernt-matthias
Copy link
Contributor

@bernt-matthias bernt-matthias commented Jul 13, 2021

So I can close the issue?

Loading

@gallardoalba
Copy link
Contributor Author

@gallardoalba gallardoalba commented Jul 13, 2021

So I can close the issue?

Nop, it still doesn't work in Galaxy.

Loading

@bernt-matthias
Copy link
Contributor

@bernt-matthias bernt-matthias commented Aug 2, 2021

Can you post the command line that you used? In my experiments the command line for count does not work:

meryl count k=19  /tmp/tmp5is6p6g8/files/000/dataset_1.dat  output read-db.meryl

Problem might be the .dat extension. But this should be easy to fix.

Loading

@gallardoalba
Copy link
Contributor Author

@gallardoalba gallardoalba commented Aug 26, 2021

This is the command I used:

meryl count Galaxy4-\[Illumina.fastq.gz\].fastqsanger.gz k=19 output tmp

Loading

@bernt-matthias
Copy link
Contributor

@bernt-matthias bernt-matthias commented Aug 26, 2021

@gallardoalba this has been fixed in #3876. Latest version (1.3+galaxy4) is already on eu.

Loading

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
3 participants