
Only parse sample names from files if given config #1390

Closed
BJWiley233 opened this issue Mar 8, 2021 · 3 comments

BJWiley233 commented Mar 8, 2021

I hope this is already possible.

I have a pipeline that gives the same sample name, "final", to most of the files I want to use for MultiQC. As a workaround, I copy the files out and prefix them with the sample name. How can I tell MultiQC to take the sample name ONLY from the file names and not parse the file contents for it? I created a config like this:

sp:
  picard/markdups:
    fn: "*.merged.NameSorted.mark_dups_metrics.txt"
  picard/alignment_metrics:
    fn: "*.AlignmentSummaryMetrics.txt"
  picard/hsmetrics:
    fn: "*.roi-HsMetrics.txt"
  picard/insertsize:
    fn: "*.InsertSizeMetrics.txt"
  fastqc:
    fn: "*_fastqc.zip"

and file names:

CHAJ-470026-281.AlignmentSummaryMetrics.txt
CHAJ-470026-281.InsertSizeMetrics.txt
CHAJ-470026-281.VerifyBamId.depthSM
CHAJ-470026-281.VerifyBamId.selfSM
CHAJ-470026-281.bam.flagstat
CHAJ-470026-281.base-clinvar-HsMetrics.txt
CHAJ-470026-281.merged.NameSorted.mark_dups_metrics.txt
CHAJ-470026-281.roi-HsMetrics.txt
CHAJ-470026-281.target-acmg_genes-HsMetrics.txt
CHAJ-470026-281_fastqc.html
CHAJ-470026-281_fastqc.zip

ewels commented Mar 8, 2021

Sorry, currently you can't, but it would be nice to have such an option.

Duplicate of #890 really (#949 is also similar), but I will keep this open as it is more obviously general and not module-specific.

BJWiley233 commented

Cool, thanks. Yes, definitely not module-specific but config.yaml-specific.


ewels commented Jul 4, 2021

Hi @BJWiley233,

I have added new functionality to take the log filename as the sample name, as a general feature for any module / file search pattern, in fa84c47. It will be included in the MultiQC v1.11 release.

The documentation for this new feature is here: https://multiqc.info/docs/#using-log-filenames-as-sample-names

I think this should solve the issue you were getting above. If you find any problems, let me know.
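
As a rough sketch of how this might look for the config above: the option name `use_filename_as_sample_name` and the `--fn_as_s_name` command-line flag are taken from the linked documentation as I understand it, so treat them as assumptions and check the docs for your MultiQC version before relying on this.

```yaml
# Sketch only: assumes MultiQC v1.11 or later and the option names
# described in the "Using log filenames as sample names" docs section.

# Option 1 (assumed): use the filename as the sample name everywhere.
# Intended to be equivalent to running `multiqc --fn_as_s_name .`
use_filename_as_sample_name: true

# Option 2 (assumed): restrict it to specific search-pattern keys,
# here the same keys used under `sp:` in the config above.
# Use this INSTEAD of Option 1, not together with it.
# use_filename_as_sample_name:
#   - picard/markdups
#   - picard/alignment_metrics
#   - picard/hsmetrics
#   - picard/insertsize
```

With a setting like this, a file such as `CHAJ-470026-281.roi-HsMetrics.txt` should be named after its filename rather than the "final" label inside the file. The usual sample-name cleaning is still expected to apply to the filename.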

Cheers,

Phil

ewels closed this as completed Jul 4, 2021