-
Notifications
You must be signed in to change notification settings - Fork 582
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FastQC: add top overrepresented sequences table #2075
Conversation
🚀 Deployed on https://mqc-pr-2075--multiqc.netlify.app |
How about having a third column that shows the read count as a percentage of the total read count across all samples? |
@multiqc-bot changelog |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great!
@multiqc-bot fix linting |
Could be nice, but might be approaching overkill a little. I think I'm happy to merge as-is for now, can wait to see if we get any feedback about this and always add it at a later date. |
@multiqc-bot fix linting |
Added a commit to fix https://github.com/ewels/MultiQC/pull/2082/files#r1340743794 |
* master: Just run CI on the oldest + newest supported Python versions (MultiQC#2074) Picard: fix parsing mixed strings/numbers, account for trailing tab (MultiQC#2083) FastQC: add top overrepresented sequences table (MultiQC#2075) Add GitHub Actions bot workflow to fix code linting from a PR comment (MultiQC#2082) Use custom exception type instead of `UserWarning` when no samples are found. (MultiQC#2049) Lint modules for missing `self.add_software_version` (MultiQC#2081) Changelog bot: Update docs (MultiQC#2077) Changelog action: remove `.capitalize()`, add changelog entry (MultiQC#2080) Add action to populate the change log from PR titles triggered by `@multiqc-bot changelog` (MultiQC#2025) # Conflicts: # CHANGELOG.md # multiqc/modules/ngsderive/ngsderive.py
Fix #926
Add a table into the FastQC module showing the most common overrepresented sequences across all samples:
By default, it shows top 20 sequences occurring in the most number of samples. The number can be customised:
It can also be customised to rank sequences by the total number of occurrences instead of the number of samples: