-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adjustments for #76 , rename certain options to be more explicit #81
Conversation
docs/usage.md
Outdated
|
||
This can be used to only keep mapped reads for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing. | ||
This can be used to only keep mapped reads in the BAM file for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe change following line to: 'By default turned off, where all reads are kept in the bam file.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done!
Generally looks good, made a minor suggestion for description. only think that isn't clear to me now is what if you want to discard unmapped reads entirely? Unless I misunderstand - I currently see only two cases: 1) where mapped and unmapped reads are retained but in separate BAMs and 2) where mapped and unmapped reads are in the same BAM. I am thinking for cases where people have storage space issues and want to minimise their data to stuff of interest. (This would be another issue, unless I have misunderstood) |
I added an option that can remove the unrequired unmapped reads on user request. By default, its turned off, but of course can be turned on by users if they want to :-) Also added documentation on that @jfy133 let me know what you think .-) |
docs/usage.md
Outdated
|
||
This can be used to only keep mapped reads for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing. | ||
This can be used to only keep mapped reads in the BAM file for downstream analysis. By default turned off, where all reads are kept in the bam file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Possibly rephrase again for clarity (my bad):
Use only mapped reads in the BAM file for downstream analysis. Unmapped reads are stored in a separate BAM and FASTQ format e.g. for different downstream processing. By default turned off, where default behaviour is to keep both mapped and unmapped reads in the output BAM file.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if it makes sense to instead have a independent flag to save unmapped reads in FASTQ rather than BAM format. In this case we will be make redundancy.
docs/usage.md
Outdated
|
||
### `--bam_discard_unmapped_entirely` | ||
|
||
This discards all unmapped and extracted reads entirely. By default, this is turned off. | ||
|
||
### `--bam_keep_all` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the difference of this flag verses --bam_retain_all_reads
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
--bam_discard_unmapped_entirely
Removes the unmapped reads file only, the BAM file contains only mapped reads and unmapped reads are entirely discarded (no fastq/bam at all present).
I guess I'll have to give this another proper thought. Right now bam_retain_all_reads just is the thing to turn on BAM filtering in general.
This renames a couple of things to different names, to make sure people understand what these options mean.
cf #76
Namely:
bam_keep_mapped_only ==> bam_analyse_mapped_only
bam_filter_reads ==> bam_retain_all_reads
nextflow run . -profile test,docker
).nf-core lint .
).docs
is updatedCHANGELOG.md
is updatedLearn more about contributing: https://github.com/nf-core/eager/tree/master/.github/CONTRIBUTING.md