Ambiguous reads should be filtered by default #4

ressy · 2018-04-23T20:44:28Z

Currently sequences including N for base calls (or anything outside of [ACTG]) are not handled specially, so they may appear in final output including in purported alleles. By default these sequences should marked in the output of analyze_sample and filtered in the output of summarize_sample to avoid this.

The text was updated successfully, but these errors were encountered:

Add columns named "Ambiguous" to sample and sample summary data frames to track sequences with non-ACTG characters and track filtering of these sequences, respectively.

ressy added this to the Version 0.2.0 milestone Apr 23, 2018

ressy added a commit that referenced this issue Apr 24, 2018

Fix #4: track and filter ambiguous sequences

936bbef

Add columns named "Ambiguous" to sample and sample summary data frames to track sequences with non-ACTG characters and track filtering of these sequences, respectively.

ressy closed this as completed Apr 24, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ambiguous reads should be filtered by default #4

Ambiguous reads should be filtered by default #4

ressy commented Apr 23, 2018

Ambiguous reads should be filtered by default #4

Ambiguous reads should be filtered by default #4

Comments

ressy commented Apr 23, 2018