You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The sequence in this case is 29 bases so I think the quality is being output incorrectly (it has the extra 2 on the end that isn't in the input). Here is the corresponding read fastq:
Mapping just this read works so there might be some buffer overflow or other issue on the full set. This read set is available from the NA12878 consortium: http://s3.amazonaws.com/nanopore-human-wgs/rel3-nanopore-wgs-152889212-FAB45271.fastq.gz mapped to the 1000 genome reference: ftp://ftp.ncbi.nih.gov/1000genomes/ftp/technical/reference/GRCh38_reference_genome/. So far I've only seen this on unmapped reads.
The text was updated successfully, but these errors were encountered:
I have a script to check inconsistent SAM (e.g. cigar length inconsistent with sequence length, etc). However, the first step of the script is to skip unmapped reads. It failed to catch this bug. Thanks again. It is really helpful to have someone to catch bugs before they get wide spread.
In some sam outputs, unaligned read sequence and quality doesn't match in the output sam file like this example:
The sequence in this case is 29 bases so I think the quality is being output incorrectly (it has the extra 2 on the end that isn't in the input). Here is the corresponding read fastq:
running minimap2 as:
Mapping just this read works so there might be some buffer overflow or other issue on the full set. This read set is available from the NA12878 consortium: http://s3.amazonaws.com/nanopore-human-wgs/rel3-nanopore-wgs-152889212-FAB45271.fastq.gz mapped to the 1000 genome reference: ftp://ftp.ncbi.nih.gov/1000genomes/ftp/technical/reference/GRCh38_reference_genome/. So far I've only seen this on unmapped reads.
The text was updated successfully, but these errors were encountered: