The command corrects a DNA sequencing error of cell barcoding containing RT primer sets for Quartz-Seq2 which one of a high throughput single-cell RNA-sequencing . The cell barcodes of Quartz-Seq2 are designed such that the minimum Sequence–Levenshtein distance between two sequences should be greater than 5, which leads to the correction of two nucleotides of mismatch, insertion, and deletion in sequence reads . The program can correct cell barcode if sequence error occurs.
- Sasagawa Y et al. Quartz-Seq2: a high-throughput single-cell RNA-sequencing method that effectively uses limited sequence reads
- Buschmann T. DNABarcodes: an R package for the systematic construction of DNA sample tags. Bioinformatics. 2017;:btw759.
- Python 2.7
correct_barcode.py Usage: correct_barcode.py (-i input_file) (-b barcode_file) (-o output_file) (-m metric) (-d distance) correct_barcode.py -h | --help correct_barcode.py -v | --version Options: -i input_file Bam file resulted of TagReadWithGeneExon -b barcode_file Single-column file for designed barcode (No header) -o output_file Bam file name for output of this program -m metric The distance metric to be used. (seqlev or hamming) -d distance Distance for error correction -h --help Show this screen -v --version Show version