Skip to content

mMHtyper is an end-to-end pipeline for recognizing the multi-SNP microhaplotypes in Nanopore sequencing data.

Notifications You must be signed in to change notification settings

willow2333/mMHtyper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

    mMHtyper is an end-to-end pipeline for recognizing the multi-SNP microhaplotypes in Nanopore sequencing data. If you have more than two SNPs to recognize microhaplotypes, you can choose mMHtyper not MHtyper.

mMHtyper workflow

1

Step1:

 The raw reads were filtered by NanoFilt, you can set the Q-value and the length according to your work.

Step2:

 To get the high-quality alignments, we filtered the ".bam" files by setting the MAPQ>50 and Flag =16 & 0.

Step3:

1. The filtered ".bam" files and a bed file were as the input of mMHtyper, the header of bed file as below table.

MiniHap name chromosome db SNP Position (GRCh38) REF ALT

 MiniHap name: The name of your multi-SNP microhaplotypes.

 Chromosome: The name of Chromosome.

 db SNP: The ID of SNP.

 Position (GRCh38):The positions of SNP in GRCh38.

 REF: The base of SNP in GRCh38.

 ALT: The alleles of SNP.

2. The mMHtyper can create the "Phasing.txt", the header of it as below table.

chr pos REF sample db SNP genotypes depth phase1 phase2

 chr: The name of Chromosome.

 pos: The positions of SNP in GRCh38.

 REF: The base of SNP in GRCh38.

 sample: The name of your multi-SNP microhaplotypes.

 db SNP: The ID of SNP.

  genotypes: The genotypes of SNP.

 depth: The depth of SNP, the max depth is 10000.

 phase1: The haplotype 1 of SNP.

 phase2: The haplotype 2 of SNP.

Python environment construction and required software installation

   conda create -n mMHtyper  
   conda activate mMHtyper 
   conda config --add channels bioconda 
   conda config --add channels
   conda install --yes --file requirements.txt

mMHtyper installation

   git clone https://github.com/willow2333/mMHtyper.git
   cd mMHtyper  
   python mMHtyper.py --h
   
   usage: Phasing.py [-h] [--sampledir SAMPLEDIR] [--ref REF] [--bed BED]

    optional arguments:
      -h, --help            show this help message and exit
      --sampledir SAMPLEDIR
                            The absolute path of fastq files, the fastq files need
                            be gzip and named *.fq.gz
      --ref REF             The absolute path of reference file
      --bed BED             The absolute path of bed file

Illustration

1.Test

   python mMHtyper.py  --sampledir {path} --ref hg38.fa  --bed XX.bed 

© 2023 by Yiping Hou (forensic@scu.edu.cn), Zheng Wang (wangzhengtim@scu.edu.cn), Liu Qin (ql_willow@163.com)

About

mMHtyper is an end-to-end pipeline for recognizing the multi-SNP microhaplotypes in Nanopore sequencing data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages