Skip to content

A repository to set up the human SNP calling pipeline

License

Notifications You must be signed in to change notification settings

Lucas04-nhr/Human-SNP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Human SNP

This is a repository for the Human SNP project.

See notes at WPS cloud document: Human SNP.

Mirrored Repositories


Dataset Location

Raw Data

data
├── 00_bwa_index                  # Index files and reference genome, known sites for the pipeline
│   ├── chm13v2                   # Reference genome and known sites for CHM13v2, deprecated
│   │   └── known_sites
│   └── GRCh38                    # Reference genome and known sites for GRCh38
│       ├── known-sites           # Known sites for GRCh38, using when performing SNP calling
│       │   ├── 1000g             # hg38_v0_1000G_phase1.snps.high_confidence.modified.hg38.vcf(.idx)
│       │   ├── dbsnp138          # hg38_v0_Homo_sapiens_assembly38.dbsnp138.modified.vcf(.idx)
│       │   ├── hapmap            # hg38_v0_hapmap_3.3.hg38.modified.vcf(.idx)
│       │   └── omni              # hg38_v0_1000G_omni2.5.hg38.modified.vcf(.idx)
│       └── ref                   # 1000G reference genome for GRCh38, using when performing genome phasing
│           └── 1000G
├── 01_align                      # Alignment results, end with *.sam
│   ├── Beijing
│   └── Guangzhou
├── 02_filter                     # Alignment results, filtered out unmapped reads, end with *.bam
│   ├── Beijing
│   └── Guangzhou
├── 03_sort                       # Alignment results, sorted by read name, end with *.bam, index files end with *.bam.bai
│   ├── Beijing
│   └── Guangzhou
├── 05_BaseRecalibrator           # Base recalibration results, end with *.recal_data.table
│   ├── Beijing
│   └── Guangzhou
├── 06_ApplyBQSR                  # BQSR results, end with *.recalibrated.bam
│   ├── Beijing
│   └── Guangzhou
├── 07_HaplotypeCaller            # HaplotypeCaller results, end with *.g.vcf.gz
│   ├── Beijing
│   └── Guangzhou
├── 08_GenotypeGVCF               # Combined gVCF files, joint_genotyped.vcf.gz
│   ├── Beijing
│   └── Guangzhou
├── 09_09_VariantRecalibrator     # VariantRecalibrator results, output{.tranches, .rscript, .model}
│   ├── Beijing
│   └── Guangzhou
├── 10_ApplyVQSR                  # VQSR results, joint_genotyped.filtered.vcf.gz
│   ├── Beijing
│   └── Guangzhou
├── skin_metagenome               # Skin metagenome original data, end with *.fastq.gz
├── skin_microbiome_2022          # Compressed skin metagenome original data, end with *.fastq.gz
└── test                          # Simlink files to sample BJ001, used for testing the pipeline

Pipeline Scripts

Human-SNP
├── filter                        # Filter out unmapped reads
│   ├── filter
│   └── mapq_analysis
├── gatk                          # GATK pipeline, used for SNP calling
│   ├── single                    # Single calling
│   ├── joint                     # Joint calling
│   └── prepare                   # Prepare scripts for GATK pipeline
├── manta                         # Manta pipeline, used for SV calling, deprecated
│   ├── 00_convert_format
│   ├── 01_configurate
│   └── 01_execute
├── old                           # Old scripts, deprecated
│   ├── BQSR
│   ├── bwa
│   ├── env
│   ├── files
│   ├── other
│   └── samtools
└── prepare                       # Preparation of SNP calling pipeline

About

A repository to set up the human SNP calling pipeline

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published