Citation

If you use this software, please cite these papers:

Yuan S, Johnston HR, Zhang G, Li Y, Hu Y-J, Qin ZS (2015) One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies. PLoS Comput Biol 11(8): e1004448. doi:10.1371/journal.pcbi.1004448

Download citation (Right click -> Save as...):

refEditor.ris (compatible with EndNote, Reference Manager, ProCite, RefWorks)
refEditor.bib (compatible with BibDesk, LaTeX)

Shuai Y, Zhaohui Q: Read-mapping using personalized diploid reference genome for RNA sequencing data reduced bias for detecting allele-specific expression. In Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on; 4-7 Oct. 2012. 2012: 718-724.

Download citation (Right click -> Save as...):

diploid_mapping.ris (compatible with EndNote, Reference Manager, ProCite, RefWorks)
diploid_mapping.bibtex (compatible with BibDesk, LaTeX)

Contacts

Shuai Yuan, Email: [shuaiyuan.emory@gmail.com]

Downloads

Latest Version

Please use the panel on the right to download source code.

Old Versions

Not provided.

Minimum System Requirement:

CPU 2.0G Hz

Memory 8 GB

Hard disk 50 GB free space

Linux 64-bit or MacOS 64-bit, gcc, Read Mapping Tools (e.g. BWA or Bowtie).

Introduction

This software package is to improve read mapping accuracy for diploid individuals.

It has two major programs: DiploidConstructor and MappingConvertor.

DiploidConstructor converts a haploid reference genome to a diploid reference genome according to the genotypes and sequencing read length.

MappingConvertor converts the SAM file created from being mapped to a diploid reference genome to being mapped to a haploid reference genome according to a dipmap file.

DiploidConstructor should be executed before calling read mapping tools (BWA, Bowtie etc.).

MappingConvertor should be executed after calling read mapping tools.

Build & Install

Download and uncompress this software, and then:

cd refeditor-master
make

then you can copy compiled executables to anywhere in your PATH variable ( e.g ~/bin/ ).

Run

DiploidConstructor

Usage:

DiploidConstructor -r haploid.fa -g genotypes -l length -o diploid.fa

Convert haploid reference genome to diploid reference genome according to the genotypes and sequencing read length

Example:

./DiploidConstructor -r hg19.fa -g known.genotypes -l 36 -o diploid.fa

Parameters (mandatory):

-r    haploid reference genome file 

-g    genotypes file

-l    read length

-o    output diploid reference genome file

Parameters (optional):

-d    maximal deletion in a read and mapped to alternative alleles [default=0]

-s    sex of the individual [default="m"]

MappingConvertor

Usage:

MappingConvertor -i input.sam -m dipmap -o output.sam

Convert the sam file created from being mapped to a diploid reference genome to being mapped to a haploid reference genome according to a dipmap file.

Example:

./MappingConvertor -i input.sam -m hg19.di.fa.dipmap -o output.sam

Parameters (mandatory):

-i    input SAM file 

-m    dipmap file created by DiploidConstructor

-o    output SAM file

An example

Read mapping using RefEditor

Compile
```
 cd refeditor-master
 make
```

Create genotypes file for target individual

 ./vcf2genotypes test.vcf NA19238 > test.genotypes

Create customized reference genome
```
 ./DiploidConstructor -r hg19test.fa -g test.genotypes -l 36 -o hg19test.di.fa
```
That will create two files: hg19test.di.fa and hg19test.di.fa.dipmap

Map the reads against the diploid reference genome using BWA (Version: 0.7.5a-r405)

Create index for reference genome

 bwa index -a bwtsw hg19test.di.fa

Perform alignment

 bwa aln hg19test.di.fa test.fastq > test.sai
 bwa samse -n 16 hg19test.di.fa test.sai test.fastq > test.sam

Convert intermediate SAM file to final result SAM file

 ./MappingConvertor -i test.sam -m hg19test.di.fa.dipmap -o test.final.sam

Done! Now the reads in the test.fastq file are mapped successfully and stored in test.final.sam file.

Read mapping using traditional method without RefEditor

Map the reads against the haploid reference genome using BWA (Version: 0.7.5a-r405)

1.1 Create index for reference genome

bwa index -a bwtsw hg19test.fa

1.2 Make alignment

bwa aln hg19test.fa test.fastq > test.ha.sai
bwa samse -n 16 hg19test.fa test.ha.sai test.fastq > test.ha.sam

Done! Now the mapping results are stored in test.ha.sam file. By viewing this file, you will find the mapping quality is 0 for all reads. That's because traditional method does not take into account the genotypes information

Conclusion

Using RefEditor for read mapping can improve mapping accuracy and downstream analysis.

Revisions

v1.4_r24

Minor bugs fixing

v1.3_r17

Add test.vcf and vcf2genotypes

v1.2_r16

Update the example

v1.1_r13

Add files that are needed in the example Create release tags automatically

v1.0_r7

First working release

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
CLineFields.cpp		CLineFields.cpp
CLineFields.h		CLineFields.h
CLineFields1.cpp		CLineFields1.cpp
CLineFields1.h		CLineFields1.h
DiploidConstructor.cpp		DiploidConstructor.cpp
GenomeMapperToSam.cpp		GenomeMapperToSam.cpp
Makefile		Makefile
MappingConvertor.cpp		MappingConvertor.cpp
README.md		README.md
addGeneName.py		addGeneName.py
addPValue.py		addPValue.py
addRSNo.py		addRSNo.py
alnSNPsInfo.cpp		alnSNPsInfo.cpp
alterref.cpp		alterref.cpp
appendPValue.py		appendPValue.py
bioshared.py		bioshared.py
commonMutation.py		commonMutation.py
compareSamTrue.cpp		compareSamTrue.cpp
countKeyLines.py		countKeyLines.py
cut.fastq.cpp		cut.fastq.cpp
diploid_mapping.bibtex		diploid_mapping.bibtex
diploid_mapping.ris		diploid_mapping.ris
expand.genotypext.cpp		expand.genotypext.cpp
fastakit.cpp		fastakit.cpp
fastq.list		fastq.list
findColumn		findColumn
findColumnsContain		findColumnsContain
genePercent.cpp		genePercent.cpp
generateReads.cpp		generateReads.cpp
genotypesCoverage.cpp		genotypesCoverage.cpp
genotypext.cpp		genotypext.cpp
get.header		get.header
getUCSC.py		getUCSC.py
haploid2diploid.cpp		haploid2diploid.cpp
hg19test.fa		hg19test.fa
importFa2Mysql.cpp		importFa2Mysql.cpp
importGeneRef2Mysql.cpp		importGeneRef2Mysql.cpp
importPileup2Mysql.cpp		importPileup2Mysql.cpp
join.fasta.hapmap.cpp		join.fasta.hapmap.cpp
join1file.cpp		join1file.cpp
join2files.cpp		join2files.cpp
join2sortedfiles.cpp		join2sortedfiles.cpp
join3files.cpp		join3files.cpp
join5files.cpp		join5files.cpp
lookupAround.py		lookupAround.py
lookupBack.py		lookupBack.py
maskSNP.cpp		maskSNP.cpp
matchCount.cpp		matchCount.cpp
merge2Files.cpp		merge2Files.cpp
merge2Files.py		merge2Files.py
merge2PValues.py		merge2PValues.py
mergeGenotypesFrom2Alleles.cpp		mergeGenotypesFrom2Alleles.cpp
mergeSNP.cpp		mergeSNP.cpp
mergeSNP.py		mergeSNP.py
mergeSNPOL.py		mergeSNPOL.py
only.file1.cpp		only.file1.cpp
overlap.files.cpp		overlap.files.cpp
pileup.cpp		pileup.cpp
readsAllele.cpp		readsAllele.cpp
refEditor.bib		refEditor.bib
refEditor.ris		refEditor.ris
remove.header		remove.header
rotate.cpp		rotate.cpp
rpkm.cpp		rpkm.cpp
snp_prob.cpp		snp_prob.cpp
sort.chroms.cpp		sort.chroms.cpp
test.fastq		test.fastq
test.genotypes		test.genotypes
test.vcf		test.vcf
ucsc2chr.cpp		ucsc2chr.cpp
union.files.cpp		union.files.cpp
vcf2genotypes		vcf2genotypes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Citation

Contacts

Downloads

Latest Version

Old Versions

Minimum System Requirement:

Introduction

Build & Install

Run

DiploidConstructor

MappingConvertor

An example

Read mapping using RefEditor

Read mapping using traditional method without RefEditor

Conclusion

Using RefEditor for read mapping can improve mapping accuracy and downstream analysis.

Revisions

v1.4_r24

v1.3_r17

v1.2_r16

v1.1_r13

v1.0_r7

About

Releases

Packages

Languages

superyuan/refeditor

Folders and files

Latest commit

History

Repository files navigation

Citation

Contacts

Downloads

Latest Version

Old Versions

Minimum System Requirement:

Introduction

Build & Install

Run

DiploidConstructor

MappingConvertor

An example

Read mapping using RefEditor

Read mapping using traditional method without RefEditor

Conclusion

Using RefEditor for read mapping can improve mapping accuracy and downstream analysis.

Revisions

v1.4_r24

v1.3_r17

v1.2_r16

v1.1_r13

v1.0_r7

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages