Skip to content

Useful NGS utilities for everyday use in the form of stand-alone tool.

License

Notifications You must be signed in to change notification settings

manojbhosale/ngsUtils

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MAutomaton

This is an attempt to tackle the daily tasks in NGS data analysis. This tool is written in Java Swing and has below features.

  1. VCF comparison utility: To compare SNP, Indels, CNVs in VCF_4.2 format.
  2. Duplicate metrics utility: To calculate %duplicates from a BAM file.
  3. VCF Ts/Tv util: To calculate transition and transversion ratio from given VCF file
  4. Comparison of QC metrics in the form of .properties file.
  5. Intersect a VCF file(Indexed) with a BED file.
  6. Interset 2 BED files to get intersecting intervals.

How to use

1. VCF comparator

Prerequisite: Java 1.8 or higher

Steps

  1. Right click on the jar file and open with java
  2. In "VCF comparator" tab select the VCF 1, VCF 2 files to compare.
  3. Click "Compare"
  4. Based on size of the VCF files it will take time to compare the VCF. in the mean time the UI will freeze as its a single threaded application.
  5. After comparison is done the UI will update with the comparison results as displayed in below screenshot. The red colored cells indicate non-zero mismatches

alt text

  1. If you are interested to find out the actual mismatching and common variants in VCF format visit "C:\gatKAutomator\comparisonResults" directory of your computer, not to mentione you need to have read/write access to this location.The result VCF are named in following way,
  • Common variants in both vcf: <VCF 1>_common.vcf
  • Insertion variants only in VCF 1: <VCF 1>_insOld.vcf
  • Insertion variants only in VCF 2: <VCF 2>_insNew.vcf
  • Deletion variants only in VCF 1: <VCF 1>_delOld.vcf
  • Deletion variants only in VCF 2: <VCF 2>_delNew.vcf
  • SNP variants only in VCF 1: <VCF 1>_snpOld.vcf
  • SNP variants only in VCF 2: <VCF 2>_snpNew.vcf
  1. If you want ot compare another pair then just select those files and hit "Compare". The results will get appended in the table and the VCFs will get generated at the designated location.

  2. If you have miltiple files to compare then use "Multiple files" option. the input is the tab separated mapping file of VCFs to caompare like below. create comparison.txt with below content. File paths need to be absolute paths of the VCFs.

c:\one.vcf c:\two.vcf

c:\a.vcf c:\b.vcf

c:\x.vcf c:\y.vcf