We release three programs: InvBFM, fixRealData, and fixSimuData.
fixRealData and fixSimuData are shell scripts for data preprocessing.
InvBFM, the method proposed in our paper, is written in Python. All three programs are packaged as executables.
To use them as tools, download them from the source code and change their permissions to executable.
For example: chmod 777 InvBFM
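The permission step can also be scripted. The following is a minimal sketch, assuming the three executables were downloaded into one directory; the helper name `make_executable` is hypothetical and not part of InvBFM. It adds execute bits to each file's existing mode instead of setting 777, since execute permission is all that is needed to run a tool.

```python
import os
import stat
from pathlib import Path

# Names of the three released programs.
TOOLS = ("InvBFM", "fixRealData", "fixSimuData")


def make_executable(directory, names=TOOLS):
    """Add execute permission (user, group, other) to each tool found in `directory`.

    Hypothetical helper for illustration; returns the names that were changed.
    """
    changed = []
    for name in names:
        path = Path(directory) / name
        if not path.is_file():
            continue  # skip tools that are not present in this directory
        mode = path.stat().st_mode
        os.chmod(path, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
        changed.append(name)
    return changed
```

Calling `make_executable(".")` from the download directory would mark whichever of the three programs are present as executable.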
The purpose of fixRealData is to detect inversions based on existing tools.
Usage:
- ./fixRealData < resultDirectory > < realBamDirectory > < realData.bam > < referenceDirectory > < referenceChrom.fa > < referenceWholeGene.fa >
For example: ./fixRealData /home/realBamResult/ /home/bam/ NA19982.chrom11.bam /home/gao/ref/hs37d5/ 11.fa hs37d5.fa
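When fixRealData is driven from a script, it helps to assemble the six positional arguments in one place. The sketch below only builds and prints the command line; the function names are hypothetical. It assumes directory arguments should end with a slash, because every directory in the example above does and file names are passed separately. That trailing-slash requirement is an inference, not something documented.

```python
import shlex


def with_trailing_slash(directory):
    """Return `directory` ending in '/', as in the README examples (assumed convention)."""
    return directory if directory.endswith("/") else directory + "/"


def fix_real_data_command(result_dir, bam_dir, bam_name, ref_dir,
                          chrom_fa, genome_fa, exe="./fixRealData"):
    """Build the fixRealData argument list in the order given by the usage line."""
    return [
        exe,
        with_trailing_slash(result_dir),
        with_trailing_slash(bam_dir),
        bam_name,
        with_trailing_slash(ref_dir),
        chrom_fa,
        genome_fa,
    ]


cmd = fix_real_data_command(
    "/home/realBamResult", "/home/bam", "NA19982.chrom11.bam",
    "/home/gao/ref/hs37d5", "11.fa", "hs37d5.fa",
)
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` to launch the tool.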
The purpose of fixSimuData is to simulate BAM files that contain inversions.
Usage:
- ./fixSimuData < simulatedTopDirectory > < simulatedSecondaryDirectory > < simulatedReference.fa > < deviation > < resultReference.fa > < output > "--Inv < inversionNumber >-start:< firstBreakpointLeftPosition >-end:< firstBreakpointRightPosition > --size < inversionLength >" < ISPE > < ISSD > < coverage > < readLength >
For example: ./fixSimuData /home/simulateData3k/ INV3011_0.005/ hs37d5.fa 0.005 Seq.test.fa INV3011_0.005 "--Inv 61-start:45000-end:48010 --size 3011" 360 30 25X 128
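The quoted inversion argument is the easiest part of this command to get wrong. In the example above, the size 3011 equals 48010 - 45000 + 1, which suggests that the length counts both breakpoint positions. The sketch below builds the string under that assumption; `inversion_spec` is a hypothetical helper, and the inclusive-length rule is inferred from the single example rather than stated by the tool.

```python
def inversion_spec(number, start, end):
    """Format the quoted inversion argument for fixSimuData.

    Assumes the inversion length is inclusive of both breakpoints
    (end - start + 1), which matches the README example.
    """
    if end < start:
        raise ValueError("end position must not be smaller than start position")
    size = end - start + 1
    return f"--Inv {number}-start:{start}-end:{end} --size {size}"


print(inversion_spec(61, 45000, 48010))
# --Inv 61-start:45000-end:48010 --size 3011
```

Pass the returned string to fixSimuData as a single argument, so that the shell does not split it at the spaces.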
./InvBFM
Getting usage of InvBFM.
options:
./InvBFM -rt
Usage for fixing Pindel/Delly/Lumpy/Lumpyexpress results for a real BAM file.
./InvBFM -rp
Usage for getting candidate inversions for a real bam.
./InvBFM -rm
Usage for merging candidate inversions and their features from multiple real files.
./InvBFM -fm
Usage for getting features selected by InvBFM.
./InvBFM -sn
Usage for getting simulated non-inversion (NoInversion) points.
./InvBFM -sm
Usage for merging inversion/NoInversion features for simulated data.
./InvBFM -ci
Usage for calling true inversions by SVM.
- ./InvBFM -rt < resultDirectory > < pindelResultFile > < pindelPointsFile > < dellyResultFile > < dellyPointsFile > < lumpyexpressResultFile > < lumpyexpressPointsFile > < lumpyResultFile > < lumpyPointsFile >
For example: ./InvBFM -rt /home/realBamResult/ pindelResult/NA19982_pindelResult_ pindelResult/NA19982_pindelPoints_ dellyResult/NA19982_dellyResult_ dellyResult/NA19982_dellyPoints_ lumpyexpressResult/NA19982_lumpyexpressResult_ lumpyexpressResult/NA19982_lumpyexpressPoints_ lumpyResult/NA19982_lumpyResult_ lumpyResult/NA19982_lumpyPoints_
- ./InvBFM -rp < pindelPointsFile > < dellyPointsFile > < lumpyPointsFile > < lumpyexpressPointsFile > < outputFile > < mean_ISPE > < ISSD >
For example: ./InvBFM -rp NA19982_pindelPoints_INV NA19982_dellyPoints_INV NA19982_lumpyPoints_INV NA19982_lumpyexpressPoints_INV NA19982_candidatePoints_INV 320 37.5
- ./InvBFM -rm < resultDirectory > < realFeatureFiles > < outputPointsFile > < outputFeaturesFile >
For example: ./InvBFM -rm /home/realBamResult/ realFeatureFilesList candidatePoints candidateFeatures
- ./InvBFM -fm < minedPointsFile > < outputFile > < bamFile > < "chromNumber" > < mean_ISPE > < ISSD >
For example: ./InvBFM -fm points_INV features_INV sample.bam "11" 320 37.5
- ./InvBFM -sn < simulatedInversionPointsFile > < simulatedNoInversionPointsFile >
For example: ./InvBFM -sn simulatedPoints_INV simulatedPoints_NoINV
- ./InvBFM -sm < simulatedDirectory > < simulatedFeatureFiles > < outputFile >
For example: ./InvBFM -sm /home/simulatedResult/ simulateFeatureFilesList simulatedFeatures
- ./InvBFM -ci < dataDirectory > < trainDataFile > < testDataFile > < testPointsFile > < outputFile >
For example: ./InvBFM -ci /home/result/ realFeatures simulatedFeatures candidatePoints finalInversions
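Because each InvBFM mode takes a different number of positional arguments, a wrapper script can check the count before launching anything. The sketch below is hypothetical and not part of InvBFM; the argument counts in the table are taken from the usage lines above, and the function only builds the command list without executing it.

```python
# Number of positional arguments per mode, counted from the usage lines.
EXPECTED_ARGS = {
    "-rt": 9,  # result directory plus result/points files for four callers
    "-rp": 7,  # four points files, output file, mean_ISPE, ISSD
    "-rm": 4,
    "-fm": 6,
    "-sn": 2,
    "-sm": 3,
    "-ci": 5,
}


def build_invbfm_command(mode, *args, exe="./InvBFM"):
    """Return the argument list for one InvBFM mode after checking the argument count."""
    if mode not in EXPECTED_ARGS:
        raise ValueError(f"unknown mode: {mode}")
    expected = EXPECTED_ARGS[mode]
    if len(args) != expected:
        raise ValueError(f"{mode} expects {expected} arguments, got {len(args)}")
    # Convert numbers such as 320 or 37.5 to strings for the command line.
    return [exe, mode, *(str(a) for a in args)]


cmd = build_invbfm_command(
    "-fm", "points_INV", "features_INV", "sample.bam", "11", 320, 37.5
)
print(" ".join(cmd))
```

A list built this way can be handed to `subprocess.run(cmd, check=True)`, which also avoids quoting problems with the shell.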