Skip to content

yjthu/obaDIA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

obaDIA

obaDIA: one-step biological analysis pipeline for data-independent acquisition and other quantitative proteomics data

obaDIA takes a FASTA fromat protein sequence file and a fragment-level, peptide-level or protein-level abundance matrix file from data-independent acquisition (DIA) mass spectrometry experiment, and performs differential protein expression analysis, functional annotation and enrichment analysis in a completely automated way. obaDIA was designed for DIA data initially, it can also be applied to protein-level data produced by other quantitative proteomic techniques, such as DDA, TMT/iTRAQ, Label-free proteomics. obaDIA is easy to use and runs fast. All source codes and example data of obaDIA are distributed for academic use only. For any other use, including any commercial use, please contact us first (info@e-omics.com).

Contents

Installation

1. Download

git clone https://github.com/yjthu/obaDIA.git

2. Install

Installation of obaDIA is quite easy, just like this:

cd obaDIA
bash INSTALL.sh

Alternatively, if you can not get an academic license for signalP, install with -n parameter:

cd obaDIA
bash INSTALL.sh -n

3. Install database

Download and build database:

cd db
bash build_db.sh

This step may take several minutes to hours, depends on your Internet speed.

Then, download species data from Kobas 3.0 database (ftp://ftp.cbi.pku.edu.cn/pub/KOBAS_3.0_DOWNLOAD), both the files from seq_pep and sqlite3 directories are needed. Abbreviation of species refers to db/species_abbr.txt.

Finally, set the home directory of Kobas 3.0 database in env/.kobasrc, and copy this file to your home dirctory, like this:

cp env/.kobasrc ~

4. Install dependencies

We have built a docker image with all dependencies installed for obaDIA, if you are a docker user, just get the image using the following command:

docker pull yjcbscau/eomics-base:latest

Since an academic license is required for download and installation of signalP 4.1, you need to install it independently and set the enviroment variable correctly in env/.bashrc file. This step can be skipped when you build obaDIA with bash INSTALL.sh -n command.

Alternatively, if you want to install all dependencies from the beginning, please follow these steps:

4.1. Install conda and activate a new conda environment

To download and install a version of minicoda from https://docs.conda.io/en/latest/miniconda.html, then add bioconda channels, creat a new python2 evironment and activate it.

conda create -n obadia python=2
. activate obadia

4.2. Install R packages

conda install R==3.6 r-xml

Then, to enter R environment and install required packages

# Install R packages
# 1. from CRAN
req.pcg <- function(pcg){
    new <- pcg[!(pcg %in% installed.packages()[, "Package"])]
    if (length(new)) install.packages(new, dependencies = T)
    sapply(pcg, require, ch = T)
  }
all.pcg <- c("reshape2", "ggplot2", "corrplot", "pheatmap", "optparse", "PerformanceAnalytics")
req.pcg(all.pcg)

# 2. from bioconductor
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("GO.db")
BiocManager::install("goseq")
BiocManager::install("goTools")
BiocManager::install("topGO")
BiocManager::install("Rgraphviz")

4.3. Install Kobas 3.0

conda config --add channels bioconda
conda install kobas

4.4. Install perl5 and python2 libraries

cpanm GO::Parser DBI DBD::SQLite
pip install eventlet bs4 django==1.8.3

4.5. Check the build-in dependencies

Binary executable program of hmmpress, hmmscan, diamond, mapDIA and a slightly modified version of Trinotate-v3.1.0 software (named Trinotate-v3.1.0-pro) have already been included into the src folder along with the obaDIA release, and to make sure that they can work correctly in your system. Otherwise, to reinstall them with conda or source.

Finally, copy the enviroment variable for obaDIA in env/.bashrc file to your home ~/.bashrc file

Quick start

1. From local

We provide four example scripts and a set of test data in example directory. You can use the example script example_fast.sh for a quick test of obaDIA, it shoud be finished within three minutes. The basic command line is like this:

cd example
nohup sh example_fast.sh &

2. From docker

If you use docker to run obaDIA, you should creat a docker container first. The command line is like this:

# 1. set directories
workdir=/storage/data/PROJECT/biouser1/TestDocker
obadir=/storage/data/PROJECT/biouser1/TestDocker/obaDIA
kobasdbdir=/storage/data/PUBLIC/databases/KOBAS_3.0_db
signalpdir=/storage/data/PUBLIC/softwares/SignalP/signalp-4.1

# 2. creat a docker container named obadia
docker run -v $workdir:$workdir -v $obadir:$obadir -v $obadir/env:/root -v $kobasdbdir:$kobasdbdir -v $signalpdir:$signalpdir --name obadia yjcbscau/eomics-base /bin/bash
docker run -it -w $workdir --privileged=true --volumes-from obadia yjcbscau/eomics-base /bin/bash

# 3. run script in the container
cd example
nohup sh example_fast.sh &

If you build obaDIA with bash INSTALL.sh -n command, signalpdir and -v $signalpdir:$signalpdir is no more needed. Then, the command line is like this:

# 1. set directories
workdir=/storage/data/PROJECT/biouser1/TestDocker
obadir=/storage/data/PROJECT/biouser1/TestDocker/obaDIA
kobasdbdir=/storage/data/PUBLIC/databases/KOBAS_3.0_db

# 2. creat a docker container named obadia
docker run -v $workdir:$workdir -v $obadir:$obadir -v $obadir/env:/root -v $kobasdbdir:$kobasdbdir --name obadia yjcbscau/eomics-base /bin/bash
docker run -it -w $workdir --privileged=true --volumes-from obadia yjcbscau/eomics-base /bin/bash

# 3. run script in the container
cd example
nohup sh example_fast.sh &

If you want to run script outside a docker container, the --env parameter needs to be set. Then, the command line is like this:

# 1. set directories
workdir=/storage/data/PROJECT/biouser1/TestDocker
obadir=/storage/data/PROJECT/biouser1/TestDocker/obaDIA
kobasdbdir=/storage/data/PUBLIC/databases/KOBAS_3.0_db
signalpdir=/storage/data/PUBLIC/softwares/SignalP/signalp-4.1

# 2. run script outside a temporary docker container
docker run --rm --env PATH=$obadir/src:$obadir/src/mapDIA:$obadir/src/Trinotate-v3.1.0-pro:$signalpdir:$PATH --env PERL5LIB=$signalpdir/lib/:$PERL5LIB -w $workdir -v $workdir:$workdir -v $obadir:$obadir -v $obadir/env:/root -v $kobasdbdir:$kobasdbdir -v $signalpdir:$signalpdir --privileged=true yjcbscau/eomics-base nohup sh example_fast.sh &

For your own data, please copy the example script to your work directory and edit it referring to the Usage.

Usage

1. Input files

A protein sequence file in FASTA fromat and an abundance matrix file in TSV format are needed. These files can be generated by upstream proteins discovery and abundance estimation softwares for DIA or other quantitative proteomics experiment data.

For DIA data, three types of abundance matrix file are accepted, including fragment-level, peptide-level and protein-level. For data generated by other techniques, such as DDA, only protein-level abundance matrix is accepted.

There are four example files in example directory, they are:

seq.fa			# protein sequence file in FASTA format. The protein ID must match the abundance file, uniprot ID is prefered.
ab.tsv			# protein-level abundance matrix file in TSV format. The first column is the protein id, the other columns are the abundance for each sample.
pep.ab.tsv		# peptide-level abundance matrix file in TSV format. The first column is the protein id, the second columns is the peptide sequence, the other columns are the abundance for each sample.
frag.ab.tsv		# fragment-level abundance matrix file in TSV format. The first column is the protein id, the second columns is the peptide sequence, the third columns is the fragment information, the other columns are the abundance for each sample.

2. Edit one of the example scripts

We provide four example script files in example directory, they are:

example_prot.sh		# A standard script to run obaDIA using a protein-level matrix as input.
example_pep.sh		# A standard script to run obaDIA using a peptide-level matrix as input.
example_frag.sh		# A standard script to run obaDIA using a fragment-level matrix as input.
example_fast.sh		# Another script to run obaDIA using a protein-level matrix as input, which is the fast mode skipping several time-consuming steps.

What you need to do is copy one of these files to your work dirctory and edit it according to your own needs. Now, to take the example_prot.sh file as an example, and explain how the script works:

# get the current work directory
dir=`pwd`

# set the absolute directory of input and output files
fa=$dir/seq.fa
ab=$dir/ab.tsv
out=$dir/outdir

# run the main program of obaDIA to generate a workflow
# the parameters will be explaind in detail in the next section 
perl ../bin/oba.pl \
-fas $fa \
-exp $ab \
-out $out \
-group A1/A2/A3,B1/B2/B3,C1/C2/C3,D1/D2/D3,E1/E2/E3,F1/F2/F3 \
-name A,B,C,D,E,F \
-comp 2:1,3:1,4:1,5:1,6:1 \
-fc 0.5 \
-fdr 0.2 \
-spe hsa \
-thread 40

# then, an all-in-one workflow script named 'OneStep.sh' will be generated in the output dirctory
# run this workflow
sh $out/OneStep.sh >$out/OneStep.sh.o 2>&1

3. Run the modified script

Once the modification of example_prot.sh is completed, you can run it locally or submit it to the cluster

nohup sh example_prot.sh &

Galaxy GUI

We developed a GUI for Galaxy users, which looks like this: image

All parameters in Galaxy GUI are exactly the same as the command line tool, except for output file directory, which requires a relative directory. To use this GUI, you just need to edit the path in oba.xml file and add a section to the tool_conf.xml file in Galaxy, which looks like this:

 <section name="myTools" id="mTools">
    <tool file="obaDIA/oba.xml" />
    <tool file="obaDIA/toolExample.xml" />
 </section>

If all the dependencies of obaDIA have been installed under the Galaxy enviroment, you can run workflow through Galaxy GUI directly. Otherwise, scripts for the whole workflow can still be generated, and then run it in any way you're used to.

Parameters

The parameters and description of obaDIA can be obtained through the command:

perl ../bin/oba.pl

All parameters and short descriptions will be shown like this:

Required:
		-fas            <s> : protein squence file, fasta format.
		-exp            <s> : protein abundance file, tsv format.
								protein-level, peptide-level or fragment-level file is acceptable
		-out            <s> : output directory. [An absolute path is required!]
		-group          <s> : group that the samples belong to. [eg: s1/s2/s3,s4/s5/s6]
								groups should be seperated by ',',
								samples within a group should be seperated by '/'.
		-name           <s> : the name of each group. [eg: group1,group2]
								should be seperated by ',' and ordered the same as '-group'.
		-spe            <s> : KOBAS3.0 KEGG enrichment species abbr. [refer to "db/species_abbr.txt" file]
								dowload KOBAS database from 'ftp://ftp.cbi.pku.edu.cn/pub/KOBAS_3.0_DOWNLOAD'
Optional:
		-fc             <f> : log2(foldChange) cutoff for DE proteins. [default: 1]
		-fdr            <f> : FDR cutoff for DE proteins by mapDIA. [default: 0.1]
		-comp           <s> : how to compare the groups, use indexs of the group, [default: '1:2']
								required for multiple comparisons
		-alt            <s> : KOBAS3.0 GO/Rectome enrichment species abbr. [default: the same as '-spe']
		-mod            <s> : background choose for enrichment, total(T) or expressed(E). [default: E]
		-thread         <s> : thread number for hmmscan to perform Pfam annotation. [default: 1]
		-level          <s> : level of abundance matrix. Choice are prot/pep/frag [default: prot]
								required for peptide-level or fragment-level abundance
		-fast               :  fast mode to run the pipeline. a part of time-consuming analysis will be skipped.
		-h|?|help           :  Show this help

Each parameter is explained in detail as follows:

-fas

This parameter is used to set the input protein sequence file. Standard FASTA format file is only accepted-the first line is the protein identifier starting with > and the second line is the protein sequence that can also be broken into multiple lines. The protein identifiers must consist of letters and numbers, and no blank or special characters are allowed, except for _. For the the protein identifier uniprot ID is prefered because we need to link it to the uniprotKB database, but this is not necessary.

-exp

This parameter is used to set the input abundance matrix file. Standard TSV format file is only accepted, which means each column is seperated by tab. For DIA data, protein-level, peptide-level or fragment-level matrix is acceptable, if using protein-level file as input, no need to set the -level parameter; if using peptide-level or fragment-level matrix as input, the -level parameter must be set to either pep or frag.

-level

This parameter is used to set the level or type of input abundance matrix file. The default value is prot; if using peptide-level or fragment-level matrix as input, it must be set to either pep or frag.

-out

This parameter is used to set the output directory, it must be an absolute path since a reletive path is not allowed.

-group

This parameter is used to assign the samples to the groups and set the order of groups. Using sample names rather than group names to set it. For samples in the same group, the sample names should be seperated by /. For different groups, the delimiter between samples should be ,. The sample names must be found in the first row of the abundance matrix and consist of letters and numbers, and no blank or special characters are allowed except for _.

-name

This parameter is used to set the group names, according to the order of -group parameter.

-comp

This parameter is used to set the comparison between groups for DE analysis. For only one comparison, the default value is set to 1:2; for multiple comparisons, it must be set according to the index of the order of -name parameter. For example, -comp 2:1,3:1,4:1,5:1,6:1, indicating every group shoud be compared to the first group that is designed as control.

-fc

This parameter is used to set the log2(foldChange) cutoff for DE proteins. The default value is 1.

-fdr

This parameter is used to set the FDR(false discovery rate) cutoff generated by mapDIA for DE proteins. The default value is 0.1.

-spe

This parameter is used to set the abbreviation of species for KEGG enrichment analysis. At the present release of KOBAS 3.0, more than 4000 species are supported which can refer to the "db/species_abbr.txt" file; for species not in this list, a relative or model species should be selected.

-alt

This parameter is used to set the abbreviation of species for GO/Rectome enrichment analysis. At the present release of KOBAS 3.0, dozens of species are supported for Rectome enrichment analysis; default value is the same as -spe, but for a species missing the Rectome annotation, a relative or model species must be set by this parameter.

-mod

This parameter is used to set the backgroud for enrichment analysis. We proposed a new enrichment strategy in obaDIA, using the expressed protein as backgroud for the enrichment analysis, which is the default choice. If you want to perform enrichment analysis by means of a traditional manner, which uses the total proteins in a species as backgroud, please set this parameter as -mod T.

-thread

This parameter is used to set the thread number for hmmscan software in annotation step. Because Pfam annotation is time-consuming, using multiple threads could dramatically accelerate the annotation step.

-fast

This parameter is used to skip several time-consuming steps in obaDIA. No value is needed. It can make the total time-consuming of obaDIA pipeline shorten from hours to minutes.

Output files

The output files could be slightly different based on the parameters. A typical output files dirctory is as follow, and the main result files are given descriptive explanation:

outdir/                                               
├── Anno                                               # Functional annotation output directory
│   ├── Annotation.txt
│   ├── Annotation.xls                                 # formatted annotation summay table
│   ├── blastp.outfmt6
│   ├── eggNOG.anno
│   ├── eggNOG.anno.plotdata
│   ├── eggNOG.anno.plotdata.bar.pdf                   # eggNOG annotation bar plot
│   ├── eggNOG.anno.plotdata.bar.png
│   ├── GO.anno
│   ├── GO.anno.output
│   │   ├── GO_classification_bar.pdf                  # GO annotation bar plot
│   │   ├── GO_classification_bar.png
│   │   ├── GO_classification_count.txt
│   │   ├── GO_classification.R
│   │   └── GO_classification.xls                      # GO annotation detail
│   ├── Kegg.anno
│   ├── Kegg.anno.output
│   │   ├── KEGG_classification_count.txt
│   │   ├── KEGG_classification.pdf                    # KEGG annotation bar chart
│   │   ├── KEGG_classification.png
│   │   ├── KEGG_classification.R
│   │   └── KEGG_classification.xls                    # KEGG annotation detail
│   ├── Kegg.anno.plotdata
│   ├── PFAM.anno
│   ├── pfam.log
│   ├── seq.fa.gene_trans_map
│   ├── signalp.out
│   ├── trinotate_annotation_report.xls
│   └── TrinotatePFAM.out
├── Diff                                               # Abundance and Differntial expression(DE) analysis directory
│   ├── compare.txt
│   ├── diff_results                                   # DE analysis output directory
│   │   ├── BvsA                                       # One directory for each comparison 
│   │   │   ├── BvsA.diff.id
│   │   │   ├── BvsA.diff.seq
│   │   │   ├── BvsA.diff.updown
│   │   │   ├── BvsA.diff.xls                          # formatted differntial expression information table 
│   │   │   ├── BvsA.diff.xls.volcano.pdf              # volcano plot
│   │   │   └── BvsA.diff.xls.volcano.png
│   │   ├── CvsA
│   │   │   ├── ...
│   │   ├── DvsA
│   │   │   ├── ...
│   │   ├── EvsA
│   │   │   ├── ...
│   │   └── FvsA
│   │       ├── ...
│   ├── group.txt
│   ├── mapDIA                                         # raw DE analysis output directory
│   │   ├── analysis_output.txt
│   │   ├── analysis_output_wide_format.txt
│   │   ├── fragment_selection.txt
│   │   ├── log2_data.txt
│   │   ├── mapDIA.conf
│   │   └── param.txt
│   ├── mapDIA.input
│   └── tableProcess                                   # Abundance analysis output directory
│       ├── DIA.count.xls                              # peak number statistic for each sample
│       ├── DIA.means.xls                              # mean abundance table
│       ├── DIA.means.xls.abHeatmap.pdf                # mean abundance heatmap
│       ├── DIA.means.xls.abHeatmap.png
│       ├── DIA.means.xls.boxplot.pdf                  # mean abundance box plot
│       ├── DIA.means.xls.boxplot.png
│       ├── DIA.means.xls.corHeatmap.pdf               # mean correlation heatmap
│       ├── DIA.means.xls.corHeatmap.png
│       ├── DIA.means.xls.corMatrix.pdf                # mean correlation-matrix plot
│       ├── DIA.means.xls.corMatrix.png
│       ├── DIA.means.xls.density.pdf                  # mean abundance density plot
│       ├── DIA.means.xls.density.png
│       ├── DIA.means.xls.violon.pdf                   # mean abundance violin plot
│       ├── DIA.means.xls.violon.pdf
│       ├── DIA.normalized.xls                         # normalized abundance table
│       ├── DIA.normalized.xls.abHeatmap.pdf           # normalized abundance heatmap
│       ├── DIA.normalized.xls.abHeatmap.png
│       ├── DIA.normalized.xls.boxplot.pdf             # normalized abundance box plot		
│       ├── DIA.normalized.xls.boxplot.png
│       ├── DIA.normalized.xls.corHeatmap.pdf          # normalized correlation heatmap
│       ├── DIA.normalized.xls.corHeatmap.png
│       ├── DIA.normalized.xls.corMatrix.pdf           # normalized correlation-matrix plot
│       ├── DIA.normalized.xls.corMatrix.png
│       ├── DIA.normalized.xls.density.pdf             # normalized abundance density plot
│       ├── DIA.normalized.xls.density.png
│       ├── DIA.normalized.xls.violon.pdf              # normalized abundance violin plot
│       └── DIA.normalized.xls.violon.png
├── Enrich                                             # Functional enrichment analysis directory
│   ├── All                                            # Expressed protein functional enrichment analysis directory
│   │   ├── All.go_bar.pdf                             # GO enrichment bar plot
│   │   ├── All.go_bar.png
│   │   ├── All.hsa.annotate
│   │   ├── All.hsa.dmout
│   │   ├── All.identify.GO
│   │   ├── All.identify.GO.xls                        # formatted GO enrichment result table
│   │   ├── All.identify.Kegg
│   │   ├── All.identify.Rectome
│   │   ├── All.identify.Rectome.top20
│   │   ├── All.identify.Rectome.xls                   # formatted Rectome enrichment result table
│   │   ├── All.KeggEnrich
│   │   ├── All.KeggEnrich.html                        # KEGG enrichment html formatted report
│   │   ├── All.KeggEnrich.top20
│   │   ├── All.KeggEnrich.top20.scatterplot.pdf       # KEGG enrichment scatter plot
│   │   ├── All.KeggEnrich.top20.scatterplot.png
│   │   ├── All.KeggEnrich.xls                         # formatted KEGG enrichment result table
│   │   ├── All.Rectome.Enrich.scatterplot.pdf         # Rectome enrichment scatter plot
│   │   ├── All.Rectome.Enrich.scatterplot.png
│   │   ├── go.resource
│   │   ├── seq.len
│   │   └── src
│   ├── BvsA                                           # DE enrichment analysis directory, one directory for each comparison
│   │   ├── BvsA.annomap
│   │   ├── BvsA.go_bar.pdf                            # GO enrichment bar plot
│   │   ├── BvsA.go_bar.png
│   │   ├── BvsA.GO_BP_DAG.pdf                         # GO enrichment DAG plots (BP)
│   │   ├── BvsA.GO_BP_DAG.png
│   │   ├── BvsA.GO_CC_DAG.pdf                         # GO enrichment DAG plots (CC)
│   │   ├── BvsA.GO_CC_DAG.png
│   │   ├── BvsA.GO_MF_DAG.pdf                         # GO enrichment DAG plots (MF)
│   │   ├── BvsA.GO_MF_DAG.png
│   │   ├── BvsA.hsa.annotate
│   │   ├── BvsA.hsa.dmout
│   │   ├── BvsA.identify.GO
│   │   ├── BvsA.identify.GO.xls                       # formatted GO enrichment result table
│   │   ├── BvsA.identify.Kegg
│   │   ├── BvsA.identify.pfam
│   │   ├── BvsA.identify.pfam.xls                     # formatted PFAM enrichment result table
│   │   ├── BvsA.identify.pfam.xls.top20
│   │   ├── BvsA.identify.Rectome
│   │   ├── BvsA.identify.Rectome.top20
│   │   ├── BvsA.identify.Rectome.xls                  # formatted Rectome enrichment result table
│   │   ├── BvsA.KeggEnrich
│   │   ├── BvsA.KeggEnrich.html
│   │   ├── BvsA.KeggEnrich.top20
│   │   ├── BvsA.KeggEnrich.top20.scatterplot.pdf      # KEGG enrichment scatter plot
│   │   ├── BvsA.KeggEnrich.top20.scatterplot.png
│   │   ├── BvsA.KeggEnrich.xls                        # formatted KEGG enrichment result table
│   │   ├── BvsA.PFAM.Enrich.scatterplot.pdf           # PFAM enrichment scatter plot
│   │   ├── BvsA.PFAM.Enrich.scatterplot.png
│   │   ├── BvsA.Rectome.Enrich.scatterplot.pdf        # Rectome enrichment scatter plot
│   │   ├── BvsA.Rectome.Enrich.scatterplot.png
│   │   ├── gene2pfam_formated.txt
│   │   ├── go.resource
│   │   └── src
│   ├── CvsA
│   │   ├── ...
│   ├── DvsA
│   │   ├── ...
│   ├── EvsA
│   │   ├── ...
│   ├── FvsA
│   │   ├── ...
└── OneStep.sh                                         # all-in-one workflow script

Modular version

A modular version of obaDIA is provided in a recent release, with each functional module can be run independently, and facilitate integration into personalized workflows. Its usage is quite similar to the one-step version. Its parameters and description can be obtained through the command:

perl ../bin/oba_module.pl

All parameters and short descriptions will be shown like this:

Options

        Common:
                -m              <s> : choose the module you want to run [diff/anno/enrich]
                -o              <s> : output directory
                -h|?|help           :  Show this help

        Diff:
                -expr           <s> : Abundance matrix, can be protein-level, peptide-level or fragment-level
                -group          <s> : group that the samples belong, e.g. sample1/sample2,sample3
                -groupname      <s> : group names, default group1,group2,...
                -compare        <s> : how to compare the groups, e.g. 1:2,1:3; Please input in order of trait:control 1vs2
                -fc_cutoff      <f> : default: 1
                -fdr_cutoff 	<f> : default: 0.1
                -level          <s> : level of abundance matrix. Choice are prot/pep/frag [default: prot]

        Anno:
                -fa             <s> : protein squence file, fasta format.

        Enrich:
                -name           <s> : sample name for output files
                -species        <s> : species for kegg or background annotate file
                -fa             <s> : DE protein sequence fasta
  1. Use the ABDE module:
perl ../bin/oba_module.pl \
-m diff \
-o $PWD/diff \
-expr $PWD/ab.tsv \
-group A1/A2/A3,B1/B2/B3,C1/C2/C3,D1/D2/D3,E1/E2/E3,F1/F2/F3 \
-groupname A,B,C,D,E,F \
-compare 2:1,3:1,4:1,5:1,6:1
  1. Use the Annotation module:
perl ../bin/oba_module.pl \
-m anno \
-o $PWD/anno \
-fa $PWD/seq.fa
  1. Use the Enrichment module:
perl ../bin/oba_module.pl \
-m enrich \
-o $PWD/enrich \
-name test \
-species hsa \
-fa $PWD/seq.fa

Citation

If you use obaDIA in your work, please cite the publication as follows:

Jun Yan, Hongning Zhai, Ling Zhu, Sasha Sa, Xiaojun Ding. obaDIA: one-step biological analysis pipeline for data-independent acquisition and other quantitative proteomics data, Bioinformatics, 2020, btaa893, https://doi.org/10.1093/bioinformatics/btaa893

About

One-step biological analysis pipeline for quantitative proteomics data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published