Content
====================================================
pip3 install -r requirements.txt
pip2 install -r requirements_python2.txt
Variant Effect Predictor (VEP) requires a few modules and programs which are currently not installed on the AWS instance. Notes on how to install VEP are here:
sudo -u root -s
cpanm File::Copy::Recursive
cpanm Bio::Root::Version
cpanm Archive::Zip
cpanm Class::HPLOO::Base
apt-get install libssl-dev (for openssl.h)
cpanm LWP::Protocol::https
apt-get install mysql-server
apt-get install libmysqlclient-dev (for mysql_config)
cpanm DBD::mysql
cd $HOME
wget 'https://github.com/Ensembl/ensembl-tools/archive/release/86.zip'
unzip 86.zip
cd $HOME/ensembl-tools-release-86/scripts/variant_effect_predictor/
perl INSTALL.pl
cp variant_effect_predictor.pl /usr/local/bin
chmod a+x /usr/local/bin/variant_effect_predictor.pl
cd $HOME/ensembl-tools-release-86/scripts/variant_effect_predictor/
cp -rn Bio /usr/local/share/perl/5.18.2/
chmod -R a+x /usr/local/share/perl/5.18.2/Bio
chmod -R a+r /usr/local/share/perl/5.18.2/Bio
head -1000 /home/data/vcf/hisat_tags_output_SRR1616919.sorted.vcf > $HOME/test.vcf
variant_effect_predictor.pl --input_file $HOME/test.vcf \
--format vcf --output_file test.output --vcf --symbol --terms SO --database \
--force_overwrite
mkdir -p /home/data/vep
cd /home/data/vep
wget ftp://ftp.ensembl.org/pub/release-86/variation/VEP/homo_sapiens_vep_86_GRCh38.tar.gz
cp homo_sapiens_vep_86_GRCh38.tar.gz /home/data/vep
mkdir -p /home/data/vep/Plugins
wget 'https://github.com/griffithlab/pVAC-Seq/archive/master.zip'
unzip master.zip
cp pVAC-Seq-master/pvacseq/VEP_plugins/Wildtype.pm /home/data/vep/Plugins
cd cd /home/data/vep/Plugins
wget https://github.com/Ensembl/VEP_plugins/archive/release/86.zip
unzip 86.zip
mv VEP_plugins-release-86/Downstream.pm .
rm -rf VEP_plugins-release-86/
rm 86.zip
nohup variant_effect_predictor.pl \
--input_file /home/data/vcf/hisat_tags_output_SRR1616919.sorted.vcf \
--format vcf \
--terms SO --offline --force_overwrite \
--plugin Wildtype --plugin Downstream \
--dir /home/data/vep --vcf --symbol \
--fork 4 \
--output_file /home/data/imm/hisat_tags_output_SRR1616919.sorted..annotated.vcf &
Download protein data and change Ensembl's fasta header to work with our tools downstream.
mkdir -p /home/data/peptides
cd /home/data/peptides
wget ftp://ftp.ensembl.org/pub/release-86/fasta/homo_sapiens/pep/Homo_sapiens.GRCh38.pep.all.fa.gz
perl ./Cancer_Epitopes_CSHL/src/fix_headers.pl Homo_sapiens.GRCh38.pep.all.fa.gz > Homo_sapiens.GRCh38.pep.all.fixheader.fa