# fungal_Effectors
Predicting fungal effectors is a key step in understanding host–pathogen interactions, virulence mechanisms, and the molecular strategies fungi use to colonize plant tissues. EffectorP is one of the most widely used machine-learning tools for the identification of fungal effector proteins. It analyzes secreted protein sequences and classifies them based on learned features characteristic of known fungal effectors.

# Tool Installation
First, change the runtime type: open the Runtime menu, select "Change runtime type", and choose GPU.

We'll install the necessary tools, including the source code for EffectorP and Weka, which EffectorP3 relies on as a machine learning dependency. You can access these at:

- [EffectorP Source](https://effectorp.csiro.au/)
- [Weka (Waikato Environment for Knowledge Analysis)](https://ml.cms.waikato.ac.nz/weka/), an open-source machine learning suite developed by the University of Waikato in New Zealand.

# Detection of Fungal Effectors

This notebook guides you through using EffectorP3, a machine learning tool specifically designed to identify effectors in fungi and oomycetes. Prior to running EffectorP3, it’s essential to pre-filter the input protein sequences, focusing on those likely to be extracellular. Ensuring high confidence in secreted protein predictions enhances the accuracy and relevance of EffectorP3’s results.

In [None]:
# Use wget to download the ZIP archive of the repository
!wget https://github.com/JanaSperschneider/EffectorP-3.0/archive/refs/heads/main.zip

# Unzip the downloaded file
!unzip main.zip

# Optionally, rename the folder for convenience
!mv EffectorP-3.0-main EffectorP-3.0


In [None]:
!unzip EffectorP-3.0/weka-3-8-4.zip
!mv weka-3-8-4 /content/EffectorP-3.0/

Install Conda and Seqtk

In [None]:
! wget https://repo.anaconda.com/miniconda/Miniconda3-py37_4.8.2-Linux-x86_64.sh
! chmod +x Miniconda3-py37_4.8.2-Linux-x86_64.sh
! bash ./Miniconda3-py37_4.8.2-Linux-x86_64.sh -b -f -p /usr/local
import sys
sys.path.append('/usr/local/lib/python3.7/site-packages/')

In [None]:
!conda install bioconda::seqtk -y
!conda install -c conda-forge ncbi-datasets-cli -y

In [None]:
!datasets download genome accession GCF_013085055.1 --include protein,seq-report
!unzip ncbi_dataset.zip
!mv ncbi_dataset/data/GCF_013085055.1/protein.faa ./
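
As a quick sanity check after the download, you can count the protein records in `protein.faa` before any filtering. The sketch below is a minimal, illustrative helper. The function name `count_fasta_records` is hypothetical and not part of any tool used here. It assumes a plain, uncompressed FASTA file in which every record starts with a `>` header line.

```python
from pathlib import Path


def count_fasta_records(lines):
    """Count FASTA records by counting header lines that start with '>'."""
    return sum(1 for line in lines if line.startswith(">"))


# protein.faa is the file moved into the working directory in the cell above.
fasta_path = Path("protein.faa")
if fasta_path.exists():
    with fasta_path.open() as handle:
        print(f"{fasta_path}: {count_fasta_records(handle)} protein sequences")
else:
    print(f"{fasta_path} not found; run the download cell first")
```

A count of zero, or a missing file, usually means the download or the unzip step did not complete.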

# Effector prediction
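Run EffectorP. This command expects `extracelullar.fa`, which is created in the WoLF PSORT section below, so run that section first.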

In [None]:
!python EffectorP-3.0/EffectorP.py -i extracelullar.fa > effectorP_output.txt

# Detect extracellular proteins using WoLF PSORT
To ensure high-confidence secretion predictions before running EffectorP3, you can detect extracellular proteins and prepare your input proteins with WoLF PSORT: https://wolfpsort.hgc.jp/aboutWoLF_PSORT.html.en

Run WoLF PSORT

In [None]:
!wget https://github.com/fmaguire/WoLFPSort/archive/refs/heads/master.zip
!unzip master.zip

In [None]:
# Prompt the user to input the file name
input_file = input("Enter the path to the input FASTA file (e.g., 'proteins.fa'): ")

# Run the WoLFPSort command using the user-provided file name
!WoLFPSort-master/bin/runWolfPsortSummary fungi < {input_file} > psortb

Prepare your new input and run EffectorP

In [None]:
# Keep the proteins that WoLF PSORT predicts as extracellular ("extr"), then extract
# their sequences with seqtk. See https://wolfpsort.hgc.jp/aboutWoLF_PSORT.html.en

input_file = input("Enter the path to the input FASTA file (e.g., 'proteins.fa'): ")
# Fields 1 and 2 of each summary line are the protein ID and the first-listed localization
!cut -f 1,2 -d " " psortb | grep extr > extracelullar_list
!seqtk subseq {input_file} extracelullar_list > extracelullar.fa
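
The filtering step above can also be sketched in plain Python, which makes the selection rule explicit. This is an illustrative alternative, not a replacement for the `cut`/`grep`/`seqtk` commands. It assumes the layout those commands imply: space-separated summary lines with the protein ID in the first field and the first-listed localization in the second. The function names and the example lines are made up for illustration. Unlike `grep extr`, it requires the second field to be exactly `extr`, so IDs or combined labels that merely contain that substring are not selected.

```python
def extracellular_ids(summary_lines, label="extr"):
    """Return IDs whose first-listed localization is exactly `label`."""
    ids = []
    for line in summary_lines:
        if line.startswith("#"):  # skip comment lines
            continue
        fields = line.split()
        if len(fields) >= 2 and fields[1] == label:
            ids.append(fields[0])
    return ids


def subset_fasta(fasta_lines, keep_ids):
    """Yield FASTA lines of records whose ID (first word after '>') is in keep_ids."""
    keep = set(keep_ids)
    writing = False
    for line in fasta_lines:
        if line.startswith(">"):
            header = line[1:].split()
            writing = bool(header) and header[0] in keep
        if writing:
            yield line


# Made-up example lines, only to show the expected shape of the input.
summary = ["P1 extr 20, mito 3", "P2 cyto 15, extr 5", "P3 extr 12"]
fasta = [">P1 desc", "MKV", ">P2", "AAA", ">P3", "GGG"]

ids = extracellular_ids(summary)
print(ids)  # ['P1', 'P3']
print(list(subset_fasta(fasta, ids)))  # ['>P1 desc', 'MKV', '>P3', 'GGG']
```

To apply this to real files, pass open file handles for `psortb` and the input FASTA in place of the example lists, and write the yielded lines to `extracelullar.fa`.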

Run EffectorP

In [None]:
!python EffectorP-3.0/EffectorP.py -i extracelullar.fa > effectorP_output.txt

You can use SignalP and TMHMM instead of WoLF PSORT to detect extracellular proteins.

Additional Tools for Preparing Input Data

For both SignalP and TMHMM, registration and an academic agreement are required. Please request access via the following links:

- [SignalP](https://services.healthtech.dtu.dk/cgi-bin/sw_request?software=signalp&version=6.0&packageversion=6.0h&platform=fast)
- [TMHMM](https://services.healthtech.dtu.dk/cgi-bin/sw_request?software=tmhmm&version=2.0c&packageversion=2.0c&platform=Linux)

After receiving the source code, move it to your workspace and follow the installation instructions provided below. We’ll also use PSORTb and WoLF PSORT as alternatives for localization predictions and Seqtk for efficient sequence parsing.