Skip to content

jihunni/BGCfinder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 

Repository files navigation

BGCfinder : Biosynthetic Gene Cluster detection with Graph Neural Network

BGCfinder detects biosynthetic gene clusters in bacterial genomes using deep learning. BGCfinder takes a fasta file containing protein sequences and convert each of them into a graph. Graph neural network takes the input graphs to detect biosynthetic gene cluster.

Installation requirement:

To construct the conda environment,

$ conda create --name BGCfinder --clone base
$ conda init bash
$ conda activate BGCfinder
$ conda install pytorch cudatoolkit=11.3 -c pytorch
$ conda install pyg -c pyg
$ pip install BGCfinder

To download the BGCfinder model and test files,

$ bgc-download

To find the protein-coding gene in bacterial genome (Installation of Prodigal is required for this step),

$ prodigal -f gff -i bacterial_genome_seq.fasta -a bacterial_protein_seq.fasta -o bacterial_genome_seq.gff 

To run BGCfinder with a fasta file containing amino acid sequence with CPU (recommended),

bgcfinder bacterial_genome.fasta -o output_filename.tsv -l log_record.log -d False

To run BGCfinder with a fasta file containing amino acid sequence with GPU,

bgcfinder bacterial_genome.fasta -o output_filename.tsv -l log_record.log -d True

The development environment of BGCfinder :

'torch==1.10.0',
'torch-geometric==2.0.2',
'torch-scatter==2.0.9',
'torch-sparse==0.6.12'

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published