Skip to content

ChemEXL/BiCEV

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

De Novo Design of Molecules with Multi-action Potential from Differential Gene Expression using Variational Autoencoder

This repository contains the code to implement BiCEV (Bidirectional Compound-Expression Variational Autoencoder) for generating molecules from given gene expression data.

Additional Files

  • Compound and gene expression datasets (including GSE70138, gene-knockdown profiles, and combined signatures of synergistic drug pairs) can be downloaded from the dataset folder in this link: https://kmutt.me/bicev_data.
  • Please copy the downloaded datasets to the data folder to implement.
  • The weights of CLA and BiCEV models are also provided in the same link in the weights folder.

Installation

BiCEV implementation relies on NumPy, Pandas, PyTorch, PyTorch Lightning, RDKit-pypi, cmapPy, and fcd. You may install these dependencies using the following command (recommend for cuda version 12.0):

pip install -r requirements.txt

Training model

Pretraining Chemical Language Autoencoder Model (CLA)

import pytorch_lightning as pl
from model.cla_model import CLA

cla_model = CLA()
cla_trainer = pl.Trainer(max_epochs=20)
cla_trainer.fit(cla_model)

Training BiCEV

import pytorch_lightning as pl
from model.cea_model import BiCEV

model = BiCEV(cla_enc_weight='cla_encoder_weight.ckpt',
                cla_dec_weight='cla_decoder_weight.ckpt')
trainer = pl.Trainer(gpus=[0], max_epochs=1)
trainer.fit(model)

References

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages