RNAmigos 2.0

Welcome on RNAmigos 2.0 !

Description

RNAmigos is a virtual screening tool : given the binding site of a target and a library of chemical compounds, it ranks the compounds so that better ranked compounds have a higher chance to bind the target. It is based on a machine learning model using the PyTorch framework and was trained leveraging unsupervised and synthetic data. It was shown to display similar enrichment factors to docking while running in a fraction of the time. A detailed description of the tool is available on BioRxiv.

If you find this tool useful, please cite

@article{carvajal2023semi,
  title={Semi-supervised learning and large-scale docking data accelerate RNA virtual screening},
  author={Carvajal-Patino, Juan G and Mallet, Vincent and Becerra, David and Nino, L Fernando and Oliver, Carlos and Waldispuhl, Jerome},
  journal={bioRxiv},
  pages={2023--11},
  year={2023},
  publisher={Cold Spring Harbor Laboratory}
}

Using the tool with Collab

The easiest way to use the tool is to use Google Colab.

You will need to provide a cif file, a binding site in the form of a list of binding pocket nodes and a list of ligand smiles.

Using the tool locally

A local use of the tool is also possible by following the next steps. First, create a conda environment:

conda create -n rnamigos2
conda activate rnamigos2
pip install -r requirements.txt

To run RNAmigos2.0 on your own target and ligands, use the experiments/inference.py script.

You will need to provide the following:

Path to an mmCif file
Path to a .txt file with one SMILES string per line
A list of binding site residue identifiers

Now you can just run the inference script to get a score for each ligand in your SMILES .txt file. Taking example structure and ligand file from /sample_files, selecting residues 16-20 of chain A as the binding site, the corresponding command is :

python experiments/inference.py cif_path=sample_files/3ox0.cif \
                                pdbid=3ox0 \
                                residue_list=\[A.20,A.19,A.18,A.17,A.16\] \
                                ligands_path=sample_files/test_smiles.txt \
                                out_path=scores.txt

Once this executes you will have scores.txt that looks like this:

CCC[S@](=O)c1ccc2[nH]/c(=N\C(=O)OC)[nH]c2c1 0.2639017701148987
O=C(O)[C@@H](O)c1ccccc1 0.6267350912094116
CC(=O)Oc1ccccc1C(=O)O 0.6304176449775696
CN1[C@H]2CC[C@@H]1CC(OC(=O)[C@H](CO)c1ccccc1)C2 0.47674891352653503
...

Reproducting results and figures

The steps necessary to reproduce results and figures are detailed in REPRODUCE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 329 Commits
conf		conf
data		data
experiments		experiments
fig_scripts		fig_scripts
graveyard		graveyard
job_scripts		job_scripts
pretrained		pretrained
rnamigos_dock		rnamigos_dock
sample_files		sample_files
saved_models		saved_models
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
REPRODUCE.md		REPRODUCE.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
rnamigos_inference.ipynb		rnamigos_inference.ipynb

License

cgoliver/rnamigos2

Folders and files

Latest commit

History

Repository files navigation

RNAmigos 2.0

Table of Contents:

Description

Using the tool with Collab

Using the tool locally

Reproducting results and figures

About

Resources

License

Stars

Watchers

Forks

Languages