Skip to content

cgoliver/rnamigos2

Repository files navigation

RNAmigos 2.0

Welcome on RNAmigos 2.0 !

Table of Contents:

Description

RNAmigos is a virtual screening tool : given the binding site of a target and a library of chemical compounds, it ranks the compounds so that better ranked compounds have a higher chance to bind the target. It is based on a machine learning model using the PyTorch framework and was trained leveraging unsupervised and synthetic data. It was shown to display similar enrichment factors to docking while running in a fraction of the time. A detailed description of the tool is available on BioRxiv.

If you find this tool useful, please cite

@article{carvajal2023semi,
  title={Semi-supervised learning and large-scale docking data accelerate RNA virtual screening},
  author={Carvajal-Patino, Juan G and Mallet, Vincent and Becerra, David and Nino, L Fernando and Oliver, Carlos and Waldispuhl, Jerome},
  journal={bioRxiv},
  pages={2023--11},
  year={2023},
  publisher={Cold Spring Harbor Laboratory}
}

Using the tool with Collab

The easiest way to use the tool is to use Google Colab.

Open In Colab

You will need to provide a cif file, a binding site in the form of a list of binding pocket nodes and a list of ligand smiles.

Using the tool locally

A local use of the tool is also possible by following the next steps. First, create a conda environment:

conda create -n rnamigos2
conda activate rnamigos2
pip install -r requirements.txt

To run RNAmigos2.0 on your own target and ligands, use the experiments/inference.py script.

You will need to provide the following:

  • Path to an mmCif file
  • Path to a .txt file with one SMILES string per line
  • A list of binding site residue identifiers

Now you can just run the inference script to get a score for each ligand in your SMILES .txt file. Taking example structure and ligand file from /sample_files, selecting residues 16-20 of chain A as the binding site, the corresponding command is :

python experiments/inference.py cif_path=sample_files/3ox0.cif \
                                pdbid=3ox0 \
                                residue_list=\[A.20,A.19,A.18,A.17,A.16\] \
                                ligands_path=sample_files/test_smiles.txt \
                                out_path=scores.txt

Once this executes you will have scores.txt that looks like this:

CCC[S@](=O)c1ccc2[nH]/c(=N\C(=O)OC)[nH]c2c1 0.2639017701148987
O=C(O)[C@@H](O)c1ccccc1 0.6267350912094116
CC(=O)Oc1ccccc1C(=O)O 0.6304176449775696
CN1[C@H]2CC[C@@H]1CC(OC(=O)[C@H](CO)c1ccccc1)C2 0.47674891352653503
...

Reproducting results and figures

The steps necessary to reproduce results and figures are detailed in REPRODUCE.md.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published