Skip to content
Switch branches/tags

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

COVID Molecules

** The contents of this repository website are for research and educational purposes only. **

This repository contains a crowd sourced list of molecules derived from literature and other sources to aid computational screenings for molecules that have been screened experimentally or computationally against coronaviruses (SARS,MERS, SARS-CoV-2).

Lit Database - Data Structure

/LIT/LIT.csv - A csv file with headers ["molecule", "virus", "reference", "type", "smiles", "pubchem_id", "similarity_calculated", "release"]

  • molecule - the name of the molecule (best effort)
  • virus - the virus the molecule was screened against (SARS, MERS, SARS-CoV-2)
  • reference - the reference from which the information was collected
  • type - the type of screening performed ["computational","experimental","mixed"]
  • pubchem_smiles - The SMILES as retrieved from pubchem
  • canonical_smiles - The oBabel canonical SMILES as computed from the pubchem SMILES
  • pubchem_id - The PubChem ID of the molecule if available
  • similarity_calculated - 1 if similarities have been calculated to other databases, 0 otherwise
  • release - The current release associated with the row of information

Note that similarities will be calculated to molecules in the databases available from the The nCov-Group Data Repository, and made available in future releases.

/LIT/references.csv - A line delimited set of references that have been surveyed to arrive at the results


v0.1 (2020-04-21) - initial availability

v0.2 (2020-05-08) - new molecules and inlude Pubchem and canonical SMILES


Yadu Nand Babuji, Ben Blaiszik, Kyle Chard, Ryan Chard, Ian Foster, India Gordon, Zhi Hong, Kasia Karbarz, Zhuozhao Li, Linda Novak, Susan Sarvey, Marcus Schwarting, Julie Smagacz,Logan Ward, Monica Orozco White

Research was supported by the DOE Office of Science through the National Virtual Biotechnology Laboratory, a consortium of DOE national laboratories focused on response to COVID-19, with funding provided by the Coronavirus CARES Act.

Data storage and computational support for this research project has been generously supported by the following resources. The data generated have been prepared as part of the nCov-Group Collaboration, a group of over 200 researchers working to use computational techniques to address various challenges associated with COVID-19.

Petrel Data Service at the Argonne Leadership Computing Facility (ALCF) This research used resources of the Argonne Leadership Computing Facility, a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.

Argonne Leadership Computing Facility (ALCF) This research used resources of the Argonne Leadership Computing Facility, a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.

Frontera at the Texas Advanced Computing Center (TACC)

Comet at the San Diego Supercomputing Center (SDSC)

Data and Computing Infrastructure Many aspects of the data and computing infrastructure have been leveraged from other projects including but not limited to:

Data processing and computation:

Data Tools, Services, and Expertise:


For All Information

Unless otherwise indicated, this information has been authored by an employee or employees of the UChicago Argonne, LLC., operator of the Argonne National laboratory with the U.S. Department of Energy. The U.S. Government has rights to use, reproduce, and distribute this information. The public may copy and use this information without charge, provided that this Notice and any statement of authorship are reproduced on all copies.

While every effort has been made to produce valid data, by using this data, User acknowledges that neither the Government nor UChicago Argonne LLC. makes any warranty, express or implied, of either the accuracy or completeness of this information or assumes any liability or responsibility for the use of this information. Additionally, this information is provided solely for research purposes and is not provided for purposes of offering medical advice. Accordingly, the U.S. Government and UChicago Argonne LLC. are not to be liable to any user for any loss or damage, whether in contract, tort (including negligence), breach of statutory duty, or otherwise, even if foreseeable, arising under or in connection with use of or reliance on the content displayed on this site.

For Scientific and Technical Information Only © Copyright UChicago Argonne LLC. All Rights Reserved.


No description, website, or topics provided.



No releases published


No packages published