Free library of CSRML-based ToxPrint chemotypes. Download the latest version from the "Releases" tab.

mn-am/toxprint

Welcome to ToxPrint

ToxPrint is a publicly available, invariant reference set (or library) of structural features (substructures) designed to cover the chemical structures found in large toxicity databases and regulatory inventories (Yang 2015). ToxPrint chemotypes were developed by Altamira LLC (now part of MN-AM) for the CERES (Chemical Evaluation and Risk Estimation System) project of the U.S. FDA Center for Food Safety and Applied Nutrition (CFSAN). Chemotypes, including the ToxPrint chemotypes, are implemented in the ChemoTyper, which Molecular Networks GmbH (now part of MN-AM) developed under contract from the U.S. FDA.

Chemotypes

This repository houses various chemotype files to support knowledge representation. Chemotypes are structural fragments (substructures) that can encode physicochemical, atomic, bond, and electronic properties in addition to substructural connectivity. Through these encoded properties, and not only through a structural motif, they can be associated with biological properties and modes of action in toxicity pathways or with adverse outcome pathways (AOPs). Chemotypes use the CSRML (Chemical Substructures and Reactions Mark-up) language to represent both atom-bond connectivity and properties such as pi-systems or partial charges.

ToxPrint Chemotypes

The ChemoTyper organizes the current version of the ToxPrint chemotypes into the following three functional areas.

  • Generic structural fragments
  • Structural rules and alerts
  • Category classifiers

Generic Structural Fragments

Generic structural fragments are organized by atom, bond, chain, and ring types as well as by chemical groups, including amino acids, carbohydrates, ligands, and nucleobases. They are based on the 729 essential chemotypes of the current ToxPrint set (version 2.0 r1520). These chemotypes can be used to generate chemical fingerprints, either as binary (0/1) or as count data. The fingerprints can then be used to calculate similarity measures or serve as structural feature descriptors for building predictive models (Yang 2015).
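As a rough illustration of the fingerprint idea, the sketch below turns per-molecule chemotype match counts into binary and count vectors and compares two binary vectors with the Tanimoto coefficient. The chemotype labels and match counts are invented for the example and are not real ToxPrint identifiers; in practice the matches would come from the ChemoTyper, and the vector would have one position per chemotype in the library.

```python
from typing import Dict, List

# Hypothetical chemotype labels, for illustration only (not real ToxPrint names).
CHEMOTYPES: List[str] = ["carbonyl", "benzene_ring", "alkane_chain", "amino_acid"]


def to_counts(matches: Dict[str, int]) -> List[int]:
    """Count fingerprint: number of matches per chemotype, in library order."""
    return [matches.get(name, 0) for name in CHEMOTYPES]


def to_binary(matches: Dict[str, int]) -> List[int]:
    """Binary fingerprint: 1 if the chemotype matches at least once, else 0."""
    return [1 if count > 0 else 0 for count in to_counts(matches)]


def tanimoto(a: List[int], b: List[int]) -> float:
    """Tanimoto coefficient of two binary fingerprints of equal length."""
    if len(a) != len(b):
        raise ValueError("fingerprints must have the same length")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-zero fingerprints share no features; report 0.0 by convention.
    return both / either if either else 0.0


# Invented match counts for two molecules.
mol_a = {"carbonyl": 2, "benzene_ring": 1}
mol_b = {"carbonyl": 1, "alkane_chain": 1}

similarity = tanimoto(to_binary(mol_a), to_binary(mol_b))
```

Here the two molecules share one of three chemotypes present in either, so `similarity` is 1/3. The count vectors from `to_counts` could be fed to a model as descriptors instead.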

Structural Rules and Alerts

Structural rules and alerts can be developed using ToxPrint chemotypes as building blocks. The chemotypes defined in the ToxPrint set can be further refined or coded with properties (atom, bond, molecular, electronic, or physicochemical) to constrain the matches and to enhance the signal-to-noise ratio of ToxPrint chemotypes when profiling biological observations. To this end, the Chemotype Editor lets users edit CSRML query definitions graphically in a molecular editor. For further information, please contact MN-AM.

  • Ashby-Tennant Genotoxic Carcinogen Alerts
  • DNA binders
  • Protein binders
  • General liver alerts

Category Classifiers

When characterizing different databases, TTC datasets, or inventories to differentiate their chemical spaces, an invariant reference set of structural features is required. In addition, when developing categories for regulatory inventories (cosmetics, drugs, agrochemicals, etc.) or representing particular toxicity or biological patterns within adverse outcome pathway (AOP) or mode-of-action (MOA) pathways or ontology networks, chemotypes designed specifically to capture the underlying knowledge can be a powerful set of tools. To this end, the Chemotype Editor lets users edit CSRML query definitions graphically in a molecular editor. For further information, please contact MN-AM. Here are a few existing category classifiers.

  • Threshold of Toxicological Concern (TTC) approach (part of the ToxPrint library)
  • Organic Flame Retardant Chemotypes
  • Antimicrobial Chemotypes (Yang 2020)
  • Per- and Polyfluoroalkyl Substances (PFAS) Chemotypes (based on ToxPrints) (Richard 2023, also downloadable from this repository)
  • Bisphenol Chemotypes (also downloadable from this repository)

The ToxPrint chemotype library is implemented in the XML-based CSRML language and can be applied using the publicly available ChemoTyper application.
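Because the library is distributed as an XML file, it can be inspected with any XML parser before loading it into the ChemoTyper. The sketch below deliberately assumes nothing about the CSRML schema: it only tallies how often each element name occurs. The sample document and its element names are invented for the example and are not CSRML.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag: str) -> str:
    """Drop a namespace prefix of the form '{uri}name', if present."""
    return tag.rsplit("}", 1)[-1]


def tally_elements(xml_text: str) -> Counter:
    """Count how often each element name occurs in an XML document."""
    root = ET.fromstring(xml_text)
    # root.iter() walks the root element and all of its descendants.
    return Counter(local_name(element.tag) for element in root.iter())


# Invented sample document; the element names are NOT taken from CSRML.
SAMPLE = (
    '<library xmlns="urn:example">'
    '<entry id="a"><part/></entry>'
    '<entry id="b"/>'
    "</library>"
)

tally = tally_elements(SAMPLE)
```

For a downloaded file, the text could be read from disk first, or the tree obtained with `ET.parse(path).getroot()` in place of `ET.fromstring`.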

Updating ToxPrint Chemotype File in ChemoTyper Application

Please note: updated ToxPrint CSRML files do not automatically replace the files in your current ChemoTyper installation. To use a new version of the ToxPrint chemotypes in your ChemoTyper installation, please follow the instructions below.

  • Close the ChemoTyper application
  • Download the ZIP archive that contains the latest ToxPrint CSRML file (named like "ToxPrint_V2.0_r<number>.xml")
  • Extract its content into a folder of your choice, e.g., "Documents"
  • Start Windows File Explorer with administrative privileges (use "Run as Administrator" command)
  • Move the extracted ToxPrint_V2.0_r<number>.xml file into the "share" folder of your ChemoTyper installation at "C:\Program Files (x86)\Molecular Networks\ChemoTyper\share"

After that, the contents of the latest ToxPrint file can be loaded as usual from the Start page of the ChemoTyper application.
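The extract-and-move steps above could also be scripted. The following is a minimal sketch, not an official installer: the function name `install_toxprint` is made up for this example, and it assumes the archive contains a file matching the "ToxPrint_V2.0_r<number>.xml" naming pattern described above. When the target is the "share" folder under "C:\Program Files (x86)", the script has to be run from an elevated (administrator) prompt, with the ChemoTyper closed.

```python
import fnmatch
import shutil
import zipfile
from pathlib import Path
from typing import List

# File name pattern taken from the instructions above.
TOXPRINT_PATTERN = "ToxPrint_V2.0_r*.xml"


def install_toxprint(zip_path: Path, share_dir: Path) -> List[Path]:
    """Copy every ToxPrint CSRML file found in the archive into share_dir."""
    if not share_dir.is_dir():
        raise FileNotFoundError(f"share folder not found: {share_dir}")
    installed: List[Path] = []
    with zipfile.ZipFile(zip_path) as archive:
        for member in archive.namelist():
            # Ignore any folder structure inside the archive.
            name = Path(member).name
            if fnmatch.fnmatch(name, TOXPRINT_PATTERN):
                target = share_dir / name
                with archive.open(member) as src, open(target, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                installed.append(target)
    return installed
```

A call might look like `install_toxprint(Path("toxprint.zip"), Path(r"C:\Program Files (x86)\Molecular Networks\ChemoTyper\share"))`, where the archive name is a placeholder for whatever file was downloaded.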

References

Contact

For technical support, please contact support@mn-am.com.

How to Cite ToxPrint Chemotypes

Please use the full information provided in the section "Cite this repository" in the "About" tab or, at a minimum, cite as follows.

(a) Yang et al. J. Chem. Inf. Model. 2015, 55(3), 510-528 (DOI: doi.org/10.1021/ci500667v). (b) ToxPrint chemotypes by MN-AM, Version 2.0 r711 (2014-06-11), github.com/mn-am/toxprint, accessed on $DATE.

Acknowledgement

The ChemoTyper application was developed by Molecular Networks GmbH, Erlangen, Germany under a contract from the U.S. FDA Center for Food Safety and Applied Nutrition (CFSAN), Office of Food Additive Safety.

The XML-based substructure (or chemotype) definition language CSRML was developed in collaboration with Altamira LLC, Columbus, OH, USA.

Visit the website of Molecular Networks GmbH and Altamira LLC.