Skip to content

EnvCommons/ADME

Repository files navigation

ADME

OpenReward Environment

Description

ADME is an environment for evaluating agents on ADME (Absorption, Distribution, Metabolism, Excretion) property prediction. Given a molecule's SMILES string and an ADME endpoint name, agents predict the numerical property value. The dataset pools 8 ADME regression datasets from Therapeutics Data Commons (TDC), covering Caco-2 permeability, lipophilicity, aqueous solubility, plasma protein binding, clearance, volume of distribution, and half-life.

Capabilities

  • Predicting diverse ADME molecular properties from SMILES notation
  • Quantitative molecular property prediction across multiple endpoints
  • Understanding structure-property relationships in pharmacokinetics

Compute Requirements

ADME does not require a sandbox. It has minimal compute requirements.

License

CC BY 4.0 (following the TDC dataset licenses).

Tasks

There are two splits: train (1,000 tasks) and test (100 tasks), totaling 1,100 tasks. Tasks are sampled proportionally from 8 TDC ADME regression datasets:

Dataset Property Units Molecules Source
Lipophilicity_AstraZeneca Lipophilicity LogP 4,200 TDC
Solubility_AqSolDB Aqueous Solubility log(mol/L) 9,982 TDC
PPBR_AZ Plasma Protein Binding Rate % 2,828 TDC
Caco2_Wang Caco-2 Permeability log cm/s 910 TDC
Clearance_Hepatocyte_AZ Hepatocyte Clearance uL/min/10^6 cells 1,213 TDC
Clearance_Microsome_AZ Microsome Clearance uL/min/mg 1,102 TDC
VDss_Lombardo Volume of Distribution L/kg 1,130 TDC
Half_Life_Obach Half-Life hr 667 TDC

Each task provides a molecule's SMILES string, the property name, and the expected units.

Reward Structure

This is a sparse, verifiable reward environment with continuous scoring. The agent calls submit_prediction once with a predicted value. The reward is based on relative error using inverse hyperbolic cosine scaling:

$$\text{Reward} = \frac{1}{\cosh(\text{relative_error} \times 3.0)}$$

where $\text{relative_error} = \frac{|\hat{y} - y|}{|y|}$.

Relative Error Reward
0% (exact) 1.000
10% 0.957
50% 0.425
100% 0.099

For actual values of 0, the reward falls back to absolute error scaling.

We do not use LLM graders for this task.

Data

Task data is pooled from 8 TDC ADME regression datasets (~22,000 molecules total). 1,100 molecules are sampled proportionally across datasets. Data files are stored on the OpenReward platform.

Tools

Agents are given a single tool:

  • submit_prediction: Submit a predicted numerical value for the ADME property. Returns the reward based on prediction accuracy. This tool can only be called once per task.

Time Horizon

ADME is a single-turn environment. The agent receives a molecule and property endpoint, then submits one prediction. Each task requires exactly one tool call.

Environment Difficulty

[Fill in: baseline model performance, human expert comparison, variance across property types]

Other Environment Requirements

There are no further environment requirements; ADME works out of the box with the OpenReward endpoint.

Safety

Agents in ADME are asked to predict pharmacokinetic properties for molecules. The environment does not present direct safety risks, as agents only provide numerical predictions with no access to external systems or real pharmacological processes.

However, this is a dual-use domain and models trained for capabilities in this environment could be used for malicious uses in other agentic workflows.

Citations

@dataset{GRADME,
  author    = {General Reasoning Inc. Team},
  title     = {ADME},
  year      = {2026},
  publisher = {OpenReward},
  url       = {https://openreward.ai/GeneralReasoning/ADME}
}

@article{huang2021therapeutics,
  title={Therapeutics Data Commons: Machine learning datasets and tasks for drug discovery and development},
  author={Huang, Kexin and Fu, Tianfan and Gao, Wenhao and Zhao, Yue and Roohani, Yusuf and Leskovec, Jure and Coley, Connor W and Xiao, Cao and Sun, Jimeng and Zitnik, Marinka},
  journal={Proceedings of NeurIPS Datasets and Benchmarks},
  year={2021}
}

About

ADME prediction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors