Skip to content
A high-quality hand-curated logD7.4 dataset of 1,130 compounds
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

logD7.4 of 1,130 Compounds logo

This repository archives a high-quality hand-curated lipophilicity dataset that includes the chemical structure (SMILES) of 1,130 organic compounds and their n-octanol/buffer solution distribution coefficients at pH 7.4 (logD7.4), originally curated by our paper (PDF).

About logD7.4

As a determinant of several ADME properties, lipophilicity (logD7.4) is a key physical property in the development of small molecule oral drugs. This dataset can be applied for method benchmarking in regression modeling, cheminformatics, and chemometrics research.

Paper Citation

If you find this dataset useful in your research, please cite our paper:

Formatted citation:

Wang, J-B., D-S. Cao, M-F. Zhu, Y-H. Yun, N. Xiao, Y-Z. Liang (2015). In silico evaluation of logD7.4 and comparison with other prediction methods. Journal of Chemometrics, 29(7), 389-398.

BibTeX entry:

  title={\textit{In silico} evaluation of $\text{logD}_{7.4}$ and comparison with other prediction methods},
  author={Wang, Jian-Bing and Cao, Dong-Sheng and Zhu, Min-Feng and Yun, Yong-Huan and Xiao, Nan and Liang, Yi-Zeng},
  journal={Journal of Chemometrics},
  publisher={Wiley Online Library}
You can’t perform that action at this time.