🧬 Geometry-aware Lightweight Convolutional Network for Efficient Molecular Property Prediction

📖 abstract

Molecular representation learning (MRL) has demonstrated significant potential in various fields such as drug discovery, particularly in extracting molecular features under limited supervision. However, most existing approaches rely on one-dimensional sequences or two-dimensional topological structures, which fail to adequately capture the complexity of molecular three-dimensional (3D) geometry, thereby limiting their performance in complex property prediction tasks. To more effectively model spatial structural information, three-dimensional convolutional neural networks have recently gained attention in MRL research due to their ability to directly process voxelized 3D molecular data. Nevertheless, these methods often suffer from severe computational inefficiencies caused by the inherent sparsity of voxel data, resulting in a large number of redundant operations. In addition, the commonly used large convolutional kernels—though beneficial for increasing model capacity—introduce substantial computational overhead, which restricts scalability in practical applications. To address these challenges, we propose Prop3D, an efficient 3D molecular representation learning model. Prop3D adopts a kernel decomposition strategy that significantly reduces computational cost while maintaining high predictive accuracy. Experimental results on multiple public benchmark datasets demonstrate that Prop3D consistently outperforms several state-of-the-art methods in molecular property prediction.

📝 Key Contributions

Modeling molecular data as 3D grids.
Proposed a representation learning model for molecular property prediction called Prop3D.
Introduced the large kernel decomposition strategy into molecular representation learning.
Achieved significant performance improvements on multiple public datasets.

🚀 Quick Start

1. Set up the environment

conda create -n Prop3D python=3.9
conda activate Prop3D
pip install -r requirements.txt

2.Dataset Download Instructions

1.QM9:Source: Atom3D - QM9 Dataset

2.ESOL Freesolv Tox21:Source: Drug3D-Net GitHub

3.Model Training and Evaluation

This project supports training and evaluation on four widely-used molecular datasets:

Dataset	Type	Task	Script
QM9	Regression	Quantum chemistry properties	`train.py`
ESOL	Regression	Aqueous solubility prediction	`esol.py`
FreeSolv	Regression	Hydration free energy	`freesolv.py`
Tox21	classification	Toxicity classification (12 tasks)	`Tox21.py`

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
Prop3D		Prop3D
README.md		README.md
fig1.png		fig1.png
requirements		requirements

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧬 Geometry-aware Lightweight Convolutional Network for Efficient Molecular Property Prediction

📖 abstract

📝 Key Contributions

🚀 Quick Start

1. Set up the environment

2.Dataset Download Instructions

3.Model Training and Evaluation

About

Uh oh!

Releases

Packages

Languages

zh-netizen/Prop3D

Folders and files

Latest commit

History

Repository files navigation

🧬 Geometry-aware Lightweight Convolutional Network for Efficient Molecular Property Prediction

📖 abstract

📝 Key Contributions

🚀 Quick Start

1. Set up the environment

2.Dataset Download Instructions

3.Model Training and Evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages