Name		Name	Last commit message	Last commit date
parent directory ..
data_utils		data_utils
static_ops		static_ops
tests		tests
README.md		README.md
benchmarks.yml		benchmarks.yml
conftest.py		conftest.py
embedding_layers.py		embedding_layers.py
layers.py		layers.py
model.py		model.py
requirements.txt		requirements.txt
run_training.py		run_training.py
utils.py		utils.py
xpu.py		xpu.py

README.md

Message Passing Neural Network (MPNN)

MPNN implementations (Graph Isomorphism Network, Graph Network and Interaction Network), optimised for Graphcore's IPU.

Framework	Domain	Model	Datasets	Tasks	Training	Inference	Reference
TensorFlow 2	GNNs	MPNN	MolHIV		✅ Min. 1 IPU (POD4) required	❌	How powerful are graph neural networks?, Relational inductive biases, deep learning, and graph networks, Neural message passing for quantum chemistry

Instructions summary

Install and enable the Poplar SDK (see Poplar SDK setup)
Install the system and Python requirements (see Environment setup)
Download the MolHIV dataset (See Dataset setup)

Poplar SDK setup

To check if your Poplar SDK has already been enabled, run:

 echo $POPLAR_SDK_ENABLED

If no path is provided, then follow these steps:

Navigate to your Poplar SDK root directory
Enable the Poplar SDK with:

cd poplar-<OS version>-<SDK version>-<hash>
. enable.sh

More detailed instructions on setting up your Poplar environment are available in the Poplar quick start guide.

Environment setup

To prepare your environment, follow these steps:

Create and activate a Python3 virtual environment:

python3 -m venv <venv name>
source <venv path>/bin/activate

Navigate to the Poplar SDK root directory
Install the TensorFlow 2 and IPU TensorFlow add-ons wheels:

cd <poplar sdk root dir>
pip3 install tensorflow-2.X.X...<OS_arch>...x86_64.whl
pip3 install ipu_tensorflow_addons-2.X.X...any.whl

For the CPU architecture you are running on

Build the custom ops:

cd static_ops && make

More detailed instructions on setting up your TensorFlow 2 environment are available in the TensorFlow 2 quick start guide.

Dataset setup

MolHIV

Download the "molhiv" dataset via the ogb repository. The leaderboard for this task is found here.

[1] Hu, Weihua, et al. "Open graph benchmark: Datasets for machine learning on graphs." arXiv preprint arXiv:2005.00687 (2020).

Custom training

Model Details

To integrate the use of edges (i.e. the information from atomic bonds), we follow [5] and embed the edges afresh at each layer. These embedded edges are combined with the neighborhood aggregation from the nodes.

We used similar parameters to the example in [5], but use Layer Normalization instead of Batch Normalization at the hidden layers of the multi-layer perceptrons.

[2] Hu, Weihua, et al. "Strategies for pre-training graph neural networks." arXiv preprint arXiv:1905.12265 (2019).

Performance

We compare our model with the GIN implemented on the OGB leaderboard. As per the instructions, training is performed 10 times and the results reported.

The command used is: python run_training.py --micro_batch_size=24 --cosine_lr=True --epochs=150 --nodes_dropout=.1, which uses FP16 training.

The test results are consistent with the reported results.

Model	Parameters	Test ROC-AUC	Validation ROC-AUC
GIN on leaderboard	1,885,206	0.7558 ± 0.0140	0.8232 ± 0.0090
IPU GIN model	1,708,106	0.7574 ± 0.0215	0.7915 ± 0.0114

Other features

Grouped gathers/scatters

This repository now implements grouped gathers and scatters. The compiler is fully aware that each member of the batch can only communicate within its own batch. To use this efficiently, we concatenate several graphs into each member of the batch. The packing_strategy_finder plans the packing so as to minimise padding. Using this grouping improves our throughput.

Licensing

This example is licensed under the MIT license.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tensorflow2

tensorflow2

README.md

Message Passing Neural Network (MPNN)

Instructions summary

Poplar SDK setup

Environment setup

Dataset setup

MolHIV

Custom training

Model Details

Performance

Other features

Grouped gathers/scatters

Licensing

Files

tensorflow2

Directory actions

More options

Directory actions

More options

Latest commit

History

tensorflow2

Folders and files

parent directory

README.md

Message Passing Neural Network (MPNN)

Instructions summary

Poplar SDK setup

Environment setup

Dataset setup

MolHIV

Custom training

Model Details

Performance

Other features

Grouped gathers/scatters

Licensing