Comparative Study — DeepWalk vs Graph Neural Networks

Do message‑passing models really outclass random walks? A systematic revisit across citation and e‑commerce graphs.


*(Figure: DeepWalk scheme)*


Table of Contents

  1. Project Overview
  2. Methodology
  3. Datasets
  4. Experiments

Project Overview

This project benchmarks three families of graph‑representation methods:

| Model | Paradigm | Supervision | Implementation |
|---|---|---|---|
| DeepWalk | Random walks + Skip‑Gram | ✗ (unsupervised) | Implemented from scratch (Word2Vec via gensim) |
| GNN w/ MLP | Message passing | ✓ (supervised) | Pure PyTorch |
| GAT | Attention‑based message passing | ✓ (supervised) | Pure PyTorch (no PyG) |

Our goal is to quantify node‑classification accuracy and embedding quality on three standard graphs: Cora, Citeseer, and Amazon‑Computers. See the PDF report for the full theoretical background and extended discussion.


Methodology

1 · DeepWalk

  • Random walks: 100 walks × length 10 per node
  • Embedding size: 128
  • Training: Skip‑Gram + negative sampling on the walk corpus
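The walk-generation step above can be sketched as follows. This is a minimal illustration, not the repository's implementation: the adjacency-dict format and function name are assumptions, and the resulting string "sentences" are the shape gensim's `Word2Vec` expects for Skip‑Gram training.

```python
import random

def random_walks(adj, num_walks=100, walk_length=10, seed=0):
    """Generate DeepWalk-style truncated random walks.

    adj: dict mapping node -> list of neighbour nodes.
    Returns a list of walks, each a list of node ids as strings,
    ready to be fed to gensim's Word2Vec as sentences.
    """
    rng = random.Random(seed)
    walks = []
    for _ in range(num_walks):            # 100 walks per node
        for start in adj:
            walk = [start]
            while len(walk) < walk_length:  # walk length 10
                nbrs = adj[walk[-1]]
                if not nbrs:                # dead end: truncate the walk
                    break
                walk.append(rng.choice(nbrs))
            walks.append([str(n) for n in walk])
    return walks
```

The corpus would then be passed to `gensim.models.Word2Vec(walks, vector_size=128, sg=1, negative=5)` to train the 128‑dimensional embeddings with Skip‑Gram and negative sampling.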

2 · Graph Neural Network (GNN)

  • Layers: 2
  • Aggregation: mean → MLP
  • Hidden size: 128
  • Activation: ReLU
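One layer of the "mean aggregation → MLP" scheme above can be sketched in NumPy (the repository's version is in PyTorch; function names here are illustrative):

```python
import numpy as np

def mean_aggregate(A, H):
    """Average each node's features with its neighbours' features."""
    A_hat = A + np.eye(A.shape[0])        # include the node itself
    deg = A_hat.sum(axis=1, keepdims=True)
    return (A_hat @ H) / deg              # row-wise mean over neighbourhood

def gnn_layer(A, H, W, b):
    """One layer: mean aggregation -> linear map -> ReLU."""
    return np.maximum(mean_aggregate(A, H) @ W + b, 0.0)
```

Stacking two such layers with hidden size 128 gives the GNN configuration described above.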

3 · Graph Attention Network (GAT)

  • Layers: 2 (8 heads each)
  • Attention: additive (LeakyReLU, α = 0.2)
  • Dropout: input 0.6  |  attention 0.6
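A single attention head with the additive scoring above can be sketched like this (a NumPy illustration under assumed parameter names, not the repo's PyTorch code; the split of the attention vector into `a_src`/`a_dst` is the usual decomposition of a^T [Wh_i ‖ Wh_j]):

```python
import numpy as np

def leaky_relu(x, alpha=0.2):
    return np.where(x > 0, x, alpha * x)

def gat_head(A, H, W, a_src, a_dst, alpha=0.2):
    """One GAT head: additive attention scores, softmax over neighbours."""
    Z = H @ W                                # projected features, (N, d')
    s = Z @ a_src                            # source half of a^T [Wh_i || Wh_j]
    d = Z @ a_dst                            # destination half
    e = leaky_relu(s[:, None] + d[None, :], alpha)
    mask = A + np.eye(A.shape[0])            # attend over neighbours + self
    e = np.where(mask > 0, e, -np.inf)       # non-edges get zero weight
    e = e - e.max(axis=1, keepdims=True)     # numerical stability
    att = np.exp(e)
    att /= att.sum(axis=1, keepdims=True)    # softmax per neighbourhood
    return att, att @ Z                      # coefficients and aggregated features
```

The 8 heads per layer would each run this computation with their own `W`, `a_src`, `a_dst`, with dropout applied to the inputs and to `att` during training.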

For the mathematical update rules, see Section 3 of the full report.


Datasets

| Dataset | Nodes | Edges | Classes |
|---|---|---|---|
| Cora | 2,708 | 5,429 | 7 |
| Citeseer | 3,327 | 4,732 | 6 |
| Amazon‑Computers | 13,752 | 245,861 | 10 |

All graphs are treated as undirected and pre‑processed with self‑loops removed.
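The pre-processing described above amounts to symmetrising the adjacency matrix and zeroing its diagonal; a minimal sketch (function name is illustrative):

```python
import numpy as np

def preprocess_adjacency(A):
    """Symmetrise the graph and drop self-loops."""
    A = np.maximum(A, A.T)   # treat every edge as undirected
    np.fill_diagonal(A, 0)   # remove self-loops
    return A
```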


Experiments

Training & Evaluation Setup

  • Split: 60 % train · 20 % val · 20 % test (stratified)
  • Optimizer: Adam (lr = 1e‑3) + OneCycleLR
  • Loss / classifier:
    • GNN / GAT: cross‑entropy
    • DeepWalk: logistic‑regression classifier trained on the frozen embeddings
  • Early stopping: patience = 50 epochs on validation accuracy
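The early-stopping criterion above can be sketched as a small training loop (names and callback signatures are illustrative, not the repository's API):

```python
def train_with_early_stopping(train_epoch, val_accuracy,
                              max_epochs=1000, patience=50):
    """Stop once validation accuracy hasn't improved for `patience` epochs.

    train_epoch(epoch): runs one training epoch.
    val_accuracy():     returns the current validation accuracy.
    """
    best_acc, best_epoch = float("-inf"), -1
    for epoch in range(max_epochs):
        train_epoch(epoch)
        acc = val_accuracy()
        if acc > best_acc:
            best_acc, best_epoch = acc, epoch      # new best: reset the clock
        elif epoch - best_epoch >= patience:
            break                                  # no improvement for `patience` epochs
    return best_acc
```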

Results

1 · Node‑Classification Accuracy

| Model | Cora | Citeseer | Amazon‑Computers |
|---|---|---|---|
| GNN | 85.6 % | 70.0 % | 63.5 % |
| GAT | 84.3 % | 69.8 % | 87.1 % |
| DeepWalk | 85.1 % | 59.6 % | 88.6 % |

2 · Embedding Visualisation (t‑SNE)

*(Figures: t‑SNE projections of the learned embeddings for each model — GNN, GAT, DeepWalk — on Cora, Citeseer, and Amazon‑Computers.)*

For all theoretical background and extended discussion of these numbers and plots, please refer to the accompanying PDF report.
