Skip to content
Node Embeddings in Dynamic Graphs
Python Shell
Branch: master
Clone or download
Latest commit f856bdf Aug 27, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
docs extended abstract uploaded Dec 12, 2018
python eval with twittertennis package 1 Jun 24, 2019
scripts preprocess script updated Jul 1, 2019
.gitignore updating online node2vec codebase Dec 8, 2018
README.md Update README.md Aug 27, 2019

README.md

Online Node2Vec

This repository contains the code related to the research of Ferenc Béres, Róbert Pálovics, Domokos Miklós Kelen and András A. Benczúr.

Cite

We propose two online node embedding models (StreamWalk and second order similarity) for temporally evolving networks. Two nodes are required to be mapped close in the vector space whenever they lie on short paths formed by recent edges in the first model, and whenever the set of their recent neighbors is similar in the second model.

Please cite our paper if you use our work:

@Article{Béres2019,
author="B{\'e}res, Ferenc
and Kelen, Domokos M.
and P{\'a}lovics, R{\'o}bert
and Bencz{\'u}r, Andr{\'a}s A.",
title="Node embeddings in dynamic graphs",
journal="Applied Network Science",
year="2019",
volume="4",
number="1",
pages="64",
}

I presented a former version of our work at the 7th International Conference on Complex Networks and Their Applications that is availabe on this branch.

Data

US Open 2017 (UO17) and Roland-Garros 2017 (RG17) Twitter datasets were published in our previous work for the first time. Please cite this article if you use our data sets in your research:

@Article{Béres2018,
author="B{\'e}res, Ferenc
and P{\'a}lovics, R{\'o}bert
and Ol{\'a}h, Anna
and Bencz{\'u}r, Andr{\'a}s A.",
title="Temporal walk based centrality metric for graph streams",
journal="Applied Network Science",
year="2018",
volume="3",
number="1",
pages="32",
}

These Twitter datasets are available on the website of our research group. In order to process the data you must install the twittertennis Python package. It will automatically download and process the data sets for you.

Requirements

  • UNIX environment
  • Python 3.5 conda environment with pre-installed jupyter:
conda create -n YOUR_CONDA_PY3_ENV python=3.5 jupyter
source activate YOUR_CONDA_PY3_ENV
  • Install the twittertennis Python package for downloading and processing the data for evaluation.
  • Install the following packages with conda or pip:
    • data processing: pandas, numpy
    • scientific: scipy, gensim, networkx, gmpy2
    • general: sys, os, time, random, collections

Usage

After installing every requirement execute the following script to run both node representation learning and evaluation for the similarity search task.

cd scripts
bash run.sh

The major steps in our pipeline are:

You can’t perform that action at this time.