This repository contains the core streaming heterogeneous graph clustering and anomaly detection code.
Before attempting execution, ensure you have the following available as two separate files:
Edges: A file containing one edge per line for all input graphs in the dataset. A sample is provided as
test_edges.txt. The edge file used for the experiments in the paper is available at sbustreamspot-data.
Bootstrap clusters: A file describing the bootstrap clusters. A sample is provided as
test_bootstrap_clusters.txt. Bootstrap clusters used for the experiments in the paper are available here.
The output will contain a summary of the execution parameters, runtime, and the graph cluster assignments and anomaly scores every 10,000 edges. This output can be further analyzed in various dimensions with sbustreamspot-analyze.
Compilation and execution has been tested with GCC 5.2.1 on Ubuntu 15.10.
git clone https://github.com/sbustreamspot/sbustreamspot-core.git cd sbustreamspot-core make clean optimized ./streamspot --edges=test_edges.txt \ --bootstrap=test_bootstrap_clusters.txt \ --chunk-length=10 \ --num-parallel-graphs=10 \ --max-num-edges=10 \ --dataset=all
To use a different dataset, change
Most parameter settings are via the command-line; details on setting
them can be viewed by running
A few parameters are set at compile-time and can be found in