Skip to content

DerekDiJin/Domain_Knowledge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exploratory Analysis of Graph Data by Leveraging Domain Knowledge

This is the code repository for the paper: Exploratory Analysis of Graph Data by Leveraging Domain Knowledge

By Di Jin, Danai Koutra.

"Summarize an unknown graph from known ones."

Table of Contents

DATA

The data directory contains "real_train", the directory containing raw files of the domain knowledge (known graphs) and "real_test", the directory containing the input unknown graph file.

analysis (and supplementary results to the paper)

The directory with experiments conducted in the paper. To run the experiments, run "exp_effectiveness", "exp_scalability_1", "exp_scalability_2" and "exp_sensitivity". For example, the evaluation of the diversity and domain-specificity of the graph invariant distributions selected by EAGLE and the baselines is conducted with the command

$ exp_effectiveness

The supplementary results of Satisfaction of Desired Properties (Section V, part D) from the paper can be obtained by running the same script with different correlation metrics. To be specific, the first figure shows the evaluation based on correlation using Pearson correlation Pearson which is the figure in the paper. The second figure shows the evaluation based on correlation using Kendall's tau. Kendall's tau The third figure shows the evaluation based on correlation using Spearman's rank correlation. Spearman As stated in the paper, in all three cases we observe that EAGLE outperforms the baseline methods.

extra_Features

The directory contains the extra graph invariants computed through SNAP.

lib

processed

This directory contains the processed raw graphs in the format to run EAGLE.

records

This directory contains the stored experimental results in the paper conducted with scripts in analysis.

src

This code is built in MATLAB 2016a. The preprocessing procedure is time-consuming, the command to run without preprocessing the raw data files is:

$ main

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published