GraphSimilarity

Created a solution for graph similiarity using a stochastic approach.

The algorithm doesn't seem to be in academia to the best of my knowledge; The closest thing to this approach is the Weisfeiler-Lehman Graph Hashing (but not really?) This approach is fast but not optimized in lower languages or parallelized (its just a proof of concept that seemlying works really well). If enough interest is put here I might do it.

The nice thing about this algorithm, is that it embeds each graph into a characteristic function where it can be represented as a 1 dimensional array of complex numbers, so the graphs can be embeded into this form for caching purposes and then the similarity metric calculation is just an average of the difference vector magnitudes of the embedding vectors. If more characteric values are needed for the embedding (for higher precision), the new ones just need to be calculated (rather then the whole embedding redone); they are appended to the end of the vector. The calculation of the embedding procedure is parallelizable and the output vector size doesn't rely on graph size, only precision does. Another note, the characteristic function, although calculated using a random procedure, should converge with to the same value for a single graph in theory. In practice, the characteristic function converges to the same values really well from my test, given only using 30 samples (a value used for convergence).

The approach should have an average case time complexity of O((V + E) * C), but the implementation is more like O((V^2) * C) I think I can of optimized this. Just for reference, V is number of verticies of the graph, E is the number of edges, and C is the number of values calculated from the characteristic function used for the embedding 1-d vector. After embedding each graph once, the time complexity for the similiarity value is just O(C) between any two graph embeddings.

For a future addition, I would like to make the characteric function values continuously change after slight changes in the graph all the way to when the graph is disjoint. I have a way of doing this, but it would require base functionality to change, but the overall method will stay the same.

For more information, please look at the test file, hopefully it's not that confusing.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
src/graph_similarity		src/graph_similarity
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphSimilarity

About

Releases

Packages

Languages

License

mackimaow/GraphSimilarity

Folders and files

Latest commit

History

Repository files navigation

GraphSimilarity

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages