sparkling-graph

SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.

Dependencies

Snapshot

resolvers += "Sonatype OSS Snapshots" at "https://oss.sonatype.org/content/repositories/snapshots"
// add one or more of the following:
libraryDependencies += "ml.sparkling" %% "sparkling-graph-examples" % "0.0.8-SNAPSHOT"
libraryDependencies += "ml.sparkling" %% "sparkling-graph-loaders" % "0.0.8-SNAPSHOT"
libraryDependencies += "ml.sparkling" %% "sparkling-graph-operators" % "0.0.8-SNAPSHOT"

Release

// add one or more of the following:
libraryDependencies += "ml.sparkling" %% "sparkling-graph-examples" % "0.0.7"
libraryDependencies += "ml.sparkling" %% "sparkling-graph-loaders" % "0.0.7"
libraryDependencies += "ml.sparkling" %% "sparkling-graph-operators" % "0.0.7"

Current features

  • Loading
    • Formats:
      • CSV
      • GraphML
    • DSL
  • Measures - measures can be configured to treat graphs as directed or undirected
    • Measures DSL - an easy-to-use domain-specific language that boosts productivity when working with the library
    • Graph
      • Modularity
      • Freeman's network centrality
    • Vertex
      • Closeness
      • Local clustering
      • Eigenvector
      • HITS
      • Neighbor connectivity
      • Vertex embeddedness
      • Betweenness
        • Edmonds
        • Flow
        • Hua
    • Edges
      • Adamic/Adar
      • Common neighbours
  • Community detection methods
    • PSCAN (SCAN)
  • Graph coarsening
    • Label Propagation based
  • Link prediction
    • Similarity measure based
  • Generators
    • Ring
    • Watts and Strogatz
  • Experiments
    • Describe graph using all measures to CSV files
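
To make the edge measures listed above more concrete, here is a minimal plain-Scala sketch of what "Common neighbours" and "Adamic/Adar" compute on a small undirected graph. It does not use SparklingGraph, Spark, or GraphX, and it is not the library's implementation or API: the object name EdgeMeasuresSketch and the functions adjacency, commonNeighbours, and adamicAdar are hypothetical names chosen for this illustration. The Adamic/Adar score is taken here in its usual textbook form, the sum of 1 / ln(degree(z)) over all common neighbours z.

```scala
// Illustrative only: in-memory versions of two edge measures.
// All names here are hypothetical and are NOT part of the SparklingGraph API.
object EdgeMeasuresSketch {
  type VertexId = Long

  /** Builds an undirected adjacency map from a list of edges. */
  def adjacency(edges: Seq[(VertexId, VertexId)]): Map[VertexId, Set[VertexId]] =
    edges
      .flatMap { case (a, b) => Seq(a -> b, b -> a) } // store both directions
      .groupBy(_._1)
      .map { case (v, pairs) => v -> pairs.map(_._2).toSet }

  /** Vertices adjacent to both u and v. */
  def commonNeighbours(adj: Map[VertexId, Set[VertexId]],
                       u: VertexId,
                       v: VertexId): Set[VertexId] =
    adj.getOrElse(u, Set.empty[VertexId]) intersect adj.getOrElse(v, Set.empty[VertexId])

  /**
   * Adamic/Adar score: sum of 1 / ln(degree(z)) over common neighbours z.
   * For distinct u and v every common neighbour has degree >= 2, so the
   * logarithm is positive. Converting to Seq before mapping keeps equal
   * terms from being collapsed by Set semantics.
   */
  def adamicAdar(adj: Map[VertexId, Set[VertexId]],
                 u: VertexId,
                 v: VertexId): Double =
    commonNeighbours(adj, u, v).toSeq
      .map(z => 1.0 / math.log(adj(z).size.toDouble))
      .sum

  def main(args: Array[String]): Unit = {
    val edges = Seq((1L, 2L), (1L, 3L), (2L, 3L), (3L, 4L), (2L, 4L))
    val adj = adjacency(edges)
    println(s"Common neighbours of 1 and 4: ${commonNeighbours(adj, 1L, 4L)}")
    println(s"Adamic/Adar score of (1, 4): ${adamicAdar(adj, 1L, 4L)}")
  }
}
```

In the sample graph, vertices 1 and 4 are not connected directly but share neighbours 2 and 3, which is the situation similarity-based link prediction scores. A distributed version would need to compute neighbour sets with Spark rather than an in-memory Map; the library's docs and examples describe how it does this.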

Planned features

  • Loading
    • GML
  • Measures
    • Katz
  • Community detection methods
    • Modularity maximization
    • Infomap
  • More Generators
  • API
    • Random walk
    • BFS
  • ML
    • Vertex classification

Used by

Supported by:

provides us awesome IDE

How to

Please check the API, the examples, or the docs.

Citation

If you use SparklingGraph in your research and publish it, please consider citing us; it will help us get funding for making the library better. The manuscript is currently in preparation, so please use the following reference:

Bartusiak et al. (2017). SparklingGraph: large scale, distributed graph processing made easy. Manuscript in preparation.

@unpublished{sparkling-graph,
title={SparklingGraph: large scale, distributed graph processing made easy},
author={Bartusiak, R. and Kajdanowicz, T.},
note = {Manuscript in preparation},
year = {2017}
}

License

FOSSA Status