🌺 Extraction/Analysis for graph features from public datasets, using NetworkX. 🌺
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
scripts_features
scripts_features_adv
scripts_sampling
scripts_to_extract_data
src
.gitignore
README.md
final_report.pdf

README.md

🌺 NetAna - Complex Network Analysis Package 🌺

Extraction and analysis of several graph features from publicly available datasets using NetworkX. Check out my draft paper with some of the preliminary results I have obtained and check out my final report.

Analyzed Features

  • Assortativity
  • Clique number
  • Clustering
  • Density
  • Diameter
  • Edge connectivity
  • Node connectivity
  • Number of cliques
  • Number of edges
  • Number of nodes
  • Radius
  • Clustering and Transitivity
  • Betweeness centrality
  • Closeness centrality
  • Communicability centrality
  • Coreness
  • Degree centrality
  • Eccentricity
  • Number of triangles
  • Pagerank
  • Square clustering
  • Transitivity

Networks

  • Social networks: online social networks, edges represent interactions between people
  • Ground truth: ground-truth network communities in social and information networks
  • Communication: email communication networks with edges representing communication
  • Citation: nodes represent papers, edges represent citations
  • Collaboration: nodes represent scientists, edges represent collaborations (co-authoring a paper)
  • Web graphs: nodes represent webpages and edges are hyperlinks
  • Products: nodes represent products and edges link commonly co-purchased products
  • p2p: nodes represent computers and edges communication
  • Roads: nodes represent intersections and edges roads connecting the intersections
  • Autonomous systems: graphs of the Internet
  • Signed networks: networks with positive and negative edges (friend/foe, trust/distrust)
  • Location based networks: Social networks with geographic check-ins
  • Wikipedia: Talk, editing and voting data from Wikipedia
  • Bio Atlas: Food-webs selected from Ecosystem Network Analysis site and from ATLSS.
  • Bio Cellular: Substrate in cellular network of corresponding organism.
  • Bio Metabolic: Metabolic network of corresponding organism.
  • Bio Carbon: Carbon exchanges in the cypress wetlands of South Florida during the wet and dry season.
  • Bio Yeast: Protein-protein interaction network in budding yeast.

Normalization and Graph Sampling

Performed using snowball sampling (choosing the sample order, i.e. number of nodes). Optimized for number of edges and multiple samplings.

Next Steps


License

When making a reference to my work, please use my website.

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.