  • A toolset to work with the Wikidata Graph

    Python Updated Sep 21, 2018
  • JavaScript Updated Sep 21, 2018
  • Generalized Conventional Mutual Information (GenConvMI) - NMI for overlapping (soft, fuzzy) clusters (communities), compatible with standard NMI, pure C++ version (single executable)

    C++ 7 3 LGPL-3.0 Updated Sep 15, 2018
  • Clubmark: a Parallel Isolation Framework for Benchmarking and Profiling Clustering Algorithms on NUMA Architectures

    Python Updated Sep 4, 2018
  • Extremely fast evaluation of the extrinsic clustering measures: various (mean) F1 measures and Omega Index (Fuzzy Rand Index) for the multi-resolution clustering with overlaps/covers, standard NMI, clusters labeling

    C++ 3 1 Apache-2.0 Updated Sep 4, 2018
  • Overlapping Normalized Mutual Information and Omega Index evaluation for the overlapping community structure produced by clustering algorithms

    C++ 4 17 GPL-3.0 Updated Sep 4, 2018
  • Python Multi-Process Execution Pool: a concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing, and profiling of external apps on NUMA architectures

    Python 125 13 Updated Aug 31, 2018
  • This repository contains the pipeline for table detection/extraction from 'Bundesarchive' documents.

    HTML Updated Aug 29, 2018
  • Implementation of HistoSketch and D2HistoSketch in MATLAB

    Matlab Updated Aug 29, 2018
  • AGS Script Updated Aug 20, 2018
  • AGS Script Updated Aug 13, 2018
  • Resolution-level clustering merger with filtering and cluster deduplication. Flattens a hierarchy/list of multiple resolution levels (clusterings) into a single flat clustering (collection), synchronizing the node base and deduplicating clusters.

    C++ 1 Apache-2.0 Updated Jul 26, 2018
  • Python Benchmarking Framework for Clustering Algorithm Evaluation: network generation and shuffling; failover execution and resource consumption tracing (peak RAM RSS, CPU, ...); evaluation of modularity, conductance, NMI and F1 score for overlapping communities

    Python 16 2 Updated Jul 16, 2018
  • RG (Randomized Greedy clustering), CGGC_RG (Core Groups Graph ensemble Clustering) or CGGCi_RG (Core Groups Graph ensemble Clustering Iterative) algorithms

    C++ 1 1 LGPL-2.1 Updated Jul 11, 2018
  • Python 1 Updated Jun 28, 2018
  • Python 1 Updated Jun 11, 2018
  • AGS Script Updated Jun 1, 2018
  • AGS Script Updated Jun 1, 2018
  • Type Inference Evaluation Scripts & Accessory Apps (used for the StaTIX benchmarking)

    Python 1 1 Apache-2.0 Updated May 28, 2018
  • Statistical Type Inference (both fully automatic and semi-supervised) for RDF datasets

    Java 5 1 Apache-2.0 Updated May 28, 2018
  • (Scalable) High-order proximity-preserving Unique node embeddings for undirected graphs

    C Updated May 27, 2018
  • C Updated Apr 23, 2018
  • Network (Graph) Format Converter: RCG, Pajek, Metis, NSL (NCol, SNAP, ...)

    Python 2 1 Apache-2.0 Updated Apr 21, 2018
  • Python 1 3 MIT Updated Apr 15, 2018
  • Benchmark for Centroid Decomposition of streams

    C++ Updated Sep 26, 2017
  • A collection of 30 random Wikipedia pages manually annotated with entities.

    1 Apache-2.0 Updated Sep 6, 2017
  • Role Tagger

    JavaScript Updated Jul 7, 2017
  • SCD

    Forked from DAMA-UPC/SCD
    C++ 12 GPL-3.0 Updated Jun 20, 2017
  • Log Analysis Stack

    Python Apache-2.0 Updated Jun 20, 2017
  • Extended version of the Lancichinetti-Fortunato-Radicchi Benchmark for Undirected Weighted Overlapping networks to evaluate clustering algorithms using generated ground-truth communities

    C++ 12 2 GPL-2.0 Updated May 31, 2017