data commons entity store and utilities to help match records
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
bin
docs
etc
matchbox
.gitignore
LICENSE
README.md
setup.py

README.md

Matchbox

Matchbox is a toolkit for the matching and merging of entities. It was developed to assist Sunlight Labs in the creation of the Data Commons, a repository of linked open government data sets.

Features

  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system

Installation

Matchbox requires:

Please refer to docs/install.rst for full installation instructions.