Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
data commons entity store and utilities to help match records
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
bin
docs update install docs with directories that need to be created
etc
matchbox
.gitignore
LICENSE
README.md
setup.py

README.md

Matchbox

Matchbox is a toolkit for the matching and merging of entities. It was developed to assist Sunlight Labs in the creation of the Data Commons, a repository of linked open government data sets.

Features

  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system

Installation

Matchbox requires:

Please refer to docs/install.rst for full installation instructions.

Something went wrong with that request. Please try again.