Skip to content


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
data commons entity store and utilities to help match records
tree: 3112563f4b

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
bin add indexing and index search
docs update install docs with directories that need to be created
matchbox webapi


Matchbox is a toolkit for the matching and merging of entities. It was developed to assist Sunlight Labs in the creation of the Data Commons, a repository of linked open government data sets.


  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system


Matchbox requires:

Please refer to docs/install.rst for full installation instructions.

Something went wrong with that request. Please try again.