Skip to content
This repository


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

data commons entity store and utilities to help match records

branch: master

This branch is 0 commits ahead and 0 commits behind master

Fetching latest commit…

Cannot retrieve the latest commit at this time


Matchbox is a toolkit for the matching and merging of entities. It was developed to assist Sunlight Labs in the creation of the Data Commons, a repository of linked open government data sets.


  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system


Matchbox requires:

Please refer to docs/install.rst for full installation instructions.

Something went wrong with that request. Please try again.