data commons entity store and utilities to help match records
bin
docs
etc
matchbox
.gitignore
LICENSE
README.md
setup.py

README.md

Matchbox

Matchbox is a toolkit for matching and merging entities. It was developed to help Sunlight Labs build the Data Commons, a repository of linked open government data sets.

Features

  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API
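
To make the first two features concrete, here is a minimal in-memory sketch of creating, updating, and merging entities while recording merge history. It is illustrative only and is not Matchbox's actual API: the `ToyEntityStore` class, its method names, and the rule that the surviving entity's values win on conflict are all assumptions made for this example.

```python
# Illustrative only: a toy in-memory store, NOT Matchbox's actual API.
import uuid


class ToyEntityStore:
    """Keeps entities by id and records every merge so it can be traced."""

    def __init__(self):
        self.entities = {}      # id -> dict of attributes
        self.merged_into = {}   # id of a merged-away entity -> surviving id
        self.history = []       # (source_id, target_id) pairs in merge order

    def create(self, **attrs):
        entity_id = uuid.uuid4().hex
        self.entities[entity_id] = dict(attrs)
        return entity_id

    def update(self, entity_id, **attrs):
        # Updates addressed to a merged-away id land on the survivor.
        self.entities[self.resolve(entity_id)].update(attrs)

    def resolve(self, entity_id):
        # Follow merge pointers until we reach a surviving entity.
        while entity_id in self.merged_into:
            entity_id = self.merged_into[entity_id]
        return entity_id

    def merge(self, source_id, target_id):
        source_id = self.resolve(source_id)
        target_id = self.resolve(target_id)
        if source_id == target_id:
            return target_id
        source = self.entities.pop(source_id)
        # Assumed rule: the target's values win; the source fills in gaps.
        for key, value in source.items():
            self.entities[target_id].setdefault(key, value)
        self.merged_into[source_id] = target_id
        self.history.append((source_id, target_id))
        return target_id


store = ToyEntityStore()
a = store.create(name="Acme Corp", state="DE")
b = store.create(name="ACME Corporation", city="Dover")
survivor = store.merge(a, b)
```

Keeping a pointer from each merged-away id to its survivor means old ids held by other data sets still resolve after a merge, and the `history` list preserves the order in which merges happened.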

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system

Installation

Please refer to docs/install.rst for requirements and full installation instructions.
