Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

data commons entity store and utilities to help match records

branch: master

Fetching latest commit…

Octocat-spinner-32-eaf2f5

Cannot retrieve the latest commit at this time

Octocat-spinner-32 bin
Octocat-spinner-32 docs
Octocat-spinner-32 etc
Octocat-spinner-32 matchbox
Octocat-spinner-32 .gitignore
Octocat-spinner-32 LICENSE
Octocat-spinner-32 README.md
Octocat-spinner-32 setup.py
README.md

Matchbox

Matchbox is a toolkit for the matching and merging of entities. It was developed to assist Sunlight Labs in the creation of the Data Commons, a repository of linked open government data sets.

Features

  • supports creation and updating of entities
  • handles merging of entities and history of merges
  • Python module for interacting with the entity store
  • simple HTTP-based API

Coming Soon

  • text matching algorithms for finding merge candidates
  • web-based administration interface for managing the merge process
  • importers and exporters for getting data into and results from the system

Installation

Matchbox requires:

Please refer to docs/install.rst for full installation instructions.

Something went wrong with that request. Please try again.