Block or report user
Developer Program Member

Organizations

@mozilla @okfn @openspending @OpenNewsLabs @bundestag @okfde @stadtlandcode @occrp

Pinned repositories

  1. aleph

    Sift through large sets of structured and unstructured data, and find the people and companies you look for.

    Python 414 45

  2. dataset

    Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions. Dataset also includes support for freezing data to CSV and JSON flat files.

    Python 2.6k 153

  3. opennames

    An open database of persons of interest and politically exposed persons

    Python 37 8

  4. openspending/spendb

    Next-gen web application for public finance data warehouses, formerly OpenSpending

    Python 46 10

  5. normality

    A tiny library for Python text normalisation. Useful for ad-hoc text processing.

    Python 35 7

  6. fingerprints

    A library to generate entity fingerprints

    Python 17 2

2,182 contributions in the last year

Mar Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mon Wed Fri

Contribution activity First pull request First issue First repository Joined GitHub

February 2017

Created an issue in datamade/dedupe that received 5 comments

Is there a way to do pairwise comparison?

Having trained an instance of Deduper and after using it to cluster a larger dataset, it would be very useful to be able to do pairwise comparisons…

23 contributions in private repositories Feb 3 – Feb 17

Seeing something unexpected? Take a look at the GitHub profile guide.