@open-city @datamade @dssg

"using my custom formula, I would get a prediction"


  1. 🆔 A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

    Python 3.5k 476

  2. 🇺🇸 A Python library for parsing unstructured United States address strings into address components

    Python 1.3k 273

  3. 🔖 A toolkit for making domain-specific probabilistic parsers

    Python 742 82

  4. 🆔 Command line tool for deduplicating CSV files

    Python 361 79

  5. Estimating Markov random field models with pseudolikelihood

    Python 1 1

  6. 👪 A Python library for parsing unstructured Western names into name components.

    Python 504 64

6,975 contributions in the last year

Contribution activity

August 2022

Reviewed 3 pull requests in 2 repositories
fpdcc/ccfp-asset-dashboard 2 pull requests
dedupeio/dedupe 1 pull request

Created an issue in dedupeio/dedupe that received 1 comment

Documenting the guarantee that fingerprinter won't emit duplicate tokens for the same field.

Right now this is true because we are careful to make sure that every predicate returns unique keys. It would be safer, and sometimes more efficien…

Opened 2 other issues in 2 repositories
550 contributions in private repositories Aug 1 – Aug 11
