Skip to content


  • Arctic Code Vault Contributor
  • Pro




  1. IRSx: Turn the IRS' versioned XML 990 nonprofit annual tax returns into standardized python objects, json, or human readable text with original line number and description.

    Python 77 17

  2. Django app to consume and store 990 data and metadata

    Python 13 10

  3. Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

    Python 1.3k 228

  4. process--partially--the senate clerk's report on spending.

    Python 1 6

  5. a python parser for the .fec file format

    Python 21 8

  6. Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.

    HTML 77 16

381 contributions in the last year

Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Mon Wed Fri

Contribution activity

October 2020

Created a pull request in simonw/datasette that received 2 comments

Fix table name in spatialite example command

The example query for creating a new point geometry seems to be using a table called 'museums' but at one point it instead uses 'events'. I believe

+1 −1 2 comments

Created an issue in simonw/datasette that received 7 comments

"Edit SQL" button on canned queries

Feature request: Would it be possible to add an "edit this query" button on canned queries? Clicking it would open the canned query as an editable …

2 contributions in private repositories Oct 23

Seeing something unexpected? Take a look at the GitHub profile guide.

You can’t perform that action at this time.