- Great Lakes
- https://bunkum.us
- @bunkum.us
Starred repositories
Split a JSON file with hierarchical data to multiple CSV files
golang client for the culturedcode things cloud
Computer Vision Basics in Microsoft Excel (using just formulas)
Put realtime data on a Leaflet map
GitHub Action to build Python manylinux wheels
Scraper for the State of Michigan's Department of Licensing and Regulatory Affairs' business entity database
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
HTML Content / Article Extractor, web scraping lib in Python
Extends the queryset and SolrBackend models for Django Haystack so that it supports Solr's cursor pagination and eDisMax (in progress)
Materials for "Queer Communities, Civic Tech, and Open Data" workshop at MozFest 2018
Natural Language Processing of Chicago news articles
A pytest plugin for preserving test isolation in Flask-SQLAlchemy using database transactions.
Learning String Alignments for Entity Aliases
A repository of scripts for extracting news articles from US newspapers
Python 3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also in…
Pweave is a scientific report generator and a literate programming tool for Python. It can capture the results and plots from data analysis and works well with numpy, scipy and matplotlib.
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
sample streams using reservoir sampling
Automated data extraction from U.S. state Comprehensive Annual Financial Reports (CAFR).
Logical Replication extension for PostgreSQL 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version up…
Python search module for fast approximate string matching
A Python encapsulation of the Steorts et al. (2015) graphical record linkage system
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to t…