Organizations

@habari @ziada @habari-extras
chrismeller/README.md

Howdy 👋

Hi, I'm Chris, and I'm a backend C# and Node.js TypeScript developer. I spend most of my time at work writing distributed processes to handle complex workflows and large amounts of data. At home I'm usually writing code to harvest data or developing small apps and hardware projects to make something easier for me. So far none of them have been helpful to anyone else, but I can dream, right?

Once or twice the harvesting has meant (politely) polling an API. More often it involves page scraping a government website or two, sometimes even with Selenium, to provide data in the format it should have been made available in all along.

My Starbucks dataset has been referenced by many "name brand" news agencies, including QZ, CBS 2 Chicago, Politico, and The Guardian. It's also been cited without attribution by Computerworld and Business Insider, who are still on my list and should cry themselves to sleep every night, dreaming of forgiveness. It's been referenced in academia by a project at The Ohio State University and in a joint paper from Princeton, UC Riverside, Carnegie Mellon, and City University of Hong Kong presented at the 2017 15th IEEE International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt).

I've also scraped millions of Last.fm profiles, processed tens of gigabytes of TIGER/Line US Census data files to extract a definitive list of ZIP codes, digitized thousands of South Carolina school district purchase order and credit card transactions, and filed dozens of Freedom of Information Act requests. Once upon a time, while hacking my own router, I also discovered that I could take over every consumer device for the largest internet provider in the US Virgin Islands. Most recently, I'm harvesting traffic camera footage to track the arrival and departure of trams because I don't think the schedules are reliable.

Pinned

  1. StarbucksScraper Public archive

    A very mean, very bare-bones C# console app that hammers the Starbucks Store Locator API to extract every store in the world.

    C# 12 5

  2. EfCeeEm Public

    WIP. A port of chrismeller/fourchan-monitor to .NET 6 using MassTransit.

    C# 1

  3. SCAlcoholLicenses Public

    A C# library to page scrape the South Carolina Department of Revenue's listing of issued alcohol licenses using Microsoft Playwright with Chromium.

    C# 5 1

  4. fourchan-monitor Public

    A NestJS microservices app using NATS that politely polls the 4chan API for all threads and posts in a board and dumps them in a SQLite database. Currently at 25 GB of data saved and climbing.

    TypeScript 3

  5. habari/habari Public

    A basic Habari site, ready to be forked and customized!

    PHP 178 48

  6. estonian-aircraft Public

    Flat data of all the aircraft registered in Estonia.

    HTML 1