MeasurementLab (M-Lab) is a consortium of research, industry and public-interest partners dedicated to:
Providing an open, verifiable measurement platform for global network performance
Hosting the largest open Internet performance dataset on the planet
Creating visualizations and tools to help people make sense of Internet performance
Getting Started with BigQuery Documentation: https://www.measurementlab.net/data/docs/bq/quickstart/
We have a few different projects for the hackathon related to data analysis or tool development.
Project 1 (easy): ASN Annotation
Extend our Annotation Service to annotate test data with the correct ASN information from MaxMind. As data runs through our ETL pipeline or our gardener processing service, it is annotated with relevant MaxMind fields, e.g. location information. We would like to extend the annotation to include ASN data that is correct as of the time the test was run: not the most recent MaxMind file, but the historical MaxMind data that matches the ASN assignment at the time of data collection. The data should all be available via MaxMind.
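As a starting point, the time-matching idea can be sketched as: pick the newest snapshot released on or before the test date, then look the client IP up in that snapshot. This is only a sketch under stated assumptions. The `SNAPSHOTS` table, the function names, and the prefix-to-ASN rows are all hypothetical; a real implementation would load archived MaxMind files and is not described here.

```python
import ipaddress
from bisect import bisect_right
from datetime import date

# Hypothetical snapshot index, sorted by release date. Each entry maps
# network prefixes to ASNs. The rows are invented for illustration only.
SNAPSHOTS = [
    (date(2017, 6, 1), {ipaddress.ip_network("192.0.2.0/24"): 64500}),
    (date(2018, 6, 1), {ipaddress.ip_network("192.0.2.0/24"): 64501}),
]


def snapshot_for(test_date, snapshots=SNAPSHOTS):
    """Return the newest table released on or before test_date, or None."""
    dates = [released for released, _ in snapshots]
    index = bisect_right(dates, test_date)
    return snapshots[index - 1][1] if index else None


def annotate_asn(client_ip, test_date, snapshots=SNAPSHOTS):
    """Return the ASN for client_ip as of test_date, or None if unknown."""
    table = snapshot_for(test_date, snapshots)
    if table is None:
        return None  # the test predates every snapshot we have
    address = ipaddress.ip_address(client_ip)
    # Linear scan for clarity; the longest matching prefix wins.
    matches = [(net.prefixlen, asn) for net, asn in table.items() if address in net]
    return max(matches)[1] if matches else None
```

A test run on 2017-12-31 would be annotated from the 2017 snapshot even though a newer one exists, which is the behavior the project asks for.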
Project 2 (medium/advanced): SignalSearcher
SignalSearcher is our analysis toolkit for applying TensorFlow models to M-Lab data. Thus far we have an example that looks for significant or anomalous degradation or improvement that might signal an interconnection issue. We are interested in questions around detecting internet anomalies (e.g. large-scale internet events, internet shutdowns, nation-state interference, country or ASN throttling). At the moment SignalSearcher only works on NDT data, so there's also an opportunity to bring in additional data sets (e.g. Paris Traceroute data) or external data sets (e.g. BGP or OONI measurements). Additionally, we haven't yet explored how we might surface events in our visualization tools, so exploring how to visually present anomalies would also be of interest.
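For anyone who wants a simple baseline to compare a model against, one common approach is a robust z-score over a trailing window (median and median absolute deviation). The sketch below is not SignalSearcher's method and does not use TensorFlow. The function name, the window size, and the threshold are assumptions chosen for illustration.

```python
from statistics import median


def flag_anomalies(values, window=7, threshold=3.5):
    """Return indices whose value deviates strongly from the trailing window.

    Uses the modified z-score: 0.6745 * |x - median| / MAD, where both the
    median and the MAD are computed over the previous `window` values.
    """
    flagged = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        center = median(history)
        mad = median(abs(v - center) for v in history)
        if mad == 0:
            continue  # flat history: the score is undefined, so skip it
        score = 0.6745 * abs(values[i] - center) / mad
        if score > threshold:
            flagged.append(i)
    return flagged
```

Given a daily series of median download speeds for one ASN, the returned indices mark days that differ sharply from the preceding week, in either direction.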
Project 3 (advanced): Paris Traceroute Data Analysis
Every time a test is run against the M-Lab platform, a Paris Traceroute (PTR) test is triggered from the M-Lab server back to the client. So far there has been minimal analysis of PTR data, so the sky is the limit. To give some ideas, we've brainstormed combining path data with NDT measurements, as well as looking at paths that cross country boundaries.
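The country-boundary idea can be sketched as a small helper that walks a path and records each change of country. It assumes every hop has already been geolocated to a country code, with `None` for hops that could not be located. That input shape and the function name are hypothetical and do not reflect the actual Paris Traceroute schema.

```python
def border_crossings(hop_countries):
    """Return a (from, to) pair for each change of country along a path.

    hop_countries is an ordered list of country codes, one per hop,
    with None where the hop could not be geolocated.
    """
    crossings = []
    previous = None
    for country in hop_countries:
        if country is None:
            continue  # unknown hop: compare the next known hop instead
        if previous is not None and country != previous:
            crossings.append((previous, country))
        previous = country
    return crossings
```

For a path with hops in `["US", "US", None, "CA", "CA", "US"]`, this yields two crossings, one from US to CA and one back from CA to US.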
Project 4 (medium): NDTjs Mock Testing Environment
NDT can be integrated into applications to provide measurement as a service. We are seeing more and more interest in integrating NDT into web-based applications, but we do not yet have a good local testing environment for the NDTjs library. Ideally this would include the ability to run a local mock NDT server, with unit tests built in so that implementers can test their applications.
Project 5 (medium): Extend the M-Lab Viz Platform
Our visualization platform is built in React and D3 and can be easily extended to include new visualizations. You could develop new visualization techniques that allow users to understand and analyze the measurements in new ways, add datasets to the visualization tool, or explore ways to surface anomalies (either from SignalSearcher or M-Lab Data Annotations).
Project 6 (easy): Browser Extension
We have had a beta Chrome extension for a few years now, but we'd love to have a Firefox Extension as well, and we'd welcome feature improvements to the current Chrome Extension.
We will have a few team members at SIGCOMM, most likely Peter Boothe (email@example.com) and Georgia Bullen (firstname.lastname@example.org). You can also contact the broader team by emailing email@example.com or pose questions to the community on our discuss list (firstname.lastname@example.org).