tspannhw/FLaNK-MTA

FLaNK-MTA

MTA Data Sources as well as other New York, New Jersey and Pennsylvania Data

Finding the Best Way Around

See: https://medium.com/@tspann/finding-the-best-way-around-7491c76ca4cb

TITLE: Utilizing Real-Time Transit Data for Travel Optimization

ABSTRACT: Many factors determine how you find your way around while avoiding delays, bad weather, dangers and expenses. In this talk I will focus on public transport in the largest transit system in the United States, the MTA, which is centered on New York City. Because it relies on public and semi-public data feeds, the approach can be extended to most cities and metropolitan areas around the world. As a personal example, I live in New Jersey, and I find this an extremely practical use of open source and public data.

Once I am notified that I need to travel to Manhattan, I need to start my data streams flowing. Most of the data sources are REST feeds that are ingested by Apache NiFi, which transforms, converts, enriches and finalizes the data for use in streaming tables with Flink SQL while keeping the same data contract for Kafka consumers, Iceberg tables and other users of this data. I do not need too many user interfaces to interact with the system, as I want my final decision sent to me in a Slack message and then I'll get moving. Along the way, data will be visible in NiFi lineage, Kafka topic views, Flink SQL output, REST output and Iceberg tables.
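The flatten-then-notify step described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the NiFi flow used in this repository: the input record shape (`id`, `alert.header_text`, `alert.informed_entity[].route_id`) is a simplified, hypothetical layout loosely modeled on GTFS Realtime alerts, and the helper names `flatten_alert` and `slack_payload` are made up for this example. The only external fact relied on is that Slack incoming webhooks accept a JSON body with a `text` field.

```python
import json
from typing import Any


def flatten_alert(raw: dict[str, Any]) -> dict[str, Any]:
    """Flatten one nested alert entity into a flat, table-friendly record.

    The nested field names are assumptions for illustration only.
    """
    alert = raw.get("alert", {})
    # Collect the distinct route ids the alert applies to, in sorted order.
    routes = sorted(
        {e["route_id"] for e in alert.get("informed_entity", []) if "route_id" in e}
    )
    return {
        "id": raw.get("id", ""),
        "routes": ",".join(routes),
        "header": alert.get("header_text", ""),
    }


def slack_payload(records: list[dict[str, Any]], watched_routes: set[str]) -> str:
    """Build the JSON body for a Slack incoming webhook from flat records."""
    # Keep only alerts that touch at least one route the traveler cares about.
    hits = [r for r in records if watched_routes & set(r["routes"].split(","))]
    if not hits:
        text = "No alerts on watched routes. Good to go."
    else:
        lines = [f"[{r['routes']}] {r['header']}" for r in hits]
        text = "Alerts on watched routes:\n" + "\n".join(lines)
    return json.dumps({"text": text})


if __name__ == "__main__":
    # Hypothetical sample entity; a real feed would be fetched over REST.
    sample = {
        "id": "alert-1",
        "alert": {
            "header_text": "Delays due to signal problems",
            "informed_entity": [{"route_id": "A"}, {"route_id": "C"}],
        },
    }
    print(slack_payload([flatten_alert(sample)], {"A", "E"}))
```

Flattening first keeps one record shape for every downstream consumer, so the same row could feed a Kafka topic, a Flink SQL table and the Slack message without each consumer re-parsing the nested feed.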

I am implementing this real-time travel optimization application utilizing many Apache projects, including Apache NiFi, Apache Kafka, Apache OpenNLP, Apache Tika, Apache Flink, Apache Avro, Apache Parquet and Apache Iceberg.

Other Related Datasources (especially with Latitude, Longitude, City Name, Zip Code, Country, State, Location)

  • NYC Traffic Cameras
  • TRANSCOM Feeds
  • Government Status
  • Social Media (Facebook, Twitter, Mastodon, Bluesky, Reddit, YouTube, RSS, ...)
  • Air Quality (2 sources)
  • ADS-B Plane Tracking (Lat, Long)
  • Weather (Lat, Long, Zip)
  • RSS Feeds
  • News Feeds
  • CitiBike
  • Government Feeds
  • NJ Transit Bus, Rail, Light Rail, Travel Advisories as RSS/XML
  • PA Traffic Cameras
  • Weather Cameras

Developer Docs

GTFS Real-Time Feed

Data

GTFS Real-Time Data

More Data

TODO

References