Project Summary:

The free online encyclopedia Wikipedia says: "The term Network Monitoring describes the use of a system that constantly monitors a computer network for slow or failing systems and that notifies the network administrator in case of outages via email, pager or other alarms."

For any company to grow and succeed, it is essential that it establish a method of monitoring its network activity. This should include solutions that can automatically detect and respond to threats and performance issues in real time, as well as predict possible future issues.

Project Description:

The objective of this project is to add monitoring infrastructure to the data engineering pipeline. This provides reporting on the capacity and performance of the hardware and applications, so you can tell when you are running into issues and can also use the data to forecast future needs.

Project Challenges:

  1. Learning curve

  2. Identifying what needs to be monitored

    • Creating a “visual” map of the most critical parts of the network
    • Avoiding information overload
  3. Deciding how to measure what is being monitored

    • Setting baselines
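
One common way to set a baseline, sketched here as an illustration rather than as this project's actual method, is to take the mean of historical readings and flag anything more than a few standard deviations above it. The function names, the multiplier `k`, and the sample latencies below are all assumptions invented for the example.

```python
from statistics import mean, stdev


def compute_baseline(samples, k=3.0):
    """Return (center, upper_limit) estimated from historical readings.

    center is the arithmetic mean; upper_limit is center plus k sample
    standard deviations. k=3.0 is an arbitrary illustrative default.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples to estimate spread")
    center = mean(samples)
    upper_limit = center + k * stdev(samples)
    return center, upper_limit


def is_anomalous(value, upper_limit):
    """A reading counts as anomalous when it exceeds the upper limit."""
    return value > upper_limit


# Hypothetical query latencies in milliseconds, invented for illustration.
history_ms = [10.0, 12.0, 11.0, 13.0, 9.0, 11.0]
center, limit = compute_baseline(history_ms)
```

A fixed multiplier is easy to reason about but assumes the metric is roughly stable; metrics with daily or weekly cycles usually need a baseline per time window instead.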

Data Pipeline:

Ingestion: AWS S3
Data Processing: Spark
Storage: MySQL
User Interface (UI): Flask
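
For a pipeline stage to be monitored, it has to publish readings in a format the monitoring system can scrape. The sketch below renders a gauge in the Prometheus text exposition format using only the standard library. The metric name `pipeline_stage_up` and the `stage` label are hypothetical, and label values are assumed to contain no quotes, backslashes, or newlines (which would need escaping). In practice the official `prometheus_client` package would normally handle this.

```python
def render_gauge(name, help_text, samples):
    """Render one gauge metric family as Prometheus exposition text.

    samples maps a tuple of (label_name, label_value) pairs to a number.
    Label values are NOT escaped here; this is a simplification.
    """
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} gauge"]
    for labels, value in samples.items():
        if labels:
            label_str = ",".join(f'{key}="{val}"' for key, val in labels)
            lines.append(f"{name}{{{label_str}}} {value}")
        else:
            lines.append(f"{name} {value}")
    # The exposition format expects a trailing newline after the last sample.
    return "\n".join(lines) + "\n"


# Hypothetical readings: 1 means the stage is reachable, 0 means it is not.
stage_up = {
    (("stage", "ingestion"),): 1,
    (("stage", "processing"),): 1,
    (("stage", "storage"),): 0,
}
text = render_gauge(
    "pipeline_stage_up",
    "Whether a pipeline stage is reachable (1) or not (0).",
    stage_up,
)
```

Serving `text` from an HTTP endpoint (conventionally `/metrics`) with a plain-text content type is what lets a scraper collect it.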

Monitoring System:

Display: Grafana
Monitoring: Prometheus
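
Grafana reads its data by querying Prometheus, and the same HTTP API can be called directly to check that metrics are arriving. The sketch below builds an instant-query URL for the `/api/v1/query` endpoint and parses a vector result. The base URL, the `mysql` job name, and the sample response are assumptions made up for illustration; no network call is made.

```python
import json
from urllib.parse import urlencode


def build_query_url(base_url, promql):
    """Build the URL for a Prometheus instant query."""
    query_string = urlencode({"query": promql})
    return f"{base_url.rstrip('/')}/api/v1/query?{query_string}"


def parse_instant_vector(body):
    """Return a list of (labels, value) pairs from an instant-query response."""
    payload = json.loads(body)
    if payload.get("status") != "success":
        raise RuntimeError(f"query failed: {payload.get('error', 'unknown error')}")
    data = payload["data"]
    if data["resultType"] != "vector":
        raise ValueError(f"expected a vector, got {data['resultType']}")
    # Each sample value is a [timestamp, "value-as-string"] pair.
    return [(item["metric"], float(item["value"][1])) for item in data["result"]]


# Assumed deployment: Prometheus on localhost:9090, a scrape job named "mysql".
url = build_query_url("http://localhost:9090/", 'up{job="mysql"}')

# Invented response body with the shape of a successful instant query.
sample_body = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {
                "metric": {"__name__": "up", "job": "mysql", "instance": "localhost:9104"},
                "value": [1700000000.0, "1"],
            }
        ],
    },
})
results = parse_instant_vector(sample_body)
```

Fetching `url` with any HTTP client and passing the response text to `parse_instant_vector` would give the live value; a result of `1.0` for `up` means the last scrape of that target succeeded.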

Future Work:

  • Containerize the monitoring system
  • Add caching to the system

Data Source:

New York City Parking Violations


References:

  • Prometheus
  • How to use Prometheus
  • How to Install and Secure Grafana
  • mysql_exporter
  • Monitoring MySQL
  • Monitoring Read/Write Latency