You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository provides the full experimental pipeline for building benchmark datasets from the “Aeolus: A Multi‑structural Flight Delay Dataset” (available on Kaggle: mfdd‑multi‑modal‑flight‑delay‑dataset). It demonstrates how to construct the tabular and graph-based datasets from Aeolus, run multimodal baselines, and visualize delay patterns. You can reproduce experiments across modalities and methodologies, and explore visualizations that help analyze and understand flight delay dynamics.
📂 Repository Structure
Datasets/: Data extraction, tabular and graph constructors.
exp/: Experimental scripts for chain, network, and tabular baselines.
util/: Utility modules for validation and figure generation.
Flight-graph construction (airport/airspace resource interactions)
Support baseline models: MLP, AutoInt, TabulaRNN, ResNet, VGAE+AFM, etc.
🔬 Experiments & Visualization
Tabular baseline experiments under exp/Tab_exp
Time-based modeling pipelines under exp/Chain_exp
Graph-based modeling pipelines under exp/Network_exp
Some visualizations are shown below. For more visualizations, see util/figures
SHAP Summary Plot
The SHAP summary beeswarm plot illustrates how the top‑15 features influence model predictions—each point shows an instance’s feature value and its positive or negative contribution (SHAP value) to delay prediction, with color encoding feature magnitude and horizontal position indicating impact on output
Delay propagation visualization in Flight Chains
The delay propagation visualization plots a flight chain across hub airports—with color‑coded lines showing average arrival delays—highlighting how an upstream delay travels through connections to downstream locations.
Delay propagation visualization in Flight Networks
The delay propagation visualization highlights how average arrival delays flow through connected airports in the flight network—edges colored by delay indicate how upstream disruptions spread across routes in the airport graph.
Monthly Average Arrival Delay Trends (2019-2022) with COVID-19 Impact
This graph illustrates the monthly average arrival delay for the years 2019 to 2022, highlighting the atypical delay patterns during the COVID-19 pandemic, particularly between March and June 2020, when flight disruptions caused an abnormal trend in the data.
🚧 Limitations & Future Work
Lacking fine-grained operational signals such as real‑time ATC interventions, crew rotation, or passenger itineraries
Strong geographic bias: ~78% of flights originate in North America; limited coverage for regional hubs in Africa or South Asia
Large scale and multimodal nature may impose high computational requirements; reproducibility supported via full configuration scripts, with lightweight subsets planned