FlightSense — Flight Delay Predictor

Loading LSTM model...

Flight Details

Day of Week

Distance (miles)

Departure Hour (0-23)

Arrival Hour (0-23)

Airline

Origin Airport

Destination Airport

Prediction Result

Fill in the flight details and click Run Prediction.

--

probability of delay (15+ min arrival)

Model Breakdown

Logistic Regression simulated

--%

MLP simulated

--%

Bidirectional LSTM simulated

--%

Target variable: arrival delay of 15 minutes or more. Click any chart to view it full size.

Class Distribution

Delayed vs. not delayed — shows class imbalance

Delay Rate by Hour

Evening flights accumulate more delays

Delay Rate by Day of Week

Fridays and Sundays have the highest delay rates

Delay Rate by Distance

Short-haul flights are more exposed to congestion

Correlation Heatmap

No single feature dominates — problem is non-linear

Best ROC-AUC

0.6905

Bidirectional LSTM

Best F1 Score

0.4571

Bidirectional LSTM

Best Recall

0.6761

Bidirectional LSTM

Model	Accuracy	Precision	Recall	F1 Score	ROC-AUC
Logistic Regression Baseline	0.6099	0.3305	0.6452	0.4371	0.6552
MLP	0.6189	0.3414	0.6711	0.4526	0.6847
Bidirectional LSTM Best	0.6230	0.3453	0.6761	0.4571	0.6905

Training Curves

MLP vs LSTM loss and AUC over epochs

Model Comparison Chart

All metrics side by side

Confusion Matrices

Where each model makes mistakes

ROC Curves

All three models overlaid

Precision-Recall Curves

More informative than ROC for imbalanced data

Why accuracy looks low: The dataset is ~80% not-delayed. A model that always predicts "not delayed" scores 80% accuracy without learning anything. We focus on Recall (catching actual delays) and F1 Score as primary metrics. The LSTM's ROC-AUC of 0.6905 means it correctly ranks a delayed flight above a non-delayed flight 69% of the time, versus 50% for random guessing.

Understanding model errors is as important as measuring accuracy. These charts show how confident each model is on delayed vs. non-delayed flights, and which departure hours are hardest to predict correctly.

Probability Score Distributions

How confident each model is — separated by true class

LSTM Error Rate by Departure Hour

Early morning and late night departures are the hardest to predict

Probability distributions: A well-separated model produces two distinct humps — one near 0 for non-delayed flights and one near 1 for delayed flights. The overlap in the middle represents the hard cases where the model is uncertain. Error by hour: Errors cluster at early morning (0–5 AM) and late night (21–23) departures. These windows have sparse training data and unpredictable disruption patterns that schedule-based features alone cannot capture.

Team

EE
Elias Estacion
RH
Rochane Hurst
MR
Meliton Rojas
BB
Bricio Blancas Salgado
WS
Wendy Santiago
MV
Michael Vu

Dataset

Source U.S. Bureau of Transportation Statistics
Period May – October 2025
Size ~4.2 million flights
Target Arrival delay >= 15 minutes
Features 7 (schedule-based)
Split 80% train / 10% val / 10% test

Features Used

DAY_OF_WEEK Day the flight operates
DEP_HOUR Scheduled departure hour
ARR_HOUR Scheduled arrival hour
DISTANCE Flight distance in miles
CARRIER_ENC Airline (label encoded)
ORIGIN_ENC Origin airport (label encoded)
DEST_ENC Destination airport (label encoded)

LSTM Architecture

1

Input sequence (7 timesteps x 1 feature)

↓

2

Bidirectional LSTM (64 units) + BatchNorm + Dropout

↓

3

Bidirectional LSTM (32 units) + BatchNorm + Dropout

↓

4

Dense (64) + Dense (32) classification head

↓

5

Sigmoid output — delay probability

Predictions use the actual trained Bidirectional LSTM model loaded via TensorFlow.js. Logistic Regression and MLP values are approximated for comparison. Source code and notebooks available in this repository.

Flight Delay Predictor

Flight Details

Prediction Result

Model Breakdown

Exploratory Data Analysis

Class Distribution

Delay Rate by Hour

Delay Rate by Day of Week

Delay Rate by Distance

Correlation Heatmap

Model Results

Training Curves

Model Comparison Chart

Confusion Matrices

ROC Curves

Precision-Recall Curves

Error Analysis

Probability Score Distributions

LSTM Error Rate by Departure Hour

About This Project

Team

Dataset

Features Used

LSTM Architecture