# Airlines

## Load Data
The Airlines dataset is inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given information about its scheduled departure.

In [None]:
from xautoml.util.datasets import openml_task

X_train, y_train = openml_task(189354, 0, train=True)
X_train

## Install FLAML
If you haven't installed FLAML yet, you can install it with pip:

```
pip install flaml
```

## Start the Model Building

You load the dataset into an AutoML tool that you found online in order to create a predictive model. Once the optimization starts, the AutoML tool tests a range of candidate models and evaluates how well each one performs. Meanwhile, you have to wait for the optimization to finish.
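
To make the idea of "testing candidates until the time runs out" concrete, here is a minimal, standard-library-only sketch of a time-budgeted search loop. It is an illustration only and not how FLAML searches internally; the names `budgeted_search` and `toy_score`, and the toy `depth` parameter, are hypothetical.

```python
import random
import time


def budgeted_search(candidates, evaluate, time_budget, seed=0):
    """Evaluate candidates in random order until the time budget is used up.

    Returns the best candidate, its score, and the list of (candidate, score)
    pairs that were evaluated. Higher scores are better.
    """
    rng = random.Random(seed)
    order = list(candidates)
    rng.shuffle(order)

    deadline = time.monotonic() + time_budget
    best, best_score = None, float("-inf")
    history = []
    for candidate in order:
        if time.monotonic() >= deadline:
            break  # budget exhausted: stop and keep the best model so far
        score = evaluate(candidate)
        history.append((candidate, score))
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score, history


def toy_score(config):
    # Hypothetical stand-in for "train a model and measure its accuracy":
    # the score peaks at depth 6.
    return -abs(config["depth"] - 6)


candidates = [{"depth": d} for d in range(1, 11)]
best, best_score, history = budgeted_search(candidates, toy_score, time_budget=1.0)
print(best, best_score, len(history))
```

A real AutoML tool replaces `toy_score` with actual model training and validation, and typically chooses the next candidate more cleverly than by random order.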

In [None]:
from sklearn.pipeline import Pipeline
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler, OrdinalEncoder
from flaml import AutoML

# Encode the categorical columns as integers; pass all other columns through unchanged.
encoder = ColumnTransformer(
    [('ordinal', OrdinalEncoder(), ['Airline', 'AirportFrom', 'AirportTo'])],
    remainder='passthrough'
)

pipeline = Pipeline([
    ("encoding", encoder),
    ("standardizer", StandardScaler()),
    ("automl", AutoML())
])

# Parameters prefixed with "automl__" are forwarded to AutoML.fit().
pipeline.fit(X_train, y_train,
             automl__time_budget=60,  # search time budget in seconds
             automl__metric='accuracy',
             automl__estimator_list=['xgboost', 'catboost', 'lgbm'],  # learners to consider
             # Keep the model history, training metrics, and search state after fitting.
             automl__model_history=True,
             automl__log_training_metric=True,
             automl__keep_search_state=True)
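
The two preprocessing steps in the pipeline can be sketched in plain Python to show what they do to a single column. This is a simplified illustration under stated assumptions, not scikit-learn's implementation: the helpers `ordinal_encode` and `standardize` are hypothetical, and the airline codes are made-up example values rather than rows from the dataset.

```python
from statistics import mean, pstdev


def ordinal_encode(values):
    """Map each distinct category to an integer, using sorted category order."""
    categories = sorted(set(values))
    index = {category: i for i, category in enumerate(categories)}
    return [index[v] for v in values], categories


def standardize(values):
    """Shift values to zero mean and scale them to unit population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - mu) / sigma for v in values]


airlines = ["DL", "AA", "DL", "WN"]  # made-up example values
codes, categories = ordinal_encode(airlines)
scaled = standardize(codes)

print(categories)  # ['AA', 'DL', 'WN']
print(codes)       # [1, 0, 1, 2]
print(scaled)
```

Note that ordinal encoding imposes an arbitrary order on the categories (here, alphabetical), so the resulting numbers carry no meaning beyond identifying the category.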

## Visualize the Optimization Run in XAutoML

XAutoML's visualization of FLAML runs is still experimental.


In [None]:
from xautoml.main import XAutoML
from xautoml.adapter import import_flaml
from xautoml.util.datasets import openml_task

X_test, y_test = openml_task(189354, 0, test=True)

rh = import_flaml(pipeline)
main = XAutoML(rh, X_test, y_test)
main