In [1]:
# Inspired by Gaurav Dutta's work in the competition "Binary Prediction of Poisonous Mushrooms"

# 🚗 **2024 Kaggle Playground Series: Used Car Price Prediction** 🏁

Welcome to the 2024 Kaggle Playground Series! 🎉 In this competition, we predict the price of used cars from attributes such as brand, model year, mileage, engine, and accident history. The goal is to build a robust model with **AutoGluon**, using GPU acceleration where it helps. Let's dive in!

## 📚 **Libraries and Setup**

First, let's install and import the necessary libraries. We'll use AutoGluon for its ease of use and efficiency in model training and deployment.



In [2]:
!pip install ray==2.10.0 autogluon.tabular ipywidgets catboost==1.2.5 -q

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
cuml 24.8.0 requires cupy-cuda11x>=12.0.0, which is not installed.
aiobotocore 2.13.2 requires botocore<1.34.132,>=1.34.70, but you have botocore 1.29.165 which is incompatible.
bigframes 0.22.0 requires google-cloud-bigquery[bqstorage,pandas]>=3.10.0, but you have google-cloud-bigquery 2.34.4 which is incompatible.
bigframes 0.22.0 requires google-cloud-storage>=2.0.0, but you have google-cloud-storage 1.44.0 which is incompatible.
bigframes 0.22.0 requires pandas<2.1.4,>=1.5.0, but you have pandas 2.2.2 which is incompatible.
dataproc-jupyter-plugin 0.1.79 requires pydantic~=1.10.0, but you have pydantic 2.8.2 which is incompatible.
libpysal 4.9.2 requires packaging>=22, but you have packaging 21.3 which is incompatible.
libpysal 4.9.2 requires shapely>=2.0.1, but you have shapely 1.8.5.post1 which 

In [3]:
import pandas as pd
from autogluon.tabular import TabularDataset, TabularPredictor
import warnings
warnings.filterwarnings('ignore')
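
The install cell above ends with a list of dependency conflicts, so it can help to confirm which versions actually ended up in the environment. The following is a minimal sketch using only the standard library; the helper name `installed_version` and the list of packages are illustrative choices, not part of the original notebook.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Distribution names pinned or pulled in by the install cell (illustrative list).
for name in ["autogluon.tabular", "ray", "catboost", "pandas"]:
    print(f"{name}: {installed_version(name) or 'not installed'}")
```

Running this right after the install cell makes it easier to tell whether a later failure comes from a version mismatch rather than from the modelling code.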

## 💾 **Load and Explore the Data**

Let's load the training and test sets and take a quick look at the first few rows of the training data.

In [4]:
train = pd.read_csv('/kaggle/input/playground-series-s4e9/train.csv')
test = pd.read_csv('/kaggle/input/playground-series-s4e9/test.csv')
train.head()

| | id | brand | model | model_year | milage | fuel_type | engine | transmission | ext_col | int_col | accident | clean_title | price |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | MINI | Cooper S Base | 2007 | 213000 | Gasoline | 172.0HP 1.6L 4 Cylinder Engine Gasoline Fuel | A/T | Yellow | Gray | None reported | Yes | 4200 |
| 1 | 1 | Lincoln | LS V8 | 2002 | 143250 | Gasoline | 252.0HP 3.9L 8 Cylinder Engine Gasoline Fuel | A/T | Silver | Beige | At least 1 accident or damage reported | Yes | 4999 |
| 2 | 2 | Chevrolet | Silverado 2500 LT | 2002 | 136731 | E85 Flex Fuel | 320.0HP 5.3L 8 Cylinder Engine Flex Fuel Capab... | A/T | Blue | Gray | None reported | Yes | 13900 |
| 3 | 3 | Genesis | G90 5.0 Ultimate | 2017 | 19500 | Gasoline | 420.0HP 5.0L 8 Cylinder Engine Gasoline Fuel | Transmission w/Dual Shift Mode | Black | Black | None reported | Yes | 45000 |
| 4 | 4 | Mercedes-Benz | Metris Base | 2021 | 7388 | Gasoline | 208.0HP 2.0L 4 Cylinder Engine Gasoline Fuel | 7-Speed A/T | Black | Beige | None reported | Yes | 97500 |


In [5]:
train.duplicated().value_counts() # no duplicates :)

False    188533
Name: count, dtype: int64
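
One caveat on the check above: each row carries its own `id`, so a full-row comparison can only flag a duplicate if the `id` repeats as well. A hedged sketch of a stricter check, which drops `id` before comparing, is shown here on a tiny made-up frame (the three rows are illustrative, not taken from the competition data).

```python
import pandas as pd

# Tiny illustrative frame: rows 0 and 2 describe the same listing under different ids.
toy = pd.DataFrame(
    {
        "id": [0, 1, 2],
        "brand": ["MINI", "Lincoln", "MINI"],
        "milage": [213000, 143250, 213000],
        "price": [4200, 4999, 4200],
    }
)

# With id included, every row is unique by construction.
full_row_dupes = int(toy.duplicated().sum())
# Without id, the repeated listing is counted once (the first copy is kept).
content_dupes = int(toy.drop(columns="id").duplicated().sum())

print(full_row_dupes, content_dupes)
```

On the real data the same idea would be `train.drop(columns='id').duplicated().sum()`; whether that finds any repeated listings is not shown in this notebook.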

## 🚀 **Training the Model with AutoGluon**

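Now we'll train the model with AutoGluon. Under the `best_quality` preset it handles feature preprocessing, trains several model families, and combines them through bagging and stacked ensembling, so little manual tuning is needed. We request one GPU to speed up training; note that the training log warns that GPU training may reduce model quality for some models compared with CPU training.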
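
A note on reading the training log further down: AutoGluon reports validation scores in a higher-is-better form, so with RMSE as the metric the log prints negative numbers labelled `-root_mean_squared_error`. The sketch below pulls those scores out of log text and flips the sign back. It assumes the line layout seen in this notebook's log (a `.../models/<name>/model.pkl` save line followed by a `= Validation score` line); the function name and regular expressions are illustrative and not part of AutoGluon.

```python
import re

# Colour codes appear in the log as "[36m", "[0m", ...; the escape byte may or may not be present.
ANSI_CODE = re.compile(r"\x1b?\[\d+m")
# The model name is taken from the ".../models/<name>/model.pkl" save line that precedes each score.
MODEL_SAVE = re.compile(r"models/([^/]+)/model\.pkl")
SCORE_LINE = re.compile(r"(-?\d+(?:\.\d+)?)\s+= Validation score")


def validation_rmse_by_model(log_text):
    """Map each model name to its validation RMSE, with the sign flipped back to positive."""
    scores = {}
    current_model = None
    for raw_line in log_text.splitlines():
        line = ANSI_CODE.sub("", raw_line)
        saved = MODEL_SAVE.search(line)
        if saved:
            current_model = saved.group(1)
            continue
        scored = SCORE_LINE.search(line)
        if scored and current_model is not None:
            scores[current_model] = -float(scored.group(1))
    return scores


# Two score entries copied from this notebook's log.
sample_log = (
    "[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMXT_BAG_L1/model.pkl\n"
    "[36m(_dystack pid=167)[0m \t-73332.5258\t = Validation score   (-root_mean_squared_error)\n"
    "[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_BAG_L1/model.pkl\n"
    "[36m(_dystack pid=167)[0m \t-73655.3912\t = Validation score   (-root_mean_squared_error)\n"
)

print(validation_rmse_by_model(sample_log))
```

For anything beyond a quick look, `predictor.leaderboard()` is the more direct route once training has finished; the parser is only handy when all that is available is the captured log.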

In [6]:
label = 'price'
train[label].describe()

count    1.885330e+05
mean     4.387802e+04
std      7.881952e+04
min      2.000000e+03
25%      1.700000e+04
50%      3.082500e+04
75%      4.990000e+04
max      2.954083e+06
Name: price, dtype: float64
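
The summary shows a mean (about 43,878) well above the median (30,825) and a maximum near 2.95 million, which points to a long right tail. That matters because the next cell trains with `root_mean_squared_error`, a metric that squares each error before averaging, so a few very expensive cars can dominate the score. A small sketch with made-up numbers (the prices below are illustrative, not from the dataset):

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error of two equal-length sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Nine predictions that are off by 1,000 and one that is off by 100,000.
y_true = [20_000] * 9 + [500_000]
y_pred = [21_000] * 9 + [400_000]

print(round(rmse(y_true, y_pred)))       # all ten cars
print(round(rmse(y_true[:9], y_pred[:9])))  # without the expensive outlier
```

Comparing the two printed values shows how much a single large miss contributes, which is worth keeping in mind when reading the validation scores in the training log.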

In [7]:
predictor = TabularPredictor(
    label=label,
    eval_metric='root_mean_squared_error',
    problem_type='regression',
).fit(
    train,
    presets='best_quality',
    time_limit=3600 * 11,  # 11 hours, in seconds
    verbosity=3,
    excluded_model_types=['KNN'],
    ag_args_fit={'num_gpus': 1},
)
results = predictor.fit_summary()

No path specified. Models will be saved in: "AutogluonModels/ag-20240910_171549"
Verbosity: 3 (Detailed Logging)
AutoGluon Version:  1.1.1
Python Version:     3.10.14
Operating System:   Linux
Platform Machine:   x86_64
Platform Version:   #1 SMP Thu Jun 27 20:43:36 UTC 2024
CPU Count:          4
GPU Count:          2
Memory Avail:       30.16 GB / 31.36 GB (96.2%)
Disk Space Avail:   19.50 GB / 19.52 GB (99.9%)
Presets specified: ['best_quality']
User Specified kwargs:
{'ag_args_fit': {'num_gpus': 1},
 'auto_stack': True,
 'excluded_model_types': ['KNN'],
 'num_bag_sets': 1,
 'verbosity': 3}
Full kwargs:
{'_feature_generator_kwargs': None,
 '_save_bag_folds': None,
 'ag_args': None,
 'ag_args_ensemble': None,
 'ag_args_fit': {'num_gpus': 1},
 'auto_stack': True,
 'calibrate': 'auto',
 'ds_args': {'clean_up_fits': True,
             'detection_time_frac': 0.25,
             'enable_ray_logging': True,
             'holdout_data': None,
             'holdout_frac': 0.1111111111111111,
 

[36m(_ray_fit pid=436)[0m [50]	valid_set's rmse: 82322


[36m(_ray_fit pid=493)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster] (Ray deduplicates logs by default. Set RAY_DEDUP_LOGS=0 to disable log deduplication, or see https://docs.ray.io/en/master/ray-observability/ray-logging.html#log-deduplication for more options.)[0m
[36m(_ray_fit pid=437)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU
[36m(_ray_fit pid=493)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=493)[0m [50]	valid_set's rmse: 69880.3[32m [repeated 3x across cluster][0m


[36m(_ray_fit pid=551)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=520)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=551)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=551)[0m [50]	valid_set's rmse: 67583[32m [repeated 3x across cluster][0m


[36m(_ray_fit pid=609)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=578)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=609)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=609)[0m [50]	valid_set's rmse: 61900.6[32m [repeated 4x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMXT_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMXT_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73332.5258	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	40.71s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.31s	 = Validation runtime
[36m(_dystack pid=167)[0m 	16039.5	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: LightGBM_BAG_L1 ... Training model for up to 6533.8s of the 9827.0s of remaining time.
[36m(_ray_fit pid=637)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=637)[0m [LightGBM] [Fatal] bin size 1668 cannot run on G

[36m(_ray_fit pid=759)[0m [50]	valid_set's rmse: 82343.8[32m [repeated 3x across cluster][0m


[36m(_ray_fit pid=816)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=760)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU
[36m(_ray_fit pid=816)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=816)[0m [50]	valid_set's rmse: 70321.1[32m [repeated 3x across cluster][0m


[36m(_ray_fit pid=875)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=845)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=875)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=875)[0m [50]	valid_set's rmse: 67767.7[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=933)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=903)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=933)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=933)[0m [50]	valid_set's rmse: 62461.7[32m [repeated 2x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73655.3912	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	37.61s	 = Training   runtime
[36m(_dystack pid=167)[0m 	0.77s	 = Validation runtime
[36m(_dystack pid=167)[0m 	27063.2	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: RandomForestMSE_BAG_L1 ... Training model for up to 6493.56s of the 9786.76s of remaining time.
[36m(_ray_fit pid=962)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=962)[0m [LightGBM] [Fatal] bin size 1668 cannot run

[36m(_ray_fit pid=1099)[0m 0:	learn: 77311.6402447	test: 88003.1367496	best: 88003.1367496 (0)	total: 268ms	remaining: 536ms
[36m(_ray_fit pid=962)[0m [50]	valid_set's rmse: 71115.9
[36m(_ray_fit pid=1098)[0m bestTest = 86222.31329
[36m(_ray_fit pid=1098)[0m bestIteration = 2
[36m(_ray_fit pid=1098)[0m 120:	learn: 71345.6164832	test: 82274.6601930	best: 82274.6601930 (120)	total: 3.39s	remaining: 43.9s[32m [repeated 17x across cluster][0m
[36m(_ray_fit pid=1099)[0m bestTest = 87325.04992
[36m(_ray_fit pid=1099)[0m bestIteration = 2
[36m(_ray_fit pid=1099)[0m bestTest = 83150.38293
[36m(_ray_fit pid=1099)[0m bestIteration = 289
[36m(_ray_fit pid=1099)[0m Shrink model to first 290 iterations.
[36m(_ray_fit pid=1098)[0m 280:	learn: 70368.5066108	test: 82201.4294370	best: 82201.4294370 (280)	total: 8.25s	remaining: 41.3s[32m [repeated 18x across cluster][0m


[36m(_ray_fit pid=1181)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=1181)[0m bestTest = 74295.31455[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=1181)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=1098)[0m Shrink model to first 386 iterations.
[36m(_ray_fit pid=1181)[0m 2:	learn: 78458.3309823	test: 74295.3145535	best: 74295.3145535 (2)	total: 41.7ms	remaining: 0us[32m [repeated 9x across cluster][0m
[36m(_ray_fit pid=1221)[0m bestTest = 83321.07326
[36m(_ray_fit pid=1221)[0m bestIteration = 2
[36m(_ray_fit pid=1181)[0m 140:	learn: 73100.1212964	test: 69576.9705613	best: 69572.4350924 (137)	total: 3.36s	remaining: 51.3s[32m [repeated 11x across cluster][0m
[36m(_ray_fit pid=1181)[0m bestTest = 69562.80775
[36m(_ray_fit pid=1181)[0m bestIteration = 201
[36m(_ray_fit pid=1181)[0m Shrink model to first 202 iterations.
[36m(_ray_fit pid=1221)[0m 200:	learn: 71302.9849077	test: 79090.0991733	best: 79083.0913919 (164)	total: 5.36s	remaining: 1m 8s[32m [repeated 15x across 

[36m(_ray_fit pid=1271)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=1221)[0m bestTest = 79075.50062
[36m(_ray_fit pid=1221)[0m bestIteration = 293
[36m(_ray_fit pid=1221)[0m Shrink model to first 294 iterations.
[36m(_ray_fit pid=1271)[0m 0:	learn: 79420.2177508	test: 73582.9002673	best: 73582.9002673 (0)	total: 25.7ms	remaining: 1m 1s[32m [repeated 10x across cluster][0m


[36m(_ray_fit pid=1316)[0m 	Training S1F6 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=1271)[0m bestTest = 72776.19388
[36m(_ray_fit pid=1271)[0m bestIteration = 2
[36m(_ray_fit pid=1316)[0m bestTest = 74387.68963
[36m(_ray_fit pid=1316)[0m bestIteration = 2
[36m(_ray_fit pid=1271)[0m 220:	learn: 72901.1554674	test: 67401.5061287	best: 67400.1366570 (211)	total: 5.05s	remaining: 49.8s[32m [repeated 13x across cluster][0m
[36m(_ray_fit pid=1271)[0m bestTest = 67365.69762
[36m(_ray_fit pid=1271)[0m bestIteration = 339
[36m(_ray_fit pid=1271)[0m Shrink model to first 340 iterations.
[36m(_ray_fit pid=1316)[0m 180:	learn: 72877.1736096	test: 69477.0382777	best: 69477.0382777 (180)	total: 4.82s	remaining: 1m 3s[32m [repeated 18x across cluster][0m


[36m(_ray_fit pid=1361)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=1361)[0m bestTest = 67422.59426
[36m(_ray_fit pid=1361)[0m bestIteration = 2
[36m(_ray_fit pid=1361)[0m 2:	learn: 79393.5474715	test: 67422.5942627	best: 67422.5942627 (2)	total: 36.1ms	remaining: 0us[32m [repeated 11x across cluster][0m
[36m(_ray_fit pid=1316)[0m Shrink model to first 330 iterations.


[36m(_ray_fit pid=1405)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=1316)[0m bestTest = 69446.73157
[36m(_ray_fit pid=1316)[0m bestIteration = 329
[36m(_ray_fit pid=1361)[0m 140:	learn: 74258.1946045	test: 61776.0344826	best: 61776.0344826 (140)	total: 3.44s	remaining: 1m 4s[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=1405)[0m bestTest = 75739.79947
[36m(_ray_fit pid=1405)[0m bestIteration = 2
[36m(_ray_fit pid=1361)[0m Shrink model to first 258 iterations.
[36m(_ray_fit pid=1405)[0m 120:	learn: 73078.4686743	test: 70804.7804898	best: 70804.2884440 (119)	total: 2.83s	remaining: 52.6s[32m [repeated 17x across cluster][0m
[36m(_ray_fit pid=1361)[0m bestTest = 61651.18291
[36m(_ray_fit pid=1361)[0m bestIteration = 257
[36m(_ray_fit pid=1405)[0m bestTest = 70751.41319
[36m(_ray_fit pid=1405)[0m bestIteration = 203


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73252.3851	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	64.57s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.11s	 = Validation runtime
[36m(_dystack pid=167)[0m 	18807.3	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: ExtraTreesMSE_BAG_L1 ... Training model for up to 6085.48s of the 9378.67s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting ExtraTreesMSE_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/ExtraTreesMS

[36m(_ray_fit pid=1972)[0m [0]	validation_0-rmse:86424.94535
[36m(_ray_fit pid=1405)[0m Shrink model to first 204 iterations.
[36m(_ray_fit pid=1405)[0m 240:	learn: 72295.6618773	test: 70754.5600447	best: 70751.4131852 (203)	total: 5.42s	remaining: 47.8s[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=1972)[0m Potential solutions:
[36m(_ray_fit pid=1972)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=1972)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=2035)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=2035)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=1973)[0m Potential solutions:
[36m(_ray_fit pid=1973)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=1973)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=2033)[0m [0]	validation_0-rmse:74586.04403[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=2033)[0m Potential solutions:
[36m(_ray_fit pid=2033)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=2033)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=2097)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=2097)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=2035)[0m Potential solutions:
[36m(_ray_fit pid=2035)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=2035)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=2097)[0m [0]	validation_0-rmse:74606.25773[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=2097)[0m Potential solutions:
[36m(_ray_fit pid=2097)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=2097)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=2159)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=2159)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=2095)[0m Potential solutions:
[36m(_ray_fit pid=2095)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=2095)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=2159)[0m [0]	validation_0-rmse:67650.73027[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=2159)[0m Potential solutions:
[36m(_ray_fit pid=2159)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=2159)[0m - Set the device for booster before call to inplace_predict.
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-74076.0485	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	22.54s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.13s	 = Validation runtime
[36m(_dystack pid=167)[0m 	18556.4	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetTorch_BAG_L1 ... Training model for up to 5040.98s

[36m(_ray_fit pid=2683)[0m [50]	valid_set's rmse: 84303.2
[36m(_ray_fit pid=2185)[0m [89]	validation_0-rmse:71544.70260[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=2739)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=2682)[0m [LightGBM] [Fatal] bin size 1669 cannot run on GPU
[36m(_ray_fit pid=2739)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=2739)[0m [50]	valid_set's rmse: 70767[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=2798)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=2741)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=2798)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=2798)[0m [50]	valid_set's rmse: 68297.7[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=2856)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=2800)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=2856)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=2856)[0m [50]	valid_set's rmse: 62681.5[32m [repeated 3x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMLarge_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMLarge_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-74049.3777	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	47.41s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.28s	 = Validation runtime
[36m(_dystack pid=167)[0m 	16313.6	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: CatBoost_r177_BAG_L1 ... Training model for up to 4279.87s of the 7573.06s of remaining time.
[36m(_ray_fit pid=2884)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=2884)[0m [LightGBM] [Fatal] bin size 1668 

[36m(_ray_fit pid=3006)[0m 0:	learn: 77276.1350443	test: 86726.5606146	best: 86726.5606146 (0)	total: 50.2ms	remaining: 100ms
[36m(_ray_fit pid=2884)[0m [100]	valid_set's rmse: 71602.1[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3006)[0m 2:	learn: 76185.8383622	test: 85792.1180353	best: 85792.1180353 (2)	total: 125ms	remaining: 0us
[36m(_ray_fit pid=3006)[0m bestTest = 85792.11804
[36m(_ray_fit pid=3006)[0m bestIteration = 2
[36m(_ray_fit pid=3006)[0m 140:	learn: 70691.4064705	test: 82244.6628969	best: 82244.6628969 (140)	total: 3.04s	remaining: 35.4s[32m [repeated 17x across cluster][0m
[36m(_ray_fit pid=3007)[0m bestTest = 86955.38217
[36m(_ray_fit pid=3007)[0m bestIteration = 2
[36m(_ray_fit pid=3007)[0m bestTest = 83157.20205
[36m(_ray_fit pid=3007)[0m bestIteration = 150
[36m(_ray_fit pid=3007)[0m Shrink model to first 151 iterations.


[36m(_ray_fit pid=3089)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=3089)[0m 0:	learn: 79051.7437160	test: 74884.6925115	best: 74884.6925115 (0)	total: 44.3ms	remaining: 88.7ms[32m [repeated 10x across cluster][0m
[36m(_ray_fit pid=3006)[0m bestTest = 82224.65375
[36m(_ray_fit pid=3006)[0m bestIteration = 209
[36m(_ray_fit pid=3006)[0m Shrink model to first 210 iterations.
[36m(_ray_fit pid=3089)[0m bestTest = 73786.16406
[36m(_ray_fit pid=3089)[0m bestIteration = 2
[36m(_ray_fit pid=3123)[0m 60:	learn: 72022.5075957	test: 79173.1275396	best: 79173.1275396 (60)	total: 1.69s	remaining: 39.9s[32m [repeated 15x across cluster][0m
[36m(_ray_fit pid=3123)[0m bestTest = 82900.02869
[36m(_ray_fit pid=3123)[0m bestIteration = 2
[36m(_ray_fit pid=3089)[0m bestTest = 69591.72606
[36m(_ray_fit pid=3089)[0m bestIteration = 173
[36m(_ray_fit pid=3089)[0m Shrink model to first 174 iterations.


[36m(_ray_fit pid=3184)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=3184)[0m 0:	learn: 79253.8800376	test: 73405.0072275	best: 73405.0072275 (0)	total: 39.4ms	remaining: 78.9ms[32m [repeated 10x across cluster][0m
[36m(_ray_fit pid=3123)[0m bestTest = 79094.099
[36m(_ray_fit pid=3123)[0m bestIteration = 113
[36m(_ray_fit pid=3123)[0m Shrink model to first 114 iterations.
[36m(_ray_fit pid=3184)[0m bestTest = 72258.76715
[36m(_ray_fit pid=3184)[0m bestIteration = 2
[36m(_ray_fit pid=3184)[0m 140:	learn: 72805.6678050	test: 67398.7998356	best: 67378.8394241 (130)	total: 3.09s	remaining: 36.3s[32m [repeated 18x across cluster][0m
[36m(_ray_fit pid=3189)[0m bestTest = 73883.91839
[36m(_ray_fit pid=3189)[0m bestIteration = 2
[36m(_ray_fit pid=3184)[0m bestTest = 67363.01961
[36m(_ray_fit pid=3184)[0m bestIteration = 186
[36m(_ray_fit pid=3184)[0m Shrink model to first 187 iterations.


[36m(_ray_fit pid=3278)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=3278)[0m 2:	learn: 78947.4033487	test: 66871.7642540	best: 66871.7642540 (2)	total: 99.6ms	remaining: 0us[32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=3278)[0m bestTest = 66871.76425[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3278)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3189)[0m Shrink model to first 195 iterations.
[36m(_ray_fit pid=3280)[0m 160:	learn: 72274.6864176	test: 70728.5886051	best: 70723.6598612 (156)	total: 3.61s	remaining: 36.7s[32m [repeated 19x across cluster][0m
[36m(_ray_fit pid=3280)[0m bestTest = 75268.68563
[36m(_ray_fit pid=3280)[0m bestIteration = 2
[36m(_ray_fit pid=3280)[0m bestTest = 70712.27516
[36m(_ray_fit pid=3280)[0m bestIteration = 170
[36m(_ray_fit pid=3280)[0m Shrink model to first 171 iterations.


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r177_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r177_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73245.1053	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	49.38s	 = Training   runtime
[36m(_dystack pid=167)[0m 	0.98s	 = Validation runtime
[36m(_dystack pid=167)[0m 	21360.9	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetTorch_r79_BAG_L1 ... Training model for up to 4227.84s of the 7521.04s of remaining time.
[36m(_ray_fit pid=3280)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_dystack pid=167)[0m 	Fitting NeuralNetTorch_r79_B

[36m(_ray_fit pid=3830)[0m [50]	valid_set's rmse: 83877.3
[36m(_ray_fit pid=3278)[0m 320:	learn: 71982.3782355	test: 61542.3343293	best: 61524.4833450 (281)	total: 7.23s	remaining: 32.6s[32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=3278)[0m bestTest = 61524.48334
[36m(_ray_fit pid=3278)[0m bestIteration = 281
[36m(_ray_fit pid=3278)[0m Shrink model to first 282 iterations.
[36m(_ray_fit pid=3830)[0m [200]	valid_set's rmse: 82188.3[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=3888)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3831)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU
[36m(_ray_fit pid=3888)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=3888)[0m [50]	valid_set's rmse: 71615.6[32m [repeated 5x across cluster][0m
[36m(_ray_fit pid=3888)[0m [200]	valid_set's rmse: 70214.8[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=3946)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3916)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=3946)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=3946)[0m [50]	valid_set's rmse: 69739.8[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=3974)[0m [100]	valid_set's rmse: 70193.9[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=3974)[0m [250]	valid_set's rmse: 69748.9[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=4004)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=3974)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=4004)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=4004)[0m [50]	valid_set's rmse: 64231.7
[36m(_ray_fit pid=4032)[0m [50]	valid_set's rmse: 72952.7
[36m(_ray_fit pid=4032)[0m [200]	valid_set's rmse: 71038.3[32m [repeated 6x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r131_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r131_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73543.3012	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	72.75s	 = Training   runtime
[36m(_dystack pid=167)[0m 	3.23s	 = Validation runtime
[36m(_dystack pid=167)[0m 	6491.1	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetFastAI_r191_BAG_L1 ... Training model for up to 3442.27s of the 6735.46s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting NeuralNetFastAI_r191_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit

[36m(_ray_fit pid=4564)[0m 0:	learn: 77544.3546225	test: 86992.5199819	best: 86992.5199819 (0)	total: 202ms	remaining: 404ms
[36m(_ray_fit pid=4564)[0m 2:	learn: 76837.8078461	test: 86460.9734512	best: 86460.9734512 (2)	total: 261ms	remaining: 0us
[36m(_ray_fit pid=4564)[0m bestTest = 86460.97345
[36m(_ray_fit pid=4564)[0m bestIteration = 2
[36m(_ray_fit pid=4032)[0m [250]	valid_set's rmse: 71113.4[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4564)[0m Shrink model to first 116 iterations.
[36m(_ray_fit pid=4565)[0m 160:	learn: 65400.9823889	test: 83184.6799520	best: 83175.4654902 (149)	total: 3.3s	remaining: 14.7s[32m [repeated 20x across cluster][0m
[36m(_ray_fit pid=4564)[0m bestTest = 82014.14335[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4564)[0m bestIteration = 115[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4647)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4565)[0m Shrink model to first 195 iterations.
[36m(_ray_fit pid=4647)[0m 0:	learn: 79278.8770445	test: 75185.7129808	best: 75185.7129808 (0)	total: 18.2ms	remaining: 23.3s[32m [repeated 7x across cluster][0m
[36m(_ray_fit pid=4647)[0m bestTest = 74569.56452[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4647)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4647)[0m Shrink model to first 116 iterations.
[36m(_ray_fit pid=4681)[0m 140:	learn: 67166.0855009	test: 79026.0921268	best: 79013.5850346 (124)	total: 2.42s	remaining: 14.1s[32m [repeated 18x across cluster][0m
[36m(_ray_fit pid=4647)[0m bestTest = 69758.78758[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4647)[0m bestIteration = 115[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4737)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4681)[0m Shrink model to first 125 iterations.
[36m(_ray_fit pid=4737)[0m 20:	learn: 74707.4982962	test: 69480.3437166	best: 69480.3437166 (20)	total: 361ms	remaining: 16.8s[32m [repeated 5x across cluster][0m
[36m(_ray_fit pid=4737)[0m bestTest = 73032.38932[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4737)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4737)[0m Shrink model to first 130 iterations.
[36m(_ray_fit pid=4771)[0m 160:	learn: 67857.6282080	test: 69550.7148467	best: 69525.6789298 (126)	total: 2.87s	remaining: 13.9s[32m [repeated 18x across cluster][0m
[36m(_ray_fit pid=4737)[0m bestTest = 67270.97229[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4737)[0m bestIteration = 129[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4827)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=4771)[0m Shrink model to first 127 iterations.
[36m(_ray_fit pid=4827)[0m 40:	learn: 73366.2196592	test: 62429.3251531	best: 62429.3251531 (40)	total: 760ms	remaining: 17.8s[32m [repeated 5x across cluster][0m
[36m(_ray_fit pid=4827)[0m bestTest = 67657.42082[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4827)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=4827)[0m Shrink model to first 181 iterations.
[36m(_ray_fit pid=4861)[0m 180:	learn: 67204.7699186	test: 70799.9220948	best: 70760.3100595 (142)	total: 2.65s	remaining: 10.7s[32m [repeated 21x across cluster][0m
[36m(_ray_fit pid=4861)[0m bestTest = 70760.31006[32m [repeated 3x across cluster][0m
[36m(_ray_fit pid=4861)[0m bestIteration = 142[32m [repeated 3x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r9_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r9_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73237.922	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	39.16s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.02s	 = Validation runtime
[36m(_dystack pid=167)[0m 	20637.0	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: LightGBM_r96_BAG_L1 ... Training model for up to 2455.7s of the 5748.89s of remaining time.
[36m(_ray_fit pid=4861)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_dystack pid=167)[0m 	Fitting LightGBM_r96_BAG_L1 with 'num_gp

[36m(_ray_fit pid=5016)[0m [50]	valid_set's rmse: 85254.5
[36m(_ray_fit pid=4861)[0m Shrink model to first 143 iterations.
[36m(_ray_fit pid=5016)[0m [250]	valid_set's rmse: 83245.2[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=5016)[0m [450]	valid_set's rmse: 83210.1[32m [repeated 7x across cluster][0m


[36m(_ray_fit pid=5072)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=5015)[0m [LightGBM] [Fatal] bin size 1669 cannot run on GPU
[36m(_ray_fit pid=5072)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=5072)[0m [50]	valid_set's rmse: 71868.7[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=5100)[0m [150]	valid_set's rmse: 79291.1[32m [repeated 6x across cluster][0m
[36m(_ray_fit pid=5100)[0m [350]	valid_set's rmse: 79011.3[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=5130)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=5100)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=5130)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=5130)[0m [100]	valid_set's rmse: 68480.6[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=5159)[0m 	Training S1F6 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=5159)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=5130)[0m [300]	valid_set's rmse: 67458.3[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=5130)[0m [500]	valid_set's rmse: 67440.8[32m [repeated 7x across cluster][0m


[36m(_ray_fit pid=5188)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=5159)[0m [400]	valid_set's rmse: 69293.5[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=5188)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=5188)[0m [150]	valid_set's rmse: 62102.3[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=5217)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=5217)[0m [LightGBM] [Fatal] bin size 1668 cannot run on GPU


[36m(_ray_fit pid=5188)[0m [350]	valid_set's rmse: 61687[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=5217)[0m [200]	valid_set's rmse: 71019.9[32m [repeated 5x across cluster][0m
[36m(_ray_fit pid=5217)[0m [400]	valid_set's rmse: 70915.3[32m [repeated 4x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r96_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r96_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73278.1604	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	85.12s	 = Training   runtime
[36m(_dystack pid=167)[0m 	5.21s	 = Validation runtime
[36m(_dystack pid=167)[0m 	4017.2	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetTorch_r22_BAG_L1 ... Training model for up to 2367.38s of the 5660.58s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting NeuralNetTorch_r22_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/mo

[36m(_ray_fit pid=5709)[0m [0]	validation_0-rmse:88224.60899


[36m(_ray_fit pid=5708)[0m Potential solutions:
[36m(_ray_fit pid=5708)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5708)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=5769)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=5769)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=5709)[0m Potential solutions:
[36m(_ray_fit pid=5709)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5709)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=5769)[0m [0]	validation_0-rmse:75335.11470[32m [repeated 11x across cluster][0m


[36m(_ray_fit pid=5769)[0m Potential solutions:
[36m(_ray_fit pid=5769)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5769)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=5831)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=5831)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=5771)[0m Potential solutions:
[36m(_ray_fit pid=5771)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5771)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=5831)[0m [0]	validation_0-rmse:73863.31803[32m [repeated 10x across cluster][0m


[36m(_ray_fit pid=5831)[0m Potential solutions:
[36m(_ray_fit pid=5831)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5831)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=5893)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=5893)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=5833)[0m Potential solutions:
[36m(_ray_fit pid=5833)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5833)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=5893)[0m [0]	validation_0-rmse:68523.75215[32m [repeated 10x across cluster][0m


[36m(_ray_fit pid=5893)[0m Potential solutions:
[36m(_ray_fit pid=5893)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=5893)[0m - Set the device for booster before call to inplace_predict.
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_r33_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_r33_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-74174.9112	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	31.19s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.26s	 = Validation runtime
[36m(_dystack pid=167)[0m 	16628.4	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: ExtraTrees_r42_BAG_L1 ... Training model for up to 

[36m(_ray_fit pid=6061)[0m 0:	learn: 77354.9331597	test: 88031.6631084	best: 88031.6631084 (0)	total: 33.5ms	remaining: 67ms
[36m(_ray_fit pid=5920)[0m [182]	validation_0-rmse:71672.58072[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=6061)[0m 2:	learn: 76632.4578642	test: 87395.1376023	best: 87395.1376023 (2)	total: 81.9ms	remaining: 0us
[36m(_ray_fit pid=6061)[0m bestTest = 87395.1376
[36m(_ray_fit pid=6061)[0m bestIteration = 2
[36m(_ray_fit pid=6060)[0m 200:	learn: 71567.0556539	test: 82257.5063112	best: 82254.5609745 (196)	total: 2.98s	remaining: 5.49s[32m [repeated 25x across cluster][0m
[36m(_ray_fit pid=6060)[0m bestTest = 86266.64643
[36m(_ray_fit pid=6060)[0m bestIteration = 2
[36m(_ray_fit pid=6060)[0m bestTest = 82239.71585
[36m(_ray_fit pid=6060)[0m bestIteration = 317
[36m(_ray_fit pid=6060)[0m Shrink model to first 318 iterations.


[36m(_ray_fit pid=6143)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=6143)[0m 0:	learn: 79247.2447927	test: 75092.4158096	best: 75092.4158096 (0)	total: 37ms	remaining: 74ms[32m [repeated 20x across cluster][0m
[36m(_ray_fit pid=6061)[0m bestTest = 83147.66668
[36m(_ray_fit pid=6061)[0m bestIteration = 398
[36m(_ray_fit pid=6061)[0m Shrink model to first 399 iterations.
[36m(_ray_fit pid=6143)[0m bestTest = 74355.43684
[36m(_ray_fit pid=6143)[0m bestIteration = 2
[36m(_ray_fit pid=6175)[0m 200:	learn: 72121.1453271	test: 79136.2646294	best: 79134.5997904 (194)	total: 2.9s	remaining: 8.21s[32m [repeated 27x across cluster][0m
[36m(_ray_fit pid=6175)[0m bestTest = 83407.4181
[36m(_ray_fit pid=6175)[0m bestIteration = 2
[36m(_ray_fit pid=6143)[0m bestTest = 69595.02893
[36m(_ray_fit pid=6143)[0m bestIteration = 358
[36m(_ray_fit pid=6143)[0m Shrink model to first 359 iterations.


[36m(_ray_fit pid=6233)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=6233)[0m 0:	learn: 79460.5505184	test: 73635.7218546	best: 73635.7218546 (0)	total: 32ms	remaining: 64.1ms[32m [repeated 17x across cluster][0m
[36m(_ray_fit pid=6175)[0m bestTest = 79119.74373
[36m(_ray_fit pid=6175)[0m bestIteration = 329
[36m(_ray_fit pid=6175)[0m Shrink model to first 330 iterations.
[36m(_ray_fit pid=6233)[0m bestTest = 72856.799
[36m(_ray_fit pid=6233)[0m bestIteration = 2
[36m(_ray_fit pid=6233)[0m 240:	learn: 73636.8413477	test: 67459.4856027	best: 67459.2451885 (228)	total: 3.52s	remaining: 7.54s[32m [repeated 29x across cluster][0m
[36m(_ray_fit pid=6241)[0m bestTest = 74393.75364
[36m(_ray_fit pid=6241)[0m bestIteration = 2
[36m(_ray_fit pid=6241)[0m bestTest = 69428.15014
[36m(_ray_fit pid=6241)[0m bestIteration = 452
[36m(_ray_fit pid=6241)[0m Shrink model to first 453 iterations.
[36m(_ray_fit pid=6233)[0m 560:	learn: 72852.6184845	test: 67386.5206336	best: 67378.6343818 (520)	total: 8.59s	remaining: 3s[32m

[36m(_ray_fit pid=6326)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=6326)[0m bestTest = 67474.06319[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=6326)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=6233)[0m Shrink model to first 521 iterations.
[36m(_ray_fit pid=6326)[0m 0:	learn: 80142.1883749	test: 68289.1478176	best: 68289.1478176 (0)	total: 19.1ms	remaining: 15.4s[32m [repeated 3x across cluster][0m
[36m(_ray_fit pid=6360)[0m bestTest = 75830.73178
[36m(_ray_fit pid=6360)[0m bestIteration = 2
[36m(_ray_fit pid=6326)[0m 360:	learn: 74225.9826592	test: 61708.9324439	best: 61708.0596240 (358)	total: 5.29s	remaining: 6.51s[32m [repeated 33x across cluster][0m
[36m(_ray_fit pid=6326)[0m bestTest = 61698.8894
[36m(_ray_fit pid=6326)[0m bestIteration = 379
[36m(_ray_fit pid=6326)[0m Shrink model to first 380 iterations.


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r137_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r137_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73275.4024	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	53.14s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.01s	 = Validation runtime
[36m(_dystack pid=167)[0m 	20648.9	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetFastAI_r102_BAG_L1 ... Training model for up to 1807.31s of the 5100.5s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting NeuralNetFastAI_r102_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit

[36m(_ray_fit pid=6925)[0m 0:	learn: 77577.3832646	test: 88238.9771801	best: 88238.9771801 (0)	total: 68.7ms	remaining: 137ms[32m [repeated 17x across cluster][0m
[36m(_ray_fit pid=6360)[0m bestTest = 70760.65527
[36m(_ray_fit pid=6360)[0m bestIteration = 466
[36m(_ray_fit pid=6360)[0m Shrink model to first 467 iterations.
[36m(_ray_fit pid=6925)[0m bestTest = 87958.65449
[36m(_ray_fit pid=6925)[0m bestIteration = 2
[36m(_ray_fit pid=6924)[0m 100:	learn: 72034.0747586	test: 82637.3401578	best: 82637.3401578 (100)	total: 3.08s	remaining: 12.9s[32m [repeated 14x across cluster][0m
[36m(_ray_fit pid=6924)[0m bestTest = 86859.21794
[36m(_ray_fit pid=6924)[0m bestIteration = 2
[36m(_ray_fit pid=6925)[0m 240:	learn: 70886.5451003	test: 83210.9168015	best: 83210.9168015 (240)	total: 8.58s	remaining: 15.5s[32m [repeated 15x across cluster][0m
[36m(_ray_fit pid=6924)[0m 420:	learn: 70249.4433323	test: 82225.2722604	best: 82225.2722604 (420)	total: 13.6s	remaining: 3.

[36m(_ray_fit pid=7007)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=6925)[0m bestTest = 83076.95524
[36m(_ray_fit pid=6925)[0m bestIteration = 672
[36m(_ray_fit pid=6925)[0m Shrink model to first 673 iterations.
[36m(_ray_fit pid=7007)[0m 0:	learn: 79495.4283907	test: 75358.3424905	best: 75358.3424905 (0)	total: 135ms	remaining: 1m 27s[32m [repeated 9x across cluster][0m
[36m(_ray_fit pid=7007)[0m bestTest = 75022.47355
[36m(_ray_fit pid=7007)[0m bestIteration = 2


[36m(_ray_fit pid=7052)[0m 	Training S1F4 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=7052)[0m bestTest = 84001.53416
[36m(_ray_fit pid=7052)[0m bestIteration = 2
[36m(_ray_fit pid=7007)[0m 180:	learn: 73195.0841449	test: 69678.6388418	best: 69678.6388418 (180)	total: 5.85s	remaining: 15.3s[32m [repeated 11x across cluster][0m
[36m(_ray_fit pid=7052)[0m 140:	learn: 72079.6645227	test: 79254.7072387	best: 79254.7072387 (140)	total: 4.89s	remaining: 21s[32m [repeated 15x across cluster][0m
[36m(_ray_fit pid=7007)[0m bestTest = 69553.73755
[36m(_ray_fit pid=7007)[0m bestIteration = 429
[36m(_ray_fit pid=7007)[0m Shrink model to first 430 iterations.
[36m(_ray_fit pid=7052)[0m 300:	learn: 71197.6056049	test: 79050.0137748	best: 79050.0137748 (300)	total: 10.3s	remaining: 15.3s[32m [repeated 15x across cluster][0m


[36m(_ray_fit pid=7097)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=7052)[0m 460:	learn: 70554.2058115	test: 79029.0361747	best: 79025.9249056 (433)	total: 16.1s	remaining: 9.97s[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=7097)[0m bestTest = 73535.57917
[36m(_ray_fit pid=7097)[0m bestIteration = 2
[36m(_ray_fit pid=7052)[0m Shrink model to first 434 iterations.
[36m(_ray_fit pid=7097)[0m 100:	learn: 74223.3282721	test: 68067.3739358	best: 68067.3739358 (100)	total: 3.57s	remaining: 24s[32m [repeated 9x across cluster][0m
[36m(_ray_fit pid=7052)[0m bestTest = 79025.92491
[36m(_ray_fit pid=7052)[0m bestIteration = 433


[36m(_ray_fit pid=7142)[0m 	Training S1F6 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=7142)[0m bestTest = 75085.16825
[36m(_ray_fit pid=7142)[0m bestIteration = 2
[36m(_ray_fit pid=7097)[0m 260:	learn: 73066.6628509	test: 67489.3615279	best: 67489.3615279 (260)	total: 8.79s	remaining: 17.5s[32m [repeated 13x across cluster][0m
[36m(_ray_fit pid=7142)[0m 220:	learn: 73005.8122539	test: 69572.3775338	best: 69572.3775338 (220)	total: 6.75s	remaining: 13.8s[32m [repeated 16x across cluster][0m
[36m(_ray_fit pid=7097)[0m 560:	learn: 71887.7583105	test: 67347.2941975	best: 67347.2941975 (560)	total: 19s	remaining: 7.46s[32m [repeated 15x across cluster][0m
[36m(_ray_fit pid=7097)[0m bestTest = 67340.76811
[36m(_ray_fit pid=7097)[0m bestIteration = 634
[36m(_ray_fit pid=7097)[0m Shrink model to first 635 iterations.
[36m(_ray_fit pid=7142)[0m 520:	learn: 71905.7121036	test: 69410.2906701	best: 69410.0223969 (519)	total: 17s	remaining: 4.97s[32m [repeated 14x across cluster][0m


[36m(_ray_fit pid=7187)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=7142)[0m 660:	learn: 71461.0058558	test: 69389.9566620	best: 69389.7460204 (659)	total: 22s	remaining: 399ms[32m [repeated 7x across cluster][0m
[36m(_ray_fit pid=7142)[0m bestTest = 69388.86305
[36m(_ray_fit pid=7142)[0m bestIteration = 668
[36m(_ray_fit pid=7142)[0m Shrink model to first 669 iterations.
[36m(_ray_fit pid=7187)[0m 80:	learn: 75292.4653478	test: 62695.1540613	best: 62695.1540613 (80)	total: 2.71s	remaining: 20.2s[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=7187)[0m bestTest = 68174.62712
[36m(_ray_fit pid=7187)[0m bestIteration = 2


[36m(_ray_fit pid=7232)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=7232)[0m bestTest = 76515.12803
[36m(_ray_fit pid=7232)[0m bestIteration = 2
[36m(_ray_fit pid=7187)[0m 240:	learn: 74022.4663293	test: 61802.8652656	best: 61802.8652656 (240)	total: 7.73s	remaining: 14.2s[32m [repeated 13x across cluster][0m
[36m(_ray_fit pid=7187)[0m 400:	learn: 73378.1893433	test: 61659.9283418	best: 61658.2657328 (396)	total: 13s	remaining: 9.14s[32m [repeated 16x across cluster][0m
[36m(_ray_fit pid=7232)[0m 380:	learn: 72258.5533954	test: 70711.0519245	best: 70710.8933554 (378)	total: 12s	remaining: 10.8s[32m [repeated 16x across cluster][0m
[36m(_ray_fit pid=7232)[0m bestTest = 70701.62777
[36m(_ray_fit pid=7232)[0m bestIteration = 427
[36m(_ray_fit pid=7232)[0m Shrink model to first 428 iterations.


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r13_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_r13_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73214.9673	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	109.56s	 = Training   runtime
[36m(_dystack pid=167)[0m 	2.16s	 = Validation runtime
[36m(_dystack pid=167)[0m 	9703.2	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: RandomForest_r195_BAG_L1 ... Training model for up to 1541.71s of the 4834.91s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting RandomForest_r195_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/mod

[36m(_ray_fit pid=7388)[0m [50]	valid_set's rmse: 82529.8
[36m(_ray_fit pid=7187)[0m 683:	learn: 72459.7890478	test: 61573.9139175	best: 61573.9139175 (683)	total: 22.6s	remaining: 0us[32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=7187)[0m bestTest = 61573.91392
[36m(_ray_fit pid=7187)[0m bestIteration = 683


[36m(_ray_fit pid=7446)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=7388)[0m [LightGBM] [Fatal] bin size 1669 cannot run on GPU
[36m(_ray_fit pid=7446)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=7446)[0m [50]	valid_set's rmse: 70087.2[32m [repeated 4x across cluster][0m


[36m(_ray_fit pid=7503)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=7471)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=7503)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=7503)[0m [50]	valid_set's rmse: 68013.4[32m [repeated 4x across cluster][0m


[36m(_ray_fit pid=7561)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=7531)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=7561)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=7561)[0m [50]	valid_set's rmse: 62160[32m [repeated 4x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r188_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r188_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73463.1388	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	49.78s	 = Training   runtime
[36m(_dystack pid=167)[0m 	2.39s	 = Validation runtime
[36m(_dystack pid=167)[0m 	8763.0	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetFastAI_r145_BAG_L1 ... Training model for up to 1161.47s of the 4454.66s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting NeuralNetFastAI_r145_BAG_L1 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit

[36m(_ray_fit pid=8126)[0m [0]	validation_0-rmse:87702.90149
[36m(_ray_fit pid=7588)[0m [100]	valid_set's rmse: 71260.5[32m [repeated 3x across cluster][0m


[36m(_ray_fit pid=8126)[0m Potential solutions:
[36m(_ray_fit pid=8126)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8126)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=8186)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=8186)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=8125)[0m Potential solutions:
[36m(_ray_fit pid=8125)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8125)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=8186)[0m [0]	validation_0-rmse:74703.42619[32m [repeated 8x across cluster][0m


[36m(_ray_fit pid=8186)[0m Potential solutions:
[36m(_ray_fit pid=8186)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8186)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=8248)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=8248)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m


[36m(_ray_fit pid=8248)[0m [0]	validation_0-rmse:73201.06943[32m [repeated 6x across cluster][0m


[36m(_ray_fit pid=8188)[0m Potential solutions:
[36m(_ray_fit pid=8188)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8188)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=8250)[0m Potential solutions:
[36m(_ray_fit pid=8250)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8250)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=8309)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=8309)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=8248)[0m Potential solutions:
[36m(_ray_fit pid=8248)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8248)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=8309)[0m [0]	validation_0-rmse:67830.36905[32m [repeated 7x across cluster][0m


[36m(_ray_fit pid=8309)[0m Potential solutions:
[36m(_ray_fit pid=8309)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=8309)[0m - Set the device for booster before call to inplace_predict.
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_r89_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_r89_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73667.0865	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	21.72s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.07s	 = Validation runtime
[36m(_dystack pid=167)[0m 	19662.6	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetTorch_r30_BAG_L1 ... Training model for up

[36m(_ray_fit pid=8841)[0m [50]	valid_set's rmse: 82219.9
[36m(_ray_fit pid=8312)[0m [158]	validation_0-rmse:71252.73282[32m [repeated 8x across cluster][0m


[36m(_ray_fit pid=8898)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=8842)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU
[36m(_ray_fit pid=8898)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU


[36m(_ray_fit pid=8898)[0m [50]	valid_set's rmse: 69976.6[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=8958)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=8900)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=8958)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=8958)[0m [50]	valid_set's rmse: 67614.7[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=8958)[0m 	Ran out of time, early stopping on iteration 79. Best iteration is:
[36m(_ray_fit pid=8958)[0m 	[60]	valid_set's rmse: 67571.2
[36m(_ray_fit pid=8960)[0m 	Training S1F6 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=9019)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=8960)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=9017)[0m [LightGBM] [Fatal] bin size 1664 cannot run on GPU


[36m(_ray_fit pid=9017)[0m [50]	valid_set's rmse: 62038.3[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9019)[0m 	Ran out of time, early stopping on iteration 80. Best iteration is:
[36m(_ray_fit pid=9019)[0m 	[47]	valid_set's rmse: 71038.4
[36m(_ray_fit pid=9017)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r130_BAG_L1/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_r130_BAG_L1/model.pkl
[36m(_dystack pid=167)[0m 	-73485.8269	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	40.05s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.18s	 = Validation runtime
[36m(_dystack pid=167)[0m 	17689.3	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Skipping Neu

[36m(_ray_fit pid=9175)[0m [50]	valid_set's rmse: 63996.4[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9231)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9174)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=9231)[0m [LightGBM] [Fatal] bin size 1668 cannot run on GPU


[36m(_ray_fit pid=9231)[0m [50]	valid_set's rmse: 59618.7[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9289)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9259)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=9289)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=9289)[0m [50]	valid_set's rmse: 73884[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9347)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9317)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU
[36m(_ray_fit pid=9347)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU


[36m(_ray_fit pid=9347)[0m [50]	valid_set's rmse: 76407.1[32m [repeated 2x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMXT_BAG_L2/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBMXT_BAG_L2/model.pkl
[36m(_dystack pid=167)[0m 	-73075.8958	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	52.69s	 = Training   runtime
[36m(_dystack pid=167)[0m 	0.76s	 = Validation runtime
[36m(_dystack pid=167)[0m 	360.6	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: LightGBM_BAG_L2 ... Training model for up to 3221.61s of the 3221.35s of remaining time.
[36m(_ray_fit pid=9374)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=9374)[0m [LightGBM] [Fatal] bin size 1667 cannot run on

[36m(_ray_fit pid=9504)[0m [50]	valid_set's rmse: 64445.6[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9560)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9503)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU
[36m(_ray_fit pid=9560)[0m [LightGBM] [Fatal] bin size 1668 cannot run on GPU


[36m(_ray_fit pid=9560)[0m [50]	valid_set's rmse: 60072.8[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9620)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9562)[0m [LightGBM] [Fatal] bin size 1666 cannot run on GPU
[36m(_ray_fit pid=9620)[0m [LightGBM] [Fatal] bin size 1667 cannot run on GPU


[36m(_ray_fit pid=9620)[0m [50]	valid_set's rmse: 74482.8[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9679)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9622)[0m [LightGBM] [Fatal] bin size 1670 cannot run on GPU
[36m(_ray_fit pid=9679)[0m [LightGBM] [Fatal] bin size 1665 cannot run on GPU


[36m(_ray_fit pid=9679)[0m [50]	valid_set's rmse: 76811.2[32m [repeated 2x across cluster][0m


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_BAG_L2/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/LightGBM_BAG_L2/model.pkl
[36m(_dystack pid=167)[0m 	-73536.591	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	49.68s	 = Training   runtime
[36m(_dystack pid=167)[0m 	0.63s	 = Validation runtime
[36m(_dystack pid=167)[0m 	361.4	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: RandomForestMSE_BAG_L2 ... Training model for up to 3169.11s of the 3168.84s of remaining time.
[36m(_ray_fit pid=9706)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.
[36m(_ray_fit pid=9706)[0m [LightGBM] [Fatal] bin size 1667 cannot run 

[36m(_ray_fit pid=9849)[0m 0:	learn: 78703.7573224	test: 78013.4455484	best: 78013.4455484 (0)	total: 21.2ms	remaining: 42.3ms
[36m(_ray_fit pid=9706)[0m [50]	valid_set's rmse: 82745.9
[36m(_ray_fit pid=9850)[0m bestTest = 68747.34999
[36m(_ray_fit pid=9850)[0m bestIteration = 2
[36m(_ray_fit pid=9849)[0m 100:	learn: 72135.8054801	test: 71759.2722409	best: 71759.2722409 (100)	total: 2.78s	remaining: 4.71s[32m [repeated 15x across cluster][0m
[36m(_ray_fit pid=9849)[0m bestTest = 77038.68496
[36m(_ray_fit pid=9849)[0m bestIteration = 2
[36m(_ray_fit pid=9850)[0m bestTest = 63809.49179
[36m(_ray_fit pid=9850)[0m bestIteration = 68
[36m(_ray_fit pid=9850)[0m Shrink model to first 69 iterations.


[36m(_ray_fit pid=9932)[0m 	Training S1F3 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=9932)[0m 2:	learn: 79233.2217512	test: 65676.2753096	best: 65676.2753096 (2)	total: 41.9ms	remaining: 0us[32m [repeated 6x across cluster][0m
[36m(_ray_fit pid=9932)[0m bestTest = 65676.27531[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9932)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9849)[0m Shrink model to first 143 iterations.
[36m(_ray_fit pid=9966)[0m 20:	learn: 72843.3174715	test: 79788.8782975	best: 79788.8782975 (20)	total: 501ms	remaining: 8.14s[32m [repeated 10x across cluster][0m
[36m(_ray_fit pid=9966)[0m bestTest = 83206.89349
[36m(_ray_fit pid=9966)[0m bestIteration = 2
[36m(_ray_fit pid=9932)[0m bestTest = 59531.43591
[36m(_ray_fit pid=9932)[0m bestIteration = 104
[36m(_ray_fit pid=9932)[0m Shrink model to first 105 iterations.


[36m(_ray_fit pid=10023)[0m 	Training S1F5 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=10023)[0m 2:	learn: 77585.1847625	test: 78225.9888362	best: 78225.9888362 (2)	total: 45.6ms	remaining: 0us[32m [repeated 13x across cluster][0m
[36m(_ray_fit pid=10023)[0m bestTest = 78225.98884[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=10023)[0m bestIteration = 2[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=9966)[0m Shrink model to first 165 iterations.
[36m(_ray_fit pid=10023)[0m 120:	learn: 71356.3890504	test: 74007.8019133	best: 73971.4858372 (89)	total: 2.98s	remaining: 6.64s[32m [repeated 9x across cluster][0m
[36m(_ray_fit pid=10063)[0m bestTest = 79735.72845
[36m(_ray_fit pid=10063)[0m bestIteration = 2
[36m(_ray_fit pid=10023)[0m bestTest = 73971.48584
[36m(_ray_fit pid=10023)[0m bestIteration = 89
[36m(_ray_fit pid=10023)[0m Shrink model to first 90 iterations.


[36m(_ray_fit pid=10114)[0m 	Training S1F7 with GPU, note that this may negatively impact model quality compared to CPU training.[32m [repeated 2x across cluster][0m


[36m(_ray_fit pid=10063)[0m 140:	learn: 71223.5987878	test: 74807.3538412	best: 74807.2815747 (131)	total: 3.67s	remaining: 6.37s[32m [repeated 8x across cluster][0m
[36m(_ray_fit pid=10114)[0m bestTest = 80187.59653
[36m(_ray_fit pid=10114)[0m bestIteration = 2
[36m(_ray_fit pid=10063)[0m Shrink model to first 161 iterations.
[36m(_ray_fit pid=10114)[0m 60:	learn: 71764.4399511	test: 76209.1044469	best: 76209.1044469 (60)	total: 1.35s	remaining: 7.32s[32m [repeated 9x across cluster][0m


[36m(_ray_fit pid=10159)[0m 	Training S1F8 with GPU, note that this may negatively impact model quality compared to CPU training.


[36m(_ray_fit pid=10114)[0m bestTest = 76205.45766[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=10114)[0m bestIteration = 62[32m [repeated 2x across cluster][0m
[36m(_ray_fit pid=10114)[0m Shrink model to first 63 iterations.
[36m(_ray_fit pid=10159)[0m 40:	learn: 71230.9910856	test: 82341.0320767	best: 82341.0320767 (40)	total: 817ms	remaining: 8.63s[32m [repeated 7x across cluster][0m
[36m(_ray_fit pid=10159)[0m bestTest = 86583.63203
[36m(_ray_fit pid=10159)[0m bestIteration = 2
[36m(_ray_fit pid=10159)[0m bestTest = 82077.68639
[36m(_ray_fit pid=10159)[0m bestIteration = 166
[36m(_ray_fit pid=10159)[0m Shrink model to first 167 iterations.


[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_BAG_L2/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/CatBoost_BAG_L2/model.pkl
[36m(_dystack pid=167)[0m 	-72963.065	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	52.04s	 = Training   runtime
[36m(_dystack pid=167)[0m 	0.73s	 = Validation runtime
[36m(_dystack pid=167)[0m 	360.8	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: ExtraTreesMSE_BAG_L2 ... Training model for up to 1386.0s of the 1385.73s of remaining time.
[36m(_dystack pid=167)[0m 	Fitting ExtraTreesMSE_BAG_L2 with 'num_gpus': 2, 'num_cpus': 4
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/ExtraTreesMSE_BA

[36m(_ray_fit pid=10738)[0m [0]	validation_0-rmse:77408.16935
[36m(_ray_fit pid=10159)[0m 200:	learn: 69532.0098187	test: 82084.0482313	best: 82077.6863877 (166)	total: 4.21s	remaining: 5.71s[32m [repeated 8x across cluster][0m


[36m(_ray_fit pid=10739)[0m Potential solutions:
[36m(_ray_fit pid=10739)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10739)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=10799)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=10799)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=10738)[0m Potential solutions:
[36m(_ray_fit pid=10738)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10738)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=10799)[0m [0]	validation_0-rmse:66127.62197[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=10801)[0m Potential solutions:
[36m(_ray_fit pid=10801)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10801)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=10861)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=10861)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=10799)[0m Potential solutions:
[36m(_ray_fit pid=10799)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10799)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=10863)[0m [0]	validation_0-rmse:80105.80584[32m [repeated 4x across cluster][0m


[36m(_ray_fit pid=10861)[0m Potential solutions:
[36m(_ray_fit pid=10861)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10861)[0m - Set the device for booster before call to inplace_predict.
[36m(_ray_fit pid=10924)[0m [32m [repeated 12x across cluster][0m
[36m(_ray_fit pid=10924)[0m     E.g. tree_method = "hist", device = "cuda"[32m [repeated 4x across cluster][0m
[36m(_ray_fit pid=10863)[0m Potential solutions:
[36m(_ray_fit pid=10863)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10863)[0m - Set the device for booster before call to inplace_predict.


[36m(_ray_fit pid=10924)[0m [0]	validation_0-rmse:80501.17583[32m [repeated 5x across cluster][0m


[36m(_ray_fit pid=10924)[0m Potential solutions:
[36m(_ray_fit pid=10924)[0m - Use a data structure that matches the device ordinal in the booster.
[36m(_ray_fit pid=10924)[0m - Set the device for booster before call to inplace_predict.
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_BAG_L2/utils/oof.pkl
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/XGBoost_BAG_L2/model.pkl
[36m(_dystack pid=167)[0m 	-73495.7824	 = Validation score   (-root_mean_squared_error)
[36m(_dystack pid=167)[0m 	27.81s	 = Training   runtime
[36m(_dystack pid=167)[0m 	1.7s	 = Validation runtime
[36m(_dystack pid=167)[0m 	354.9	 = Inference  throughput (rows/s | 20948 batch size)
[36m(_dystack pid=167)[0m Saving AutogluonModels/ag-20240910_171549/ds_sub_fit/sub_fit_ho/models/trainer.pkl
[36m(_dystack pid=167)[0m Fitting model: NeuralNetTorch_BAG_L2 ... Training model for up to 231.31s 

*** Summary of fit() ***
Estimated performance of each model:
                          model     score_val              eval_metric  pred_time_val      fit_time  pred_time_val_marginal  fit_time_marginal  stack_level  can_infer  fit_order
0           WeightedEnsemble_L3 -72460.423724  root_mean_squared_error     279.053290  20059.416847                0.003043           0.654773            3       True         85
1           WeightedEnsemble_L2 -72490.986540  root_mean_squared_error      95.635211   5099.747616                0.002633           0.792438            2       True         62
2               CatBoost_BAG_L2 -72498.486929  root_mean_squared_error     271.744857  19496.905839                0.657709          58.334367            2       True         66
3          CatBoost_r137_BAG_L2 -72517.432074  root_mean_squared_error     271.864494  19490.463722                0.777346          51.892251            2       True         81
4          CatBoost_r177_BAG_L2 -72530.828743  r

In [8]:
predictor.leaderboard()

Unnamed: 0,model,score_val,eval_metric,pred_time_val,fit_time,pred_time_val_marginal,fit_time_marginal,stack_level,can_infer,fit_order
0,WeightedEnsemble_L3,-72460.423724,root_mean_squared_error,279.053290,20059.416847,0.003043,0.654773,3,True,85
1,WeightedEnsemble_L2,-72490.986540,root_mean_squared_error,95.635211,5099.747616,0.002633,0.792438,2,True,62
2,CatBoost_BAG_L2,-72498.486929,root_mean_squared_error,271.744857,19496.905839,0.657709,58.334367,2,True,66
3,CatBoost_r137_BAG_L2,-72517.432074,root_mean_squared_error,271.864494,19490.463722,0.777346,51.892251,2,True,81
4,CatBoost_r177_BAG_L2,-72530.828743,root_mean_squared_error,271.704902,19489.933062,0.617754,51.361590,2,True,72
...,...,...,...,...,...,...,...,...,...,...
80,NeuralNetTorch_r158_BAG_L1,-90209.459491,root_mean_squared_error,1.535660,444.695758,1.535660,444.695758,1,True,47
81,NeuralNetTorch_r22_BAG_L2,-90209.487958,root_mean_squared_error,276.094049,19806.412071,5.006902,367.840599,2,True,78
82,NeuralNetTorch_r22_BAG_L1,-90209.535136,root_mean_squared_error,1.299191,307.069428,1.299191,307.069428,1,True,16
83,NeuralNetTorch_r86_BAG_L1,-90209.558475,root_mean_squared_error,1.469808,326.626285,1.469808,326.626285,1,True,28
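
A note on reading the leaderboard: AutoGluon reports `score_val` in higher-is-better form, so RMSE appears negated (the training log labels it `-root_mean_squared_error`). The sketch below takes four rows copied from the table above, flips the sign to recover plain RMSE, and measures each model's gap to the best one. The names `rows`, `to_rmse`, and `summary` are illustrative and not part of AutoGluon.

```python
# Rows copied from the leaderboard output above: (model, score_val).
rows = [
    ("WeightedEnsemble_L3", -72460.423724),
    ("WeightedEnsemble_L2", -72490.986540),
    ("CatBoost_BAG_L2", -72498.486929),
    ("NeuralNetTorch_r86_BAG_L1", -90209.558475),
]


def to_rmse(score_val):
    """Undo the sign flip: score_val is negated RMSE, so RMSE = -score_val."""
    return -score_val


# Lower RMSE is better, so the best model has the smallest value.
best_rmse = min(to_rmse(score) for _, score in rows)

# (model, rmse, gap to the best model) for each row.
summary = [
    (model, to_rmse(score), to_rmse(score) - best_rmse)
    for model, score in rows
]

for model, rmse, gap in summary:
    print(f"{model:<28} RMSE={rmse:>12.2f}  gap_to_best={gap:>10.2f}")
```

Seen this way, the top ensembles and CatBoost models differ by only a few tens of RMSE units, while the weakest neural-network models trail by a much larger margin.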


## 🏅 **Submission**

Finally, we'll generate the submission file. It follows the layout of the sample submission: an `id` column and a `price` column.

In [9]:
# Predict prices for the test set; the result is a Series named 'price'.
y_pred = predictor.predict(test)
y_pred[:5]

Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r128_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r137_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r13_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r167_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r177_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r49_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r50_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r69_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r70_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r86_BAG_L1/model.pkl
Loading: AutogluonModels/ag-20240910_171549/models/CatBoost_r9_BAG_L1/model.pkl
Loading: AutogluonModels/ag-2

0    18676.458984
1    85042.539062
2    58277.531250
3    30024.185547
4    29510.189453
Name: price, dtype: float32

In [10]:
# Wrap the predictions in a single-column DataFrame.
df = pd.DataFrame(y_pred, columns=['price'])
df.head()

Unnamed: 0,price
0,18676.458984
1,85042.539062
2,58277.53125
3,30024.185547
4,29510.189453


In [11]:
# Load the sample submission to reuse its id column and row order.
sol = pd.read_csv('/kaggle/input/playground-series-s4e9/sample_submission.csv')
sol.head()

Unnamed: 0,id,price
0,188533,43878.016
1,188534,43878.016
2,188535,43878.016
3,188536,43878.016
4,188537,43878.016


In [12]:
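# Overwrite the placeholder prices. The assignment aligns on the row index,
# so df and sol need matching indexes.
sol['price'] = df['price']
sol.head()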

Unnamed: 0,id,price
0,188533,18676.458984
1,188534,85042.539062
2,188535,58277.53125
3,188536,30024.185547
4,188537,29510.189453


In [13]:
sol.to_csv('submission.csv', index=False)
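
Before uploading, it can be worth checking that the file matches the expected layout. The helper below is a minimal sketch: `check_submission` is a hypothetical name, and the rules it enforces (same columns and `id` values as the sample, a numeric `price` with no missing values) are assumptions drawn from the sample submission shown above, not an official validator. Small in-memory frames stand in for `sol` and the sample file.

```python
import pandas as pd


def check_submission(submission: pd.DataFrame, sample: pd.DataFrame) -> list:
    """Return a list of problems found; an empty list means the checks passed.

    Hypothetical helper; the rules are assumptions based on the sample file.
    """
    problems = []
    if list(submission.columns) != list(sample.columns):
        problems.append(
            f"columns are {list(submission.columns)}, "
            f"expected {list(sample.columns)}"
        )
        return problems
    if len(submission) != len(sample):
        problems.append(f"{len(submission)} rows, expected {len(sample)}")
    elif not (submission["id"].to_numpy() == sample["id"].to_numpy()).all():
        problems.append("id values differ from the sample submission")
    if submission["price"].isna().any():
        problems.append("price contains missing values")
    if not pd.api.types.is_numeric_dtype(submission["price"]):
        problems.append("price is not numeric")
    return problems


# Toy stand-ins built from the first rows shown above.
sample = pd.DataFrame({"id": [188533, 188534, 188535],
                       "price": [43878.016] * 3})
good = pd.DataFrame({"id": [188533, 188534, 188535],
                     "price": [18676.458984, 85042.539062, 58277.531250]})

# A misaligned index is one way NaN can sneak in during column assignment.
bad = good.copy()
bad["price"] = pd.Series([1.0, 2.0, 3.0], index=[1, 2, 3])

print(check_submission(good, sample))
print(check_submission(bad, sample))
```

In this notebook the same check could be run on `sol` against a fresh read of `sample_submission.csv`.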

## 🎯 **Conclusion**

In this kernel, we built a used car price prediction model with AutoGluon, training bagged and stacked models with GPU training enabled. The best model, `WeightedEnsemble_L3`, reached a validation RMSE of about 72,460. Keep experimenting and refining your model to climb the leaderboard! 🏆