v0.6.0
Version 0.6.0
We're happy to announce the AutoGluon 0.6 release. 0.6 contains major enhancements to Tabular, Multimodal, and Time Series
modules, along with many quality of life improvements and fixes.
As always, only load previously trained models using the same version of AutoGluon that they were originally trained on.
Loading models trained in different versions of AutoGluon is not supported.
This release contains 263 commits from 25 contributors!
See the full commit change-log here: v0.5.2...v0.6.0
Special thanks to @cheungdaven, @suzhoum, @BingzhaoZhu, @liangfu, @Harry-zzh, @gidler, @yongxinw, @martinschaef,
@giswqs, @Jalagarto, @geoalgo, @lujiaying and @leloykun who were first time contributors to AutoGluon this release!
Full Contributor List (ordered by # of commits):
@shchur, @yinweisu, @zhiqiangdon, @Innixma, @FANGAreNotGnu, @canerturkmen, @sxjscience, @gradientsky, @cheungdaven,
@bryanyzhu, @suzhoum, @BingzhaoZhu, @yongxinw, @tonyhoo, @liangfu, @Harry-zzh, @Raldir, @gidler, @martinschaef,
@giswqs, @Jalagarto, @geoalgo, @lujiaying, @leloykun, @yiqings
This version supports Python versions 3.7 to 3.9. This is the last release that will support Python 3.7.
Changes
AutoMM
AutoGluon Multimodal (a.k.a AutoMM) supports three new features: 1) object detection, 2) named entity recognition, and 3) multimodal matching. In addition, the HPO backend of AutoGluon Multimodal has been upgraded to ray 2.0. It also supports fine-tuning billion-scale FLAN-T5-XL model on a single AWS g4.2x-large instance with improved parameter-efficient finetuning. Starting from 0.6, we recommend using autogluon.multimodal rather than autogluon.text or autogluon.vision and added deprecation warnings.
New features
-
Object Detection
- Add new problem_type
"object_detection"
. - Customers can run inference with pretrained object detection models and train their own model with three lines of code.
- Integrate with open-mmlab/mmdetection, which supports classic detection architectures like Faster RCNN, and more efficient and performant architectures like YOLOV3 and VFNet.
- See tutorials and examples for more detail.
- Contributors and commits: @FANGAreNotGnu, @bryanyzhu, @zhiqiangdon, @yongxinw, @sxjscience, @Harry-zzh (#2025, #2061, #2131, #2181, #2196, #2215, #2244, #2265, #2290, #2311, #2312, #2337, #2349, #2353, #2360, #2362, #2365, #2380, #2381, #2391, #2393, #2400, #2419, #2421, #2063, #2104, #2411)
- Add new problem_type
-
Named Entity Recognition
- Add new problem_type
"ner"
. - Customers can train models to extract named entities with three lines of code.
- The implementation supports any backbones in huggingface/transformer, including the recently FLAN-T5 series released by Google.
- See tutorials for more detail.
- Contributors and commits: @cheungdaven (#2183, #2232, #2220, #2282, #2295, #2301, #2337, #2346, #2361, #2372, #2394, #2412)
- Add new problem_type
-
Multimodal Matching
- Add new problem_type
"text_similarity"
,"image_similarity"
,"image_text_similarity"
. - Users can now extract semantic embeddings with pretrained models for text-text, image-image, and text-image matching problems.
- Moreover, users can further finetune these models with relevance data.
- The semantic text embedding model can also be combined with BM25 to form a hybrid indexing solution.
- Internally, AutoGluon Multimodal implements a twin-tower architecture that is flexible in the choice of backbones for each tower. It supports image backbones in TIMM, text backbones in huggingface/transformers, and also the CLIP backbone.
- See tutorials for more detail.
- Contributors and commits: @zhiqiangdon @FANGAreNotGnu @cheungdaven @suzhoum @sxjscience @bryanyzhu (#1975, #1994, #2142, #2179, #2186, #2217, #2235, #2284, #2297, #2313, #2326, #2337, #2347, #2357, #2358, #2362, #2363, #2375, #2378, #2404, #2416, #2407, #2417)
- Add new problem_type
-
Miscellaneous minor fixes. @cheungdaven @FANGAreNotGnu @geoalgo @zhiqiangdon (#2402, #2409, #2026, #2401, #2418)
Other Enhancements
- Fix the FT-Transformer implementation and support Fastformer. @BingzhaoZhu @yiqings (#1958, #2194, #2251, #2344, #2379, #2386)
- Support finetuning billion-scale FLAN-T5-XL in a single AWS g4.2x-large instance via improved parameter-efficient finetuning. See tutorial. @Raldir @sxjscience (#2032, #2108, #2285, #2336, #2352)
- Upgrade multimodal HPO to use ray 2.0 and also add new tutorial. @yinweisu @suzhoum @bryanyzhu (#2206, #2341)
- Further improvement on model distillation. Add example and tutorial. @FANGAreNotGnu @sxjscience (#1983, #2064, #2397)
- Revise the default presets of AutoMM for image classification problems. @bryanyzhu (#2351)
- Support backend=βautommβ in autogluon.vision. @bryanyzhu (#2316)
- Add deprecated warning to autogluon.vision and autogluon.text and point the usage to autogluon.multimodal. @bryanyzhu @sxjscience (#2268, #2315)
- Examples about Kaggle: Feedback Prize prediction competition. We created a solution with AutoGluon Multimodal that obtained 152/1557 in the public leaderboard and 170/1557 in the private leaderboard, which is among the top 12% participants. The solution is public days before the DDL of the competition and obtained more than 3000 views. @suzhoum @MountPOTATO (#2129, #2168, #2333)
- Improve native inference speed. @zhiqiangdon (#2051, #2157, #2161, #2171)
- Other improvements, security/bug fixes. @zhiqiangdon @sxjscience @FANGAreNotGnu, @yinweisu @Innixma @tonyhoo @martinschaef @giswqs @tonyhoo (#1980, #1987, #1989, #2003, #2080, #2018, #2039, #2058, #2101, #2102, #2125, #2135, #2136, #2140, #2141, #2152, #2164, #2166, #2192, #2219, #2250, #2257, #2280, #2308, #2315, #2317, #2321, #2356, #2388, #2392, #2413, #2414, #2417, #2426, #2028, #2382, #2415, #2193, #2213, #2230)
- CI improvements. @yinweisu (#1965, #1966, #1972, #1991, #2002, #2029, #2137, #2151, #2156, #2163, #2191, #2214, #2369, #2113, #2118)
Experimental Features
- Support 11B-scale model finetuning with DeepSpeed. @Raldir (#2032)
- Enable few-shot learning with 11B-scale model. @Raldir (#2197)
- ONNX export example of hf_text model. @FANGAreNotGnu (#2149)
Tabular
New features
-
New experimental model
FT_TRANSFORMER
. @BingzhaoZhu, @Innixma (#2085, #2379, #2389, #2410)- You can access it via specifying the
FT_TRANSFORMER
key
in thehyperparameters
dictionary or viapresets="experimental_best_quality"
. - It is recommended to use GPU to train this model, but CPU training is also supported.
- If given enough training time, this model generally improves the ensemble quality.
- You can access it via specifying the
-
New experimental model compilation support via
predictor.compile_models()
. @liangfu, @Innixma (#2225, #2260, #2300)- Currently only Random Forest and Extra Trees have compilation support.
- You will need to install extra dependencies for this to work:
pip install autogluon.tabular[all,skl2onnx]
. - Compiling models dramatically speeds up inference time (~10x) when processing small batches of samples (<10000).
- Note that a known bug exists in the current implementation: Refitting models after compilation will fail
and cause a crash. To avoid this, ensure that.compile_models
is called only at the very end.
-
Added
predictor.clone(...)
method to allow perfectly cloning a predictor object to a new directory.
This is useful to preserve the state of a predictor prior to altering it
(such as prior to calling.save_space
,.distill
,.compile_models
, or.refit_full
. @Innixma (#2071) -
Added simplified
num_gpus
andnum_cpus
arguments topredictor.fit
to control total resources.
@yinweisu, @Innixma (#2263) -
Improved stability and effectiveness of HPO functionality via various refactors regarding our usage of ray.
@yinweisu, @Innixma (#1974, #1990, #2094, #2121, #2133, #2195, #2253, #2263, #2330) -
Upgraded dependency versions: XGBoost 1.7, CatBoost 1.1, Scikit-learn 1.1, Pandas 1.5, Scipy 1.9, Numpy 1.23.
@Innixma (#2373) -
Added python version compatibility check when loading a fitted TabularPredictor.
Will now error if python versions are incompatible. @Innixma (#2054) -
Added
fit_weighted_ensemble
argument topredictor.fit
. This allows the user to disable the weighted ensemble.
@Innixma (#2145)
Other Enhancements
- Improved logging clarity when using
infer_limit
. @Innixma (#2014) - Significantly improved HPO search space of XGBoost. @Innixma (#2123)
- Fixed HPO crashing when tuning Random Forest, Extra Trees, or KNN. @Innixma (#2070)
- Optimized roc_auc metric scoring speed by 7x. @Innixma (#2318, #2331)
- Fixed bug with AutoMM Tabular model crashing if not trained last. @Innixma (#2309)
- Refactored
Scorer
classes to be easier to use, plus added comprehensive unit tests for all metrics. @Innixma (#2242) - Sped up TextSpecial feature generation during preprocessing by 20% @gidler (#2095)
- imodels integration improvements @Jalagarto (#2062)
- Fix crash when calling feature importance in quantile_regression. @leloykun (#1977)
- Add FAQ section for missing value imputation. @Innixma (#2076)
- Various minor fixes and cleanup @Innixma, @yinweisu, @gradientsky, @gidler (#1997, #2031, #2124, #2144, #2178, #2340, #2342, #2345, #2374, #2339,
#2348, #2403, #1981, #1982, #2234, #2233, #2243, #2269, #2288, #2307, #2367, #2019)
Time Series
New features
TimeSeriesPredictor
now supports static features (a.k.a. time series metadata, static covariates) and **
time-varying covariates** (a.k.a. dynamic features or related time series). @shchur @canerturkmen (#1986, #2238,
#2276, #2287)- AutoGluon-TimeSeries now uses PyTorch by default (for
DeepAR
andSimpleFeedForward
), removing the dependency
on MXNet. @canerturkmen (#2074, #2205, #2279) - New models!
AutoGluonTabular
relies on XGBoost, LightGBM and CatBoost under the hood via theautogluon.tabular
module.Naive
andSeasonalNaive
forecasters are simple methods that provide strong baselines with no increase in
training time.TemporalFusionTransformerMXNet
brings the TFT transformer architecture to AutoGluon. @shchur (#2106,
#2188, #2258, #2266) - Up to 20x faster parallel and memory-efficient training for statistical (local) forecasting models like
ETS
,ARIMA
andTheta
, as well asWeightedEnsemble
. @shchur @canerturkmen (#2001, #2033, #2040, #2067, #2072, #2073, #2180,
#2293, #2305) - Up to 3x faster training for GluonTS models with data caching. GPU training enabled by default on PyTorch models.
@shchur (#2323) - More accurate validation for time series models with multi-window backtesting. @shchur (#2013, #2038)
TimeSeriesPredictor
now handles irregularly sampled time series withignore_index
. @canerturkmen, @shchur (#1993,
#2322)- Improved and extended presets for more accurate forecasting. @shchur (#2304)
- 15x faster and more robust forecast evaluation with updates to
TimeSeriesEvaluator
@shchur (#2147, #2150) - Enabled Ray Tune backend for hyperparameter optimization of time series models. @shchur (#2167, #2203)
More tutorials and examples
Improved documentation and new tutorials:
- Updated Quickstart tutorial
- New! In-depth tutorial
- New! Overview of available models and hyperparameters
- Updated API documentation
@shchur (#2120, #2127, #2146, #2174, #2187, #2354)
Miscellaneous
- Deprecate passing quantile_levels to TimeSeriesPredictor.predict (#2277)
- Use static features in GluonTS forecasting models (#2238)
- Make sure that time series splitter doesn't trim training series shorter than prediction_length + 1 (#2099)
- Fix hyperparameter overloading in HPO for time series models (#2189)
- Clean up the TimeSeriesDataFrame public API (#2105)
- Fix item order in GluonTS models predictions (#2092)
- Implement hash_ts_dataframe_items (#2060)
- Speed up TimeSeriesDataFrame.slice_by_timestep (#2020)
- Speed up RandomForestQuantileRegressor and ExtraTreesQuantileRegressor (#2204)
- Various backend enhancements / refactoring / cleanup (#2314, #2294, #2292, #2278, #1985, #2398)
- Increase the number of samples used by DeepAR at prediction time (#2291)
- revise timeseries presets to minimum context length of 10 (#2065)
- Fix timeseries daily frequency inferred period (#2100)
- Various backend enhancements / refactoring / cleanup (#2286, #2302, #2240, #2093, #2098, #2044, #2385, #2355, #2405)