Releases: alteryx/evalml

v0.31.0

19 Aug 20:05
ed74174

v0.31.0 Aug. 19, 2021

Enhancements

  • Updated the high variance check in AutoMLSearch to be robust to a variety of objectives and cv scores #2622
  • Use Woodwork's outlier detection for the OutliersDataCheck #2637
  • Added ability to utilize instantiated components when creating a pipeline #2643
  • Sped up the all-NaN and Unknown check in infer_feature_types #2661

Fixes

Changes

  • Deleted _put_into_original_order helper function #2639
  • Refactored time series pipeline code using a time series pipeline base class #2649
  • Renamed dask_tests to parallel_tests #2657
  • Removed commented out code in pipeline_meta.py #2659

Documentation Changes

  • Add complete install command to README and Install section #2627

Testing Changes

Breaking Changes

  • TimeSeriesRegressionPipeline no longer inherits from RegressionPipeline #2649

v0.30.2

16 Aug 20:15
2b0f42e

v0.30.2 Aug. 16, 2021

Fixes

  • Updated changelog and version numbers to match the release. Release 0.30.1 was released erroneously without a change to the version numbers. 0.30.2 replaces it.

v0.30.1

13 Aug 21:13
948e3b4

v0.30.1 Aug. 13, 2021

Enhancements

  • Added DatetimeFormatDataCheck for time series problems #2603
  • Added ProphetRegressor to estimators #2242
  • Updated ComponentGraph to skip calling samplers' transform during predict, and updated samplers' transform methods so that fit_transform is equivalent to fit(X, y).transform(X, y) #2583
  • Updated ComponentGraph _validate_component_dict logic to be stricter about input values #2599
  • Patched bug in xgboost estimators where predicting on a feature matrix of only booleans would throw an exception. #2602
  • Updated ARIMARegressor to use relative forecasting to predict values #2613
  • Updated to support Woodwork 0.5.1 #2610

Fixes

  • Updated get_best_sampler_for_data to consider all non-numeric datatypes as categorical for SMOTE #2590
  • Fixed inconsistent test results from TargetDistributionDataCheck #2608
  • Adopted vectorized pd.NA checking for Woodwork 0.5.1 support #2626

Changes

  • Renamed SMOTE samplers to SMOTE oversampler #2595
  • Changed partial_dependence and graph_partial_dependence to raise a PartialDependenceError instead of ValueError. This is not a breaking change because PartialDependenceError is a subclass of ValueError #2604
  • Cleaned up code duplication in ComponentGraph #2612
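
Why the PartialDependenceError change in #2604 is not breaking can be seen in a small sketch. The class below is a local stand-in for illustration, not evalml's own definition, and partial_dependence_stub is a hypothetical function.

```python
class PartialDependenceError(ValueError):
    """Local stand-in for illustration; evalml ships its own class."""


def partial_dependence_stub(feature_type):
    # Hypothetical function that fails the way a real call might.
    if feature_type == "unsupported":
        raise PartialDependenceError("cannot compute partial dependence")
    return "ok"


def legacy_caller(feature_type):
    # Code written before the change catches ValueError and keeps working,
    # because the new exception is a subclass of ValueError.
    try:
        return partial_dependence_stub(feature_type)
    except ValueError as err:
        return f"handled: {err}"
```

Callers that want to react only to partial dependence failures can catch the narrower class instead.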

Documentation Changes

  • To avoid local docs build error, only add warning disable and download headers on ReadTheDocs builds, not locally #2617

Testing Changes

  • Changed the lint CI job to only check against python 3.9 via the -t flag #2586
  • Installed Prophet in linux nightlies test and fixed test_all_components #2598
  • Refactored and fixed all make_pipeline tests to assert correct order and address new Woodwork Unknown type inference #2572
  • Removed component_graphs as a global variable in test_component_graphs.py #2609

Breaking Changes

  • Renamed SMOTE samplers to SMOTE oversampler. Please use SMOTEOversampler, SMOTENCOversampler, SMOTENOversampler instead of SMOTESampler, SMOTENCSampler, and SMOTENSampler #2595
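
For code that stores component class names as strings (for example in configuration files), the rename in #2595 could be handled with a small lookup. The mapping is taken from the note above; migrate_component_name is a hypothetical helper, not part of evalml.

```python
# Old class name -> new class name, as listed in the breaking change for #2595.
SMOTE_RENAMES = {
    "SMOTESampler": "SMOTEOversampler",
    "SMOTENCSampler": "SMOTENCOversampler",
    "SMOTENSampler": "SMOTENOversampler",
}


def migrate_component_name(name):
    """Return the renamed class name for a SMOTE sampler, or the name unchanged."""
    return SMOTE_RENAMES.get(name, name)
```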

v0.30.0

04 Aug 22:12
455210b

v0.30.0 Aug. 4, 2021

Enhancements

  • Added LogTransformer and TargetDistributionDataCheck #2487
  • Issue a warning to users when a pipeline parameter passed in isn't used in the pipeline #2564
  • Added Gini coefficient as an objective #2544
  • Added repr to ComponentGraph #2565
  • Added components to extract features from URL and EmailAddress Logical Types #2550
  • Added support for NaN values in TextFeaturizer #2532
  • Added SelectByType transformer #2531
  • Added separate thresholds for percent null rows and columns in HighlyNullDataCheck #2562
  • Added support for NaN natural language values #2577
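
The unused-parameter warning from #2564 could look roughly like the sketch below. This is an illustration under assumptions: the function names, the warning class, and the message text are hypothetical and may differ from what evalml actually emits.

```python
import warnings


def find_unused_parameters(parameters, component_names):
    """Return parameter keys that match no component in the pipeline."""
    return sorted(key for key in parameters if key not in component_names)


def warn_on_unused(parameters, component_names):
    """Warn once, listing every parameter key the pipeline would ignore."""
    unused = find_unused_parameters(parameters, component_names)
    if unused:
        warnings.warn(
            f"Parameters for components {unused} will not be used.",
            UserWarning,
        )
    return unused
```

A misspelled component name in the parameters dictionary is the typical case such a warning catches.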

Fixes

  • Raised error message for types URL, NaturalLanguage, and EmailAddress in partial_dependence #2573

Changes

  • Updated PipelineBase implementation for creating pipelines from a list of components #2549
  • Moved get_hyperparameter_ranges to PipelineBase class from automl/utils module #2546
  • Renamed ComponentGraph's get_parents to get_inputs #2540
  • Removed ComponentGraph.linearized_component_graph and ComponentGraph.from_list #2556
  • Updated ComponentGraph to enforce requiring .x and .y inputs for each component in the graph #2563

Documentation Changes

  • Added documentation for DaskEngine and CFEngine parallel engines #2560
  • Improved detail of TextFeaturizer docstring and tutorial #2568

Testing Changes

  • Added test that makes sure split_data does not shuffle for time series problems #2552

Breaking Changes

  • Moved get_hyperparameter_ranges to PipelineBase class from automl/utils module #2546
  • Renamed ComponentGraph's get_parents to get_inputs #2540
  • Removed ComponentGraph.linearized_component_graph and ComponentGraph.from_list #2556
  • Updated ComponentGraph to enforce requiring .x and .y inputs for each component in the graph #2563

v0.29.0

22 Jul 13:56
5f22f03

v0.29.0 Jul. 22, 2021

Enhancements

  • Updated 1-way partial dependence support for datetime features #2454
  • Added details on how to fix error caused by broken ww schema #2466
  • Added ability to use built-in pickle for saving AutoMLSearch #2463
  • Updated our components and component graphs to use latest features of ww 0.4.1, e.g. concat_columns and drop in-place. #2465
  • Added new, concurrent.futures based engine for parallel AutoML #2506
  • Added support for new Woodwork Unknown type in AutoMLSearch #2477
  • Updated our components with an attribute that describes if they modify features or targets and can be used in list API for pipeline initialization #2504
  • Updated ComponentGraph to accept X and y as inputs #2507
  • Removed unused TARGET_BINARY_INVALID_VALUES from DataCheckMessageCode enum and fixed formatting of objective documentation #2520
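
The concurrent.futures based engine listed above (#2506) follows a familiar submit-and-collect pattern. Below is a minimal sketch using only the standard library; evaluate_candidate and the candidate names are hypothetical, and this is not evalml's engine code.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def evaluate_candidate(name, n_folds):
    """Hypothetical scoring function; a real one would fit and score a pipeline."""
    score = sum(len(name) + fold for fold in range(n_folds)) / n_folds
    return name, score


def evaluate_all(candidates, n_folds=3, max_workers=2):
    """Submit every candidate to a pool and gather scores as they finish."""
    scores = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(evaluate_candidate, c, n_folds) for c in candidates]
        for future in as_completed(futures):
            name, score = future.result()
            scores[name] = score
    return scores
```

Results arrive in completion order, so they are keyed by candidate name rather than by position.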

Fixes

  • Fixed FraudCost objective and reverted threshold optimization method for binary classification to Golden #2450
  • Added custom exception message for partial dependence on features with scales that are too small #2455
  • Ensures the typing for Ordinal and Datetime ltypes are passed through _retain_custom_types_and_initalize_woodwork #2461
  • Updated to work with Pandas 1.3.0 #2442
  • Updated to work with sktime 0.7.0 #2499

Changes

  • Updated XGBoost dependency to >=1.4.2 #2484, #2498
  • Added a DeprecationWarning about deprecating the list API for ComponentGraph #2488
  • Updated make_pipeline for AutoML to create dictionaries, not lists, to initialize pipelines #2504
  • No longer installing graphviz on windows in our CI pipelines because release 0.17 breaks windows 3.7 #2516

Documentation Changes

  • Moved docstrings from __init__ to class pages, added missing docstrings for missing classes, and updated missing default values #2452
  • Build documentation with sphinx-autoapi #2458
  • Change autoapi_ignore to only ignore files in evalml/tests/* #2530

Testing Changes

  • Fixed flaky dask tests #2471
  • Removed shellcheck action from build_conda_pkg action #2514
  • Added a tmp_dir fixture that deletes its contents after tests run #2505
  • Added a test that makes sure all pipelines in AutoMLSearch get the same data splits #2513
  • Condensed warning output in test logs #2521

Breaking Changes

  • NaN values in the Natural Language type are no longer supported by the Imputer with the pandas upgrade. #2477

v0.28.0

02 Jul 22:14
a3aa403

v0.28.0 Jul. 2, 2021

Enhancements

  • Added support for showing an Individual Conditional Expectation plot when graphing Partial Dependence #2386
  • Exposed thread_count for Catboost estimators as n_jobs parameter #2410
  • Updated Objectives API to allow for sample weighting #2433
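
What sample weighting means for an objective (#2433) can be illustrated with a weighted mean squared error. This is a standalone sketch, not evalml's Objectives API; the parameter name sample_weight follows the common scikit-learn convention and is an assumption here.

```python
def weighted_mean_squared_error(y_true, y_pred, sample_weight=None):
    """Mean squared error where each row contributes in proportion to its weight."""
    if sample_weight is None:
        # With no weights given, every row counts equally.
        sample_weight = [1.0] * len(y_true)
    total_weight = sum(sample_weight)
    weighted_errors = (
        w * (t - p) ** 2 for w, t, p in zip(sample_weight, y_true, y_pred)
    )
    return sum(weighted_errors) / total_weight
```

Setting a row's weight to zero removes its error from the score, while a larger weight makes the same error count for more.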

Fixes

  • Deleted unreachable line from IterativeAlgorithm #2464

Changes

  • Pinned Woodwork version between 0.4.1 and 0.4.2 #2460
  • Updated psutil minimum version in requirements #2438
  • Updated log_error_callback to not include filepath in logged message #2429

Documentation Changes

  • Sped up docs #2430
  • Removed mentions of DataTable and DataColumn from the docs #2445

Testing Changes

  • Added slack integration for nightlies tests #2436
  • Changed build_conda_pkg CI job to run only when dependencies are updated #2446
  • Updated workflows to store pytest runtimes as test artifacts #2448
  • Added AutoMLTestEnv test fixture for making it easy to mock automl tests #2406

v0.27.0

23 Jun 14:35
fbbc8b3

v0.27.0 Jun. 23, 2021

Enhancements

  • Added force plots for prediction explanations #2157
  • Removed self-reference from AutoMLSearch #2304
  • Added support for nonlinear pipelines for generate_pipeline_code #2332
  • Added inverse_transform method to pipelines #2256
  • Add optional automatic update checker #2350
  • Added search_order to AutoMLSearch's rankings and full_rankings tables #2345
  • Updated threshold optimization method for binary classification #2315
  • Updated demos to pull data from S3 instead of including demo data in package #2387
  • Upgrade woodwork version to v0.4.1 #2379

Fixes

  • Preserve user-specified woodwork types throughout pipeline fit/predict #2297
  • Fixed ComponentGraph appending target to final_component_features if there is a component that returns both X and y #2358
  • Fixed partial dependence graph method failing on multiclass problems when the class labels are numeric #2372
  • Added thresholding_objective argument to AutoMLSearch for binary classification problems #2320
  • Added change for k_neighbors parameter in SMOTE Oversamplers to automatically handle small samples #2375
  • Changed naming for Logistic Regression Classifier file #2399
  • Pinned pytest-timeout to fix minimum dependence checker #2425
  • Replaced Elastic Net Classifier base class with Logistic Regression to avoid NaN outputs #2420
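
The k_neighbors handling for small samples (#2375) can be pictured as capping the requested value by the size of the smallest class, since SMOTE needs that many same-class neighbours for each minority sample. The sketch below is assumed logic for illustration, not code copied from evalml; effective_k_neighbors is a hypothetical name.

```python
from collections import Counter


def effective_k_neighbors(y, requested_k=5):
    """Cap k so the smallest class still has enough same-class neighbours."""
    smallest_class_size = min(Counter(y).values())
    # A sample cannot be its own neighbour, hence the "- 1".
    # The result is never allowed to drop below 1.
    return max(1, min(requested_k, smallest_class_size - 1))
```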

Changes

  • Cleaned up PipelineBase's component_graph and _component_graph attributes. Updated PipelineBase __repr__ and added __eq__ for ComponentGraph #2332
  • Added and applied black linting package to the EvalML repo in place of autopep8 #2306
  • Separated custom_hyperparameters from pipelines and added them as an argument to AutoMLSearch #2317
  • Replaced allowed_pipelines with allowed_component_graphs #2364
  • Removed private method _compute_features_during_fit from PipelineBase #2359
  • Updated compute_order in ComponentGraph to be a read-only property #2408
  • Unpinned PyZMQ version in requirements.txt #2389
  • Uncapping LightGBM version in requirements.txt #2405
  • Updated minimum version of plotly #2415
  • Removed SensitivityLowAlert objective from core objectives #2418

Documentation Changes

  • Fixed lead scoring weights in the demos documentation #2315
  • Fixed start page code and description dataset naming discrepancy #2370

Testing Changes

  • Update minimum unit tests to run on all pull requests #2314
  • Pass token to authorize uploading of codecov reports #2344
  • Add pytest-timeout. All tests that run longer than 6 minutes will fail. #2374
  • Separated the dask tests out into separate github action jobs to isolate dask failures. #2376
  • Refactored dask tests #2377
  • Added the combined dask/non-dask unit tests back and renamed the dask only unit tests. #2382
  • Sped up unit tests and split into separate jobs #2365
  • Change CI job names, run lint for python 3.9, run nightlies on python 3.8 at 3am EST #2395 #2398
  • Set fail-fast to false for CI jobs that run for PRs #2402

Breaking Changes

  • AutoMLSearch will accept allowed_component_graphs instead of allowed_pipelines #2364
  • Removed PipelineBase's _component_graph attribute. Updated PipelineBase __repr__ and added __eq__ for ComponentGraph #2332
  • pipeline_parameters will no longer accept skopt.space variables since hyperparameter ranges will now be specified through custom_hyperparameters #2317

v0.25.0

02 Jun 15:09
961584a

v0.25.0 Jun. 2, 2021

Enhancements

  • Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
  • Added a new callback parameter for explain_predictions_best_worst #2308

Fixes

Changes

  • Deleted the return_pandas flag from our demo data loaders #2181

Documentation Changes

Testing Changes

  • Ignoring test_saving_png_file while building conda package #2323

Breaking Changes

  • Deleted the return_pandas flag from our demo data loaders #2181
  • Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
  • Due to the weak-ref in woodwork, set the result of infer_feature_types to a variable before accessing woodwork #2181
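
The weak-reference note above is easier to follow with a generic Python illustration. The sketch uses only the standard library weakref module and a hypothetical Frame/Accessor pair; it is an analogy for why the result should be bound to a variable first, not Woodwork's implementation.

```python
import gc
import weakref


class Frame:
    """Hypothetical stand-in for a typed DataFrame."""

    def __init__(self, columns):
        self.columns = columns


class Accessor:
    """Holds only a weak reference to its frame, so it cannot keep it alive."""

    def __init__(self, frame):
        self._frame_ref = weakref.ref(frame)

    def column_names(self):
        frame = self._frame_ref()
        if frame is None:
            raise ReferenceError("frame was garbage collected")
        return list(frame.columns)


def make_frame():
    return Frame(["a", "b"])


# Risky pattern: the temporary frame has no other owner once the call returns.
dangling = Accessor(make_frame())
gc.collect()

# Safe pattern: bind the result to a variable, then access it.
frame = make_frame()
safe = Accessor(frame)
```

As long as frame stays in scope, safe.column_names() works; the dangling accessor has already lost its target.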

v0.24.2

25 May 01:00
bcfd02f

v0.24.2 May. 24, 2021

Enhancements

  • Added oversamplers to AutoMLSearch #2213 #2286
  • Added dictionary input functionality for Undersampler component #2271
  • Changed the default parameter values for Elastic Net Classifier and Elastic Net Regressor #2269

Fixes

  • Set default n_jobs to 1 for StackedEnsembleClassifier and StackedEnsembleRegressor until fix for text-based parallelism in sklearn stacking can be found #2295

Changes

  • Updated start_iteration_callback to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290

Documentation Changes

Testing Changes

  • Use codecov action to update coverage reports #2238
  • Removed MarkupSafe dependency version pin from requirements.txt and moved instead into RTD docs build CI #2261

Breaking Changes

  • Updated start_iteration_callback to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290

v0.24.1

17 May 15:58
93e043c

v0.24.1 May. 17, 2021

Enhancements

  • Integrated ARIMARegressor into AutoML #2009
  • Updated HighlyNullDataCheck to also perform a null row check #2222
  • Set max_depth to 1 in calls to featuretools dfs #2231

Fixes

  • Removed data splitter sampler calls during training #2253
  • Set minimum required version for pyzmq, colorama, and docutils #2254
  • Changed BaseSampler to return None instead of y #2272

Changes

  • Updated pipeline repr() and generate_pipeline_code to return pipeline instances without generating custom pipeline class #2227

Documentation Changes

  • Capped Sphinx version under 4.0.0 #2244

Testing Changes

  • Change number of cores for pytest from 4 to 2 #2266
  • Add minimum dependency checker to generate minimum requirement files #2267