Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
v0.15.0 Oct. 29, 2020
Enhancements
StackedEnsembleClassifier
,StackedEnsembleRegressor
) Implement stacked ensemble classes #1134AutoMLSearch
Integrate ensemble methods in AutoML #1253DecisionTreeClassifier
andDecisionTreeRegressor
to AutoML Add DecisionTreeClassifier and DecisionTreeRegressor to AutoML #1255graph_prediction_vs_actual
inmodel_understanding
for regression problems Add Plot for Prediction vs Actual for Regression Problems #1252OneHotEncoder
to enable filtering for features to encode for Allow user to filter which features to encode for OneHotEncoder #1249HighVarianceCVDataCheck
and replaced synonymous warning inAutoMLSearch
Add HighVarianceCVDataCheck #1254PCA Transformer
component for dimensionality reduction Add PCA component #1270generate_pipeline_code
andgenerate_component_code
to allow for code generation given a pipeline or component instance Code Generation for Pipelines and Components #1306PCA Transformer
component for dimensionality reduction Add PCA component #1270AutoMLSearch
to supportWoodwork
data structures Update AutoMLSearch to support WoodWork DataTables #1299ClassImbalanceDataCheck
and added this check toDefaultDataChecks
Add ClassImbalanceDataCheck to DefaultDataChecks #1333max_batches
argument toAutoMLSearch.search
public Added max_batches as a public parameter #1320_pipelines_per_batch
as a private argument toAutoMLSearch
Added_pipelines_per_batch
as a private argument toAutoMLSearch
#1355Fixes
evalml info
CLI command Clean up lacking codecov for __main__.py and addevalml info
to docs #1293boosting type='rf'
for LightGBM Classifier, as well asnum_leaves
error Fix 'RF' error for LightGBM Classifier #1302explain_predictions_best_worst
where a custom index in the target variable would cause aValueError
explain_predictions_best_worst custom index bug fix #1318evalml.pipelines.__init__
file Add stacked ensemble to __init__ file for pipelines #1326top_n
was less than the number of categories in a column Make OHE deterministic when top_n < no. of categories #1324HighVarianceCVDataCheck
Fix 'Invalid Value Encountered' error during AutomLSearch #1346random_state
was not being passed to the created pipelines Passing random state to pipelines created by IterativeAlgorithm next_batch #1321Changes
add_to_rankings
to be called before AutoMLSearch is called allowadd_to_rankings
to work before search #1250max_pipelines
parameter fromAutoMLSearch
Removemax_pipelines
parameter fromAutoMLSearch
#1264featuretools
andnlp_primitives
core dependencies Integrate TextFeaturizer to automl #1062PartOfSpeechCount
fromTextFeaturizer
transform primitives Integrate TextFeaturizer to automl #1062partial_dependency
when the feature includes null values Warning on null values for partial_dependency #1352Documentation Changes
evalml info
command to documentation Clean up lacking codecov for __main__.py and addevalml info
to docs #1293feature_importance
Fix docstring for feature_importance #1353AutoML
with text data Add a tutorial for text data #1357Testing Changes
jupyter_check
to handle IPython Add Test Coverage for IPython #1256make_pipeline
tests to test for all estimators Clean upmake_pipeline
tests #1257__main__.py
and unnecessary Clean up lacking codecov for __main__.py and addevalml info
to docs #1293Breaking Changes
LabelLeakageDataCheck
toTargetLeakageDataCheck
Rename LabelLeakageDataCheck to TargetLeakageDataCheck. #1319max_pipelines
parameter has been removed fromAutoMLSearch
. Please usemax_iterations
instead. Removemax_pipelines
parameter fromAutoMLSearch
#1264AutoMLSearch.search()
will now log a warning if the input is not aWoodwork
data structure (pandas
,numpy
) Update AutoMLSearch to support WoodWork DataTables #1299max_batches
argument toAutoMLSearch.search
public Added max_batches as a public parameter #1320feature_types
from AutoMLSearch.search Integrate TextFeaturizer to automl #1062