v0.1.0
Added:
- Added
ParallelPipelineclass to pipeline module & unit tests. - Added
rulesparameter to filter modules, allowing a given rule set to be filtered. - Added
rulesparameter toRBSOptimiser, allowing a given rule set to be filtered to the rules remaining after optimisation. - Added
rulesattribute to rule generators (which is aRulesclass containing the generated rules). - Added example notebook for
ParallelPipelineclass. - Added advanced example notebook for
BayesSearchCVclass. - Added
__repr__toAgglomerativeClusteringReducerclass.
Changed:
- The
get_paramsmethod in theLinearPipelineandParallelPipelineclasses now returns a dictionary of parameters and their values for each pipeline step. - Removed
num_coresparameter fromRBSPipelineandRBSOptimiserclasses (as it was ignored anyway). - Updated
LinearPipelineandBayesSearchCVexample notebook with diagrams. - Relaxed versions of packages required in setup.py.
Improvements:
- Classes in later stages of a
LinearPipelinecan now use the initial datasets if required (e.g. if a rule optimiser is placed after a rule generator, the rule optimiser can be configured to use the initial feature set to optimise the generated rules). See theuse_init_dataparam inLinearPipelineclass. ClassAccessorclasses can now be used in parameters that are mutable iterables - this allows attributes from classes earlier in a pipeline to be passed to parameters of classes later in the pipeline that are mutable iterables (e.g. lists).- Users can set whether they want to use the
sample_weightwhen calculating themetricof the validation fold in theBayesSearchCVclass (prior to this, thesample_weightwas always used if provided). See thesample_weight_in_valparameter in theBayesSearchCVclass. Rulesclasses can now be combined using+andsum().
Fixes:
- Updated rule generators so rule names remain constant if the
fitmethod is applied more than once after class instantiation. intvalues are now properly parsed when converting fromrule_stringstorule_dictsin theRulesclass.- Added a check for the parameter name when updating a class parameter in a
LinearPipelineorParallelPipeline- if the parameter does not exist, an error is thrown (prior to this, no error was thrown). - Updated the
RBSPipelineclass to return the index ofX_rulesin the prediction.