Implement a similar interface to sklearn's make_pipeline #17

bangxiangyong · 2019-11-14T00:08:32Z

Been having this in mind of viewing data analytics/ML as a workflow (which current implementation for ZEMA EMC with Haris Lulic is kind-of adopting already), that is a sequence of functions applied one after the other.

sklearn reference

Example:
group_of_agents = make_pipeline(FFT(), PCA(pca_parameters), LDA(lda_parameters))

The arguments are any number of objects which implements the function fit and transform similar to how sklearn structures its classes
The line above should instantiate 3 agents connected from left to right (FFT -> PCA -> LDA)
Further, for this to work, a generic ML agent class which accepts these data analytics methods should be developed

The advantage with the agent network architecture:

Pipelines which use similar components/agents will only need to process once if the agent connections are made dynamic. The code below should create 4 agents (FFT -> PCA -> LDA & ANN) instead of 6 agents (sum of number of agents in both pipelines)
Example:

group_of_agents_1 = make_pipeline(FFT(), PCA(pca_parameters), LDA()) 
group_of_agents_2 = make_pipeline(FFT(), PCA(pca_parameters), ANN())

Entire data processing pipelines can be visualized via dashboard immediately
Compatible immediately with sklearn and more fluid integration with data science
Promotes incremental development for mathematical components. Such as when investigating the effect of added noise/bias, we can have two pipelines to be compared:

group_of_agents_1 = make_pipeline(AddNoise(), FFT(), LDA()) 
group_of_agents_2 = make_pipeline(FFT(),  LDA())

The text was updated successfully, but these errors were encountered:

bangxiangyong · 2019-11-25T20:48:43Z

More information here : (https://www.slideshare.net/yongbangxiang/use-cases-agentmet4fof)

bangxiangyong · 2019-11-26T21:24:04Z

This is how the interface should look like:

This is to specify the pipelines

#option 1 : group of pipelines with 3 levels
ML_pipelines_A = make_agent_pipelines([PCA(), KNN()],
                                [StandardScaler(),RobustScaler()],
                                [LinearRegression(),ANN()], parameters)

#option 2 : multi pipelines of single level
ML_pipelines_B = make_agent_pipelines([CNN(),BCNN(),ANN()], parameters)

And their candidate parameters to be grid-searched

#example of parameters for pipelines
parameters = ([{"n_components":[1,2,3,4,5]}, {"n":[4]}],
              [],
              [0,{"dimensions":[120,233,345,666]}])

Then connection to data, evaluator and monitor looks like this:

#connect data
data_agent.connect(ML_pipelines)

#connect evaluator such as F1 SCORE, PICP, etc
ML_pipelines.connect(evaluator)

#collect experiment statistics with Monitor Agent
evaluator.connect(monitor_agent)

Lastly, to execute and log them as "Experiments" #25

#loop through the parameters and log on MLFLOW
aggregated_results = run_experiment([data_agent_1, data_agent_2], [ML_pipelines_A,ML_pipelines_B,ML_pipeline_C])

BjoernLudwigPTB · 2021-09-16T20:51:07Z

That appears to be resolved by #33 long ago.

bangxiangyong mentioned this issue Nov 26, 2019

Support ZeMA's effort to integrate their machine learning with agentMET4FOF #20

Closed

BjoernLudwigPTB mentioned this issue Nov 26, 2019

Integrate PyDynamic #18

Open

bangxiangyong mentioned this issue Nov 26, 2019

Implement "Experiments" #25

Closed

BjoernLudwigPTB added this to Sprint backlog in agentMET4FOF's progress Nov 26, 2019

bangxiangyong self-assigned this Nov 26, 2019

bangxiangyong moved this from Sprint backlog to In progress in agentMET4FOF's progress Nov 26, 2019

bangxiangyong moved this from In progress to Review in progress in agentMET4FOF's progress Dec 9, 2019

bangxiangyong mentioned this issue Dec 9, 2019

Feature/ml experiments #33

Merged

bangxiangyong moved this from Review in progress to Done in agentMET4FOF's progress Dec 11, 2019

BjoernLudwigPTB closed this as completed Sep 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement a similar interface to sklearn's make_pipeline #17

Implement a similar interface to sklearn's make_pipeline #17

bangxiangyong commented Nov 14, 2019 •

edited by BjoernLudwigPTB

Loading

bangxiangyong commented Nov 25, 2019

bangxiangyong commented Nov 26, 2019 •

edited

Loading

BjoernLudwigPTB commented Sep 16, 2021

Implement a similar interface to sklearn's make_pipeline #17

Implement a similar interface to sklearn's make_pipeline #17

Comments

bangxiangyong commented Nov 14, 2019 • edited by BjoernLudwigPTB Loading

The advantage with the agent network architecture:

bangxiangyong commented Nov 25, 2019

bangxiangyong commented Nov 26, 2019 • edited Loading

BjoernLudwigPTB commented Sep 16, 2021

bangxiangyong commented Nov 14, 2019 •

edited by BjoernLudwigPTB

Loading

bangxiangyong commented Nov 26, 2019 •

edited

Loading