
What's wrong with the way we export learning networks as new model types #831

Closed · ablaom opened this issue Aug 29, 2022 · 1 comment
ablaom commented Aug 29, 2022

The learning networks API appeared in the very earliest versions of MLJ and, in response to a great many feature requests, has grown over time into a bit of a hack. Here are some complaints:

  • From the user's point of view, the process for exporting a learning network is too complicated (e.g., one must first define a learning network machine, and so on; see the first sketch after this list)
  • It is also too mysterious (too much hidden knowledge)
  • From the developer's point of view, it is likewise too complicated and too mysterious, making it a challenge to maintain and enhance, even for those very familiar with it.
  • There are unnatural restrictions on what a user can do, such as the one that led to Prohibit distinct fields in composite models pointing to two models that are === #377 (see the second sketch after this list). This becomes worse if we move to allowing immutable model structs (for, e.g., integration with TableTransforms.jl)
  • Retraining a composite is not "smart" if a component model is replaced, only if it is mutated in place (also illustrated in the second sketch below)
  • The fact that fitted_params(::Machine) and report(::Machine) need special-casing for machines bound to composite models feels like an unnecessary complication and is making it difficult to design uniform interfaces; see, e.g., Storing intermediate results of a Composite Model MLJ.jl#841 (comment).
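For concreteness, here is roughly what the export workflow being criticized looks like, as I recall the current API (circa MLJBase 0.20); this is a sketch only, and `CompositeA` and its single field are made up for illustration:

```julia
using MLJ
import MLJBase

# a made-up composite with one tunable component:
mutable struct CompositeA <: DeterministicComposite
    regressor
end

function MLJBase.fit(composite::CompositeA, verbosity, X, y)
    Xs = source(X)
    ys = source(y)

    # build the learning network:
    stand = machine(Standardizer(), Xs)
    W = transform(stand, Xs)
    reg = machine(composite.regressor, W, ys)  # points to a hyper-parameter
    yhat = predict(reg, W)

    # the extra "learning network machine" step users find mysterious:
    network_mach = machine(Deterministic(), Xs, ys; predict=yhat)
    return!(network_mach, composite, verbosity)
end
```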
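And a second sketch, illustrating the restriction of #377 and the retraining complaint. Here `CompositeB` is hypothetical: assume it is an already-exported composite with two model fields and a keyword constructor:

```julia
using MLJ
KNNRegressor = @load KNNRegressor pkg=NearestNeighborModels

knn = KNNRegressor()

# currently prohibited, because the two fields would hold the same (===)
# instance (see #377):
# composite = CompositeB(one=knn, two=knn)

composite = CompositeB(one=knn, two=KNNRegressor())

X, y = make_regression(100, 3)
mach = machine(composite, X, y)
fit!(mach)

composite.one.K = 2        # mutating the component in place:
fit!(mach)                 # retraining is "smart" (only affected machines refit)

composite.one = KNNRegressor(K=2)  # replacing the component:
fit!(mach)                         # smartness lost; retrains from scratch
```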

These issues do not just affect esoteric applications of model composition; they are holding back some important developments, e.g., in outlier detection and in TableTransforms.jl integration.

The root causes for these issues are:

1. The way we currently establish a mapping from the composite model hyper-parameters that specify component models to the corresponding machines in the network, i.e., the machines that are meant to point to those component models.
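MLJ's actual mechanism differs in detail, but a toy version of identity-based association (which is, as I understand it, what the current design effectively relies on) shows the fragility:

```julia
# Toy illustration only (not MLJ's actual code): associate "machines" to
# model instances by object identity.
association = IdDict{Any,String}()

m1 = Ref(1.0)                       # stand-in for a mutable model instance
association[m1] = "machine for m1"

m1[] = 2.0                          # mutation: identity survives
@assert haskey(association, m1)     # machine still found -> "smart" retrain

m2 = Ref(2.0)                       # replacement: a fresh identity
@assert !haskey(association, m2)    # association lost -> retrain from scratch

# And if two fields hold the very same instance, the association is
# ambiguous, which is why #377 prohibits that arrangement.
```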

2. The fact that we prematurely merge reports from fit with reports from an operation, instead of keeping them separate in the machine and providing a model-specific method to say how to combine them.
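For 2, the kind of separation I have in mind looks something like the following. This is entirely hypothetical: `combined_report`, `fit_report` and `operation_reports` do not exist in MLJBase today, and the real hook may look different:

```julia
import MLJBase

struct SomeModel <: MLJBase.Deterministic end   # placeholder model type

# Hypothetical model-specific combiner: given the report returned by fit and
# a named tuple of per-operation reports kept separately on the machine,
# say how to merge them into what report(mach) ultimately returns:
combined_report(::SomeModel, fit_report, operation_reports) =
    merge(fit_report, (predict=operation_reports.predict,))
```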

I have a pretty good idea of how to resolve 1, but it will take a little time to put together. Fixing 2 after 1 will be easier, and what is needed there is more or less obvious. Stay tuned for PRs to address these. Thank you for your patience.
