Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Task Array (flexible feature-wise task assignment for dynamic modular learning) #504

Open
Annie-LiGHT opened this issue Oct 26, 2022 · 0 comments
Labels
feature New feature or request

Comments

@Annie-LiGHT
Copy link
Collaborator

PROBLEM

Currently in DISCO, tasks comprise a single model with an expected input of a restricted size.

  • We previously proposed way to modularize training into a series of feature-specific (or modality-specific) modules i.e. MoDN: Modular Decision Support Networks
  • This approach allows training on imperfectly interoperable (IIO) datasets i.e. where one client has a different feature set than another, but with an overlapping subset in common.

Screenshot 2022-10-26 at 10 59 38
Like in the image, if the x-axis is number_of_features)

  • This scenario can be thought of as "horizontally vertical" distributed learning (i.e. different rows AND different columns)
  • MoDN proposes 1encoder/feature and 1decoder/label, thus producing an array of models of size [feature set + label set]
  • This can also apply to non-MoDN models (for example "Predict COVID" as a meta-task with subtasks of ["using demographic features" or "using lab and demographic features"....etc.]

PROPOSED FEATURE

  • Support a meta-task that comprises an array of sub-task modules (i.e. an array of models)
  • Connecting data sets with a various number of IIO features to meta-task
  • Assign sub-task modules to users according to that users' available features

Example

  • In MoDN if there is 1encoder/feature in a tabular dataset, we want to
    - only have one "meta task" or task array for the set of modules (as opposed to a long list of tasks for feature-specific encoder)
    - only send models relevant to the user's feature set.
    - allow input from tabular datasets of various sizes and assign sub-tasks to features.
@Annie-LiGHT Annie-LiGHT added the feature New feature or request label Oct 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant