
Add multi target classification #441

Merged

Conversation

@YonyBresler (Contributor) commented on May 1, 2024:

Added the ability for multi-target classification, discussed in #430.

Highlighting some of the changes:

  1. Multi-target classification is initiated by passing a list of targets, the same as in multi-target regression (a usage sketch follows this list)
  2. In the inferred config, added 'output_cardinality' to track the number of classes per target
  3. label_encoder is now a list; label_encoder[i] corresponds to the i-th target
  4. Updated unit tests for all models to include a multi-target variation of classification
  5. Updated documentation to remove the stated limitation that multi-target classification is not supported
  6. [pre-commit.ci] auto fixes from pre-commit.com hooks
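
For context, here is a minimal usage sketch of what the feature enables. The column names and synthetic data are purely illustrative, and CategoryEmbeddingModelConfig is just one example model config; the essential part is passing a list of targets together with task="classification":

```python
import numpy as np
import pandas as pd

from pytorch_tabular import TabularModel
from pytorch_tabular.config import DataConfig, OptimizerConfig, TrainerConfig
from pytorch_tabular.models import CategoryEmbeddingModelConfig

# Tiny synthetic dataset with two classification targets (illustrative only)
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "num_1": rng.normal(size=200),
    "num_2": rng.normal(size=200),
    "cat_1": rng.choice(["a", "b"], size=200),
    "target_a": rng.choice([0, 1, 2], size=200),  # 3 classes
    "target_b": rng.choice([0, 1], size=200),     # 2 classes
})
train_df, test_df = df.iloc[:150], df.iloc[150:]

data_config = DataConfig(
    target=["target_a", "target_b"],  # a list of targets, same pattern as multi-target regression
    continuous_cols=["num_1", "num_2"],
    categorical_cols=["cat_1"],
)
model_config = CategoryEmbeddingModelConfig(task="classification")

tabular_model = TabularModel(
    data_config=data_config,
    model_config=model_config,
    optimizer_config=OptimizerConfig(),
    trainer_config=TrainerConfig(max_epochs=5),
)
tabular_model.fit(train=train_df)
pred_df = tabular_model.predict(test_df)  # predictions for each target column
```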

Things that I'm happy to improve:

  1. With custom metrics, we now need a set of parameters for each metric and for each target (since the number of classes can vary). I'm not very familiar with OmegaConf; I create a sub_params_list config object with OmegaConf.create() on the fly, but I wonder whether there's a cleaner way to do this (a sketch of the approach follows this list).
  2. Bagging predict would require significant changes to allow multi-target classification. It is not clear to me whether multi-target regression is currently supported in tabular_model._combine_predictions(). For now, bagging only works correctly with single-target classification.
  3. While there are tests checking that multi-target classification works, all current model tests only verify that it runs without errors, not that it produces correct results.
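
As a rough illustration of the current approach (the field names "task" and "num_classes" are illustrative torchmetrics-style kwargs, not necessarily exactly what the code passes through):

```python
from omegaconf import OmegaConf

# One dict of metric parameters per target, since output_cardinality can differ between targets
output_cardinality = [3, 2]  # e.g. target_a has 3 classes, target_b has 2
sub_params_list = OmegaConf.create(
    [{"task": "multiclass", "num_classes": c} for c in output_cardinality]
)

assert sub_params_list[0].num_classes == 3  # indexable like a list, attribute access like a config
```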

📚 Documentation preview 📚: https://pytorch-tabular--441.org.readthedocs.build/en/441/

@manujosephv (Owner) commented:

Thanks a lot for this PR. This is something that was requested by a lot of folks in the community! I'm running a bit from pillar to post at the moment, but I'll take a look at the code as soon as I get a chance :)

@manujosephv (Owner) commented:

Thank you @YonyBresler for the excellent PR. I have almost no suggestions, except one minor thing about an error message! And apologies for the tardiness in reviewing the PR; things have been crazy at work.

Also, the custom_metrics solution is something I can live with.

One more thing I would add is a tutorial notebook, maybe a how-to guide? That would be a good place to set down how one would use multi-target classification.

@manujosephv (Owner) commented:

@YonyBresler Did you manage to take a look at the comment? Or do you want me to make the changes and merge the PR? Let me know. This is a very useful and much-requested feature.

@YonyBresler (Contributor, Author) commented:

Hi @manujosephv, it's in progress. While preparing the notebook I found and fixed a bug in how the metric is reported in some circumstances (already committed).

There's still an issue with certain metrics (those that require probabilities) and auto-lr for models like GANDALF: it causes an error, and I believe things are not getting initialized properly with the new changes.

Let me see if I can resolve it; if not, I may add a restriction so that auto-lr is not used for multi-target classification with models that can't handle it right now. But I hope I can fix it.
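
In the meantime, a possible user-side workaround is simply to leave the learning-rate finder off (a sketch; auto_lr_find is the relevant TrainerConfig flag):

```python
from pytorch_tabular.config import TrainerConfig

# Keeping auto_lr_find disabled avoids triggering the LR finder for
# multi-target classification until the initialization issue is resolved.
trainer_config = TrainerConfig(auto_lr_find=False, max_epochs=20)
```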

@YonyBresler (Contributor, Author) commented:

Thanks for your patience @manujosephv. I've resolved the issue (it wasn't a bug, but rather an improper configuration) and added a tutorial notebook that walks through multi-target classification (albeit with a very rudimentary second target).

As far as I can tell, it should be good to go. Please take a look when you get a chance and let me know if there's any issue or if it's ready to merge.

Thanks!

@manujosephv (Owner) commented:

There is some error in the test cases. I think it's not your code but some library compatibility issue. I'll try to figure it out as soon as I get some time.

@YonyBresler (Contributor, Author) commented:

It's weird: I tested on a clean Python 3.10.14 install and all tests pass with no library issue, so I'm not sure what's causing it in your test script.
Let me know if there's something I can do to help resolve this.

@YonyBresler (Contributor, Author) commented:

Thanks for resolving this issue @Borda!

@@ -2035,23 +2036,21 @@ def _combine_predictions(
        elif callable(aggregate):
            bagged_pred = aggregate(pred_prob_l)
        if self.config.task == "classification":
            classes = self.datamodule.label_encoder.classes_
            # FIXME need to iterate .label_encoder[x]
@manujosephv (Owner) commented on this diff:

Maybe we should raise an error if somebody attempts bagging predict with multi-target classification?
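
A minimal sketch of such a guard, assuming the number of targets is available via self.config.target and that label_encoder is now a list (change 3 in the PR description):

```python
if self.config.task == "classification":
    if len(self.config.target) > 1:
        # Assumed attribute names; the guard simply refuses multi-target bagging for now
        raise NotImplementedError(
            "Bagging predict is not yet supported for multi-target classification."
        )
    classes = self.datamodule.label_encoder[0].classes_
```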

@manujosephv merged commit 25691f5 into manujosephv:main on Sep 17, 2024. 9 checks passed.