[AutoMM] Support using data path in fit() #3006

zhiqiangdon · 2023-03-06T06:15:30Z

Issue #, if available:

Description of changes:
Support passing the path to the training data in fit(), as an alternative to an in-memory DataFrame.

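from autogluon.multimodal import MultiModalPredictor
train_data_path = "train.csv"  # placeholder path, for illustration only
predictor = MultiModalPredictor(label="label")
predictor.fit(train_data=train_data_path)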

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

suzhoum

LGTM! Great feature!

FANGAreNotGnu

LGTM

multimodal/src/autogluon/multimodal/utils/data.py

github-actions · 2023-03-06T19:57:32Z

Job PR-3006-7c81474 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3006/7c81474/index.html

Innixma · 2023-03-07T00:27:53Z

multimodal/src/autogluon/multimodal/utils/data.py

+def split_train_tuning_data(
+    train_data: Union[pd.DataFrame, str],
+    tuning_data: Optional[Union[pd.DataFrame, str]] = None,
+    holdout_frac: Optional[float] = None,
+    is_classification: Optional[bool] = False,
+    label_column: Optional[str] = None,
+    seed: Optional[int] = 123,
+):


Data loading should not be done during data splitting. This is not the logical place for it.

I considered loading the data in predictor.py before calling split_train_tuning_data, but then I would need to repeat the same loading code in matcher.py, which increases boilerplate. How about renaming this function to prepare_train_tuning_data, so that it covers more than just splitting data?

Considering PR #3004, I moved the data loading outside of split_train_tuning_data.
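The separation discussed above (load first, then split) can be sketched roughly as follows. This is only an illustration, not the code merged in this PR: `load_if_path` and `split_frames` are hypothetical names, the CSV/Parquet handling is an assumption, and the random hold-out split ignores the `is_classification` and `label_column` arguments of the real function.

```python
from typing import Optional, Tuple, Union

import pandas as pd


def load_if_path(data: Optional[Union[pd.DataFrame, str]]) -> Optional[pd.DataFrame]:
    """Hypothetical helper: return a DataFrame, reading from disk if given a path."""
    if data is None or isinstance(data, pd.DataFrame):
        return data
    if isinstance(data, str):
        # Assumption: this sketch only distinguishes Parquet from CSV.
        if data.endswith(".parquet"):
            return pd.read_parquet(data)
        return pd.read_csv(data)
    raise TypeError(f"Expected a DataFrame or a path string, got {type(data)!r}")


def split_frames(
    train_data: pd.DataFrame,
    tuning_data: Optional[pd.DataFrame] = None,
    holdout_frac: float = 0.2,
    seed: int = 123,
) -> Tuple[pd.DataFrame, pd.DataFrame]:
    """Splitting only: both inputs are already DataFrames at this point."""
    if tuning_data is not None:
        return train_data, tuning_data
    # Simple random hold-out; relies on train_data having a unique index.
    tuning = train_data.sample(frac=holdout_frac, random_state=seed)
    train = train_data.drop(tuning.index)
    return train, tuning
```

With this shape, a caller would first normalize both inputs with `load_if_path` and only then call `split_frames`, so the splitting function never has to know about file paths.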

@Innixma Any further questions on this?

Since it is moved outside, I am ok to merge.

The ideal long-term solution is to use a mix-in, or to have the classes in predictor.py and matcher.py inherit from the same abstract class. This would avoid code duplication.
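A minimal sketch of the mix-in idea, under stated assumptions: `DataPathMixin`, `SketchPredictor`, and `SketchMatcher` are hypothetical names and do not correspond to the actual classes in predictor.py or matcher.py. To stay dependency-free, the sketch reads CSV files into plain lists of row dictionaries rather than pandas DataFrames.

```python
import csv
from typing import Dict, List, Union

Rows = List[Dict[str, str]]


class DataPathMixin:
    """Hypothetical mix-in holding the shared path-or-data normalization."""

    def _load_if_path(self, data: Union[Rows, str]) -> Rows:
        # A string is treated as a CSV path; anything else is passed through.
        if isinstance(data, str):
            with open(data, newline="") as f:
                return list(csv.DictReader(f))
        return data


class SketchPredictor(DataPathMixin):
    def fit(self, train_data: Union[Rows, str]) -> "SketchPredictor":
        self.train_rows = self._load_if_path(train_data)
        return self


class SketchMatcher(DataPathMixin):
    def fit(self, train_data: Union[Rows, str]) -> "SketchMatcher":
        self.train_rows = self._load_if_path(train_data)
        return self
```

Both classes get the loading behavior from one place, so the loading code does not need to be repeated in each file.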

github-actions · 2023-03-07T00:29:19Z

Job PR-3006-1f12920 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3006/1f12920/index.html

github-actions · 2023-03-07T07:39:43Z

Job PR-3006-13509b2 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3006/13509b2/index.html

Innixma

LGTM!

support data path in fit()

5d295a9

zhiqiangdon requested a review from liangfu March 6, 2023 06:15

zhiqiangdon added the "model list checked" label (You have updated the model list after modifying multimodal unit tests/docs) Mar 6, 2023

fix

7c81474

zhiqiangdon requested review from FANGAreNotGnu and suzhoum March 6, 2023 18:29

suzhoum approved these changes Mar 6, 2023

FANGAreNotGnu approved these changes Mar 6, 2023

liangfu requested changes Mar 6, 2023

multimodal/src/autogluon/multimodal/utils/data.py (outdated, resolved)

liangfu approved these changes Mar 6, 2023

type hints

1f12920

Innixma requested changes Mar 7, 2023

update

13509b2

Innixma approved these changes Mar 7, 2023

zhiqiangdon merged commit 67e6e83 into autogluon:master Mar 7, 2023

zhiqiangdon deleted the mm-fix branch March 10, 2023 05:31
