Add experiment heuristics to automl module (variant of Avanika PR 1362) #1507

Merged: 7 commits into ludwig-ai:tf-legacy on Nov 26, 2021

Conversation

amholler (Collaborator) commented on Nov 21, 2021:

This PR is a variant of Avanika's PR 1362, generated against the tf-legacy branch.
The changes relative to PR 1362 are:

  1. Corrected an issue when generating the config for the concat model type (missing merge_dict; noted in a comment on PR 1362)
  2. Updated to use tune.with_parameters to support Ray Tune on large datasets
  3. Ensured the trial limit is consistently time-based (not epoch-based), as intended
  4. Omitted the Python style changes that the reviewer wanted removed
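For reference, change (2) typically follows the standard Ray Tune pattern below. This is a hedged sketch, not the actual Ludwig code: train_fn and big_dataset are hypothetical stand-ins; the point is that tune.with_parameters ships the large object through the Ray object store instead of serializing it into every trial spec.

```python
from ray import tune

def train_fn(config, dataset=None):
    # `config` holds the sampled hyperparameters; `dataset` arrives via the
    # Ray object store rather than being pickled into each trial's spec,
    # which is what makes large datasets workable.
    score = config["lr"] * len(dataset)  # stand-in for real training
    tune.report(score=score)

big_dataset = list(range(1_000_000))  # placeholder for a large dataset
trainable = tune.with_parameters(train_fn, dataset=big_dataset)
tune.run(trainable, config={"lr": tune.loguniform(1e-4, 1e-1)}, num_samples=2)
```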

This PR has been run successfully on the following datasets:

  • ames_housing, forest_cover, mushroom_edibility, synthetic_fraud [tabnet]
  • higgs [concat, large]
  • otto_group_product [concat]

tgaddair (Collaborator) left a comment:

Love the new heuristics. Added some comments, let me know what you think.

ludwig/automl/automl.py (outdated; resolved)

encoder_defaults = {
"text": {
"bert": os.path.join(CONFIG_DIR, "text/bert_config.yaml"),

tgaddair (Collaborator) commented:

While BERT gets very good performance, I feel like its cost/performance ratio is pretty bad (running on a single V100, for example, the batch size ends up being so small that you'd be better off using something else like DistilBERT). Finding a batch size small enough to work for BERT in such cases is also a challenge. But I'm curious if you've observed something different.

amholler (Collaborator, author) commented on Nov 25, 2021:

Interesting. It looks like Avanika put this code here as a placeholder; her comment indicates that BERT is the default
and that more robust heuristics should be added later. I personally have not done any text-dataset testing.
Given that it is the default, maybe it can stay until there are some updates in this area?

tgaddair (Collaborator) commented:

Sounds good, we can try it out and see how it goes.

num_samples = base_automl_config["hyperopt"]["sampler"]["num_samples"]
if num_samples is not None:
# allow trials to get 2x even division, since some trials perform better than avg
base_automl_config["hyperopt"]["sampler"]["scheduler"][

tgaddair (Collaborator) commented:

I'm curious what happens if we let the per trial max_t equal the total training time. Would this result in some of the num_samples never getting tried? I would hope that hyperband would be fair in its scheduling so that it would try all the samples for a small amount of time before letting the best trials run indefinitely.

amholler (Collaborator, author) commented:

Well, I thought I tried this and observed starvation of trials, but let me try it again to double-check.

amholler (Collaborator, author) commented on Nov 25, 2021:

Okay, I just confirmed that for an example Ray Tune async hyperband run, if I specify num_samples=20
and set max_t == time_limit_s, then the first 3 trials each run for max_t time and no other trials are run.
We can talk with the Ray Tune folks about this, but that is the behavior I observe.
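The "2x even division" heuristic from the snippet above can be illustrated with hypothetical numbers. The key names mirror the quoted snippet, but the values and the "async_hyperband" type string here are illustrative assumptions, not the merged defaults:

```python
# Hypothetical budget: 20 trials sharing a 2-hour time limit.
time_limit_s = 7200
num_samples = 20

# An even split would give each trial time_limit_s / num_samples seconds.
# Allowing 2x that lets above-average trials run longer, while still
# avoiding the starvation observed when max_t == time_limit_s.
max_t = 2 * time_limit_s / num_samples

hyperopt = {
    "sampler": {
        "num_samples": num_samples,
        "scheduler": {
            "type": "async_hyperband",
            "max_t": max_t,  # per-trial cap, in seconds
        },
    },
}
print(hyperopt["sampler"]["scheduler"]["max_t"])  # 720.0
```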

tgaddair (Collaborator) commented:

Thanks for confirming, sounds like this is the right thing to do for now, then. Would be great if we could find a way to both explore all trials and let trials run as long as they can.

amholler (Collaborator, author) commented:

Just posted this question to the Ray Tune Slack channel.

# TODO (ASN): add image heuristics

# override and constrain automl config based on user specified values
if user_specified_config is not None:

tgaddair (Collaborator) commented:

I wonder if something like OmegaConf would help simplify the merging process, particularly now that we also have a base_config and combiner config separation in addition to the user_config here.

https://omegaconf.readthedocs.io/en/2.1_branch/usage.html#omegaconf-merge
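For comparison, OmegaConf.merge performs a right-biased recursive merge. A minimal plain-dict sketch of the same semantics (deep_merge and the sample configs here are hypothetical illustrations, not Ludwig code):

```python
def deep_merge(base: dict, override: dict) -> dict:
    """Recursively merge `override` into `base`; later values win,
    mirroring the right-bias of OmegaConf.merge."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(merged.get(key), dict) and isinstance(value, dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged

base_config = {"combiner": {"type": "concat", "num_fc_layers": 1}}
user_config = {"combiner": {"num_fc_layers": 3}}
print(deep_merge(base_config, user_config))
# {'combiner': {'type': 'concat', 'num_fc_layers': 3}}
```

Note that a plain merge like this does not cover the special case discussed below (pruning overlapping hyperopt parameters), which is why the existing code is more involved.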

amholler (Collaborator, author) commented:

Thanks for the reference to OmegaConf, looks nice!

My understanding of the complexity of this code is that it removes entries from the hyperopt parameters
section that overlap with the config_section.key entries included in the user_config. I don't see a
particular feature of OmegaConf that handles this specialized case. Apologies if I am missing something here.

tgaddair (Collaborator) commented:

I think it will end up being a refactor of some things. I can take a look in a follow-up, but my goal would be to eventually reduce the complexity to "merge a bunch of independent configs".

tgaddair (Collaborator) left a comment:

LGTM!

@tgaddair tgaddair merged commit e26bdce into ludwig-ai:tf-legacy Nov 26, 2021
tgaddair pushed a commit that referenced this pull request Nov 29, 2021
…2) (#1507)

Co-authored-by: Anne Holler <anne@vmware.com>
tgaddair added a commit that referenced this pull request Nov 29, 2021
…2) (#1507) (#1527)

Co-authored-by: Anne Holler <anne@vmware.com>

Co-authored-by: amholler <86269492+amholler@users.noreply.github.com>
Co-authored-by: Anne Holler <anne@vmware.com>