
Filter auto_transformer kwargs based on forward signature #3329

Merged · tgaddair merged 2 commits into master from fix-bloom on Apr 7, 2023
Conversation

@tgaddair (Collaborator) commented on Apr 7, 2023:

Fixes #3328.
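For context, a minimal sketch of the kwargs-filtering idea behind this fix, assuming the encoder collects keyword arguments into a dict before calling the Hugging Face model; the helper name is hypothetical and may differ from the actual implementation in this PR:

import inspect

def filter_kwargs_for_forward(model, kwargs):
    # Keep only the kwargs that the model's forward() signature actually accepts,
    # so models whose signatures lack e.g. token_type_ids (Bloom, GPT-J) don't raise TypeError.
    accepted = set(inspect.signature(model.forward).parameters)
    return {name: value for name, value in kwargs.items() if name in accepted}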

@tgaddair added the labels bug (Something isn't working) and release-0.7 (Needs cherry-pick into 0.7 release branch) on Apr 7, 2023
@jeffkinnison (Contributor) left a comment:

LGTM! Just some nits and a quick question.

ludwig/encoders/text_encoders.py (resolved)
tests/ludwig/encoders/test_text_encoders.py (resolved)
ludwig/encoders/text_encoders.py (resolved)
"hf-internal-testing/tiny-random-GPTJModel",
],
)
def test_hf_ludwig_model_auto_transformers(tmpdir, csv_filename, pretrained_model_name_or_path):
@jeffkinnison (Contributor) commented:
Could you mark this with @pytest.mark.slow?

@tgaddair (Collaborator, Author) replied:
I think this should only take 30s, so I would want to run it on every commit if we can get away with it.
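For reference, the suggested change would be a one-line decorator on the test; a minimal sketch, assuming the test suite already registers a slow marker:

import pytest

@pytest.mark.slow  # hypothetical opt-out from the fast CI path, as suggested in the review
def test_hf_ludwig_model_auto_transformers(tmpdir, csv_filename, pretrained_model_name_or_path):
    ...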

Comment on lines +174 to +205
input_features = [
    text_feature(
        preprocessing={
            "max_sequence_length": 10,
        },
        encoder={
            "vocab_size": 30,
            "min_len": 1,
            "type": "auto_transformer",
            "pretrained_model_name_or_path": pretrained_model_name_or_path,
            "use_pretrained": True,
        },
    )
]
output_features = [category_feature(decoder={"vocab_size": 2})]
rel_path = generate_data(input_features, output_features, csv_filename)

config = {
    "input_features": input_features,
    "output_features": output_features,
    TRAINER: {"train_steps": 1},
}
model = LudwigModel(config=config, backend=LocalTestBackend())

# Validates that the defaults associated with the encoder are compatible with Ludwig training.
with mock.patch(
    "ludwig.encoders.text_encoders.load_pretrained_hf_model_with_hub_fallback",
    side_effect=_load_pretrained_hf_model_no_weights,
):
    model.train(dataset=rel_path, output_directory=tmpdir)


@jeffkinnison (Contributor) commented:

Code here is a duplicate of the setup in test_hf_ludwig_model_reduce_options and test_hf_ludwig_model_e2e. Perhaps factor it out into a separate function and reuse it.
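A minimal sketch of how the duplicated setup could be factored out, assuming the existing test helpers (text_feature, category_feature, generate_data, TRAINER) keep their current signatures; the helper name is hypothetical:

def _hf_text_config_and_data(pretrained_model_name_or_path, csv_filename):
    # Shared setup for the HF encoder tests: a single auto_transformer text input,
    # a small category output, a tiny generated dataset, and a one-step trainer config.
    input_features = [
        text_feature(
            preprocessing={"max_sequence_length": 10},
            encoder={
                "vocab_size": 30,
                "min_len": 1,
                "type": "auto_transformer",
                "pretrained_model_name_or_path": pretrained_model_name_or_path,
                "use_pretrained": True,
            },
        )
    ]
    output_features = [category_feature(decoder={"vocab_size": 2})]
    rel_path = generate_data(input_features, output_features, csv_filename)
    config = {
        "input_features": input_features,
        "output_features": output_features,
        TRAINER: {"train_steps": 1},
    }
    return config, rel_path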

@tgaddair merged commit 46d08c7 into master on Apr 7, 2023
6 of 7 checks passed
@tgaddair deleted the fix-bloom branch on April 7, 2023 at 19:50
@pushkarraj commented:

Hey guys, I am working in the domain of large language models. Thanks for fixing this issue. I would love to contribute to the open source project; it would be a matter of pride to be part of this.

@pushkarraj commented:

Also, this issue is resolved. After trying the latest code, I got the following error:

AttributeError: 'BloomModel' object has no attribute 'n_head'

Labels: bug (Something isn't working), release-0.7 (Needs cherry-pick into 0.7 release branch)
Projects: None yet
Development: Successfully merging this pull request may close the issue "TypeError: forward() got an unexpected keyword argument 'token_type_ids'" (#3328)
5 participants