
Filter entities from comparator combiner when not listed in input_features #3251

Merged

merged 4 commits into master on Mar 15, 2023

Conversation

tgaddair
Collaborator

No description provided.

@ksbrar
Collaborator

ksbrar commented Mar 15, 2023

Wondering why this behavior is not enforced by this validation check:

if sorted(config.combiner.entity_1 + config.combiner.entity_2) != sorted(input_feature_names):

@tgaddair
Collaborator Author

@ksbrar that would validate the config, but this PR makes the behavior more permissive by letting these configs go through.
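The permissive behavior described here can be sketched roughly as follows. This is a hypothetical helper, not Ludwig's actual implementation; the function and parameter names are invented for illustration:

```python
def filter_combiner_entities(entities, input_feature_names):
    """Drop combiner entities that are not declared input features.

    Hypothetical sketch of the permissive behavior: rather than rejecting
    the config when an entity is missing from input_features, the extra
    entity is silently filtered out.
    """
    allowed = set(input_feature_names)
    # Preserve the original entity ordering while filtering.
    return [e for e in entities if e in allowed]
```

For example, an entity list `["a", "b", "c"]` against input features `["a", "b"]` would be filtered down to `["a", "b"]` instead of raising a validation error.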

@ksbrar
Collaborator

ksbrar commented Mar 15, 2023

Ah gotcha, was thinking backwards - purpose IS to make the parameter more permissive 👍

},
}

with pytest.raises(ConfigValidationError) if not expected else contextlib.nullcontext():
Collaborator

Maximally parameterizing tests to minimize code duplication is tempting, but it can make tests more difficult to read and understand when it involves adding logic to the test itself.

See "Avoid logic in tests" from Microsoft's testing best practices guide.

The behavior we are testing for is more readable/understandable as three separate tests, imo:

def test_comparator_combiner_raises_missing_features():
def test_comparator_combiner_removes_extra_features():
def test_comparator_combiner_raises_duplicated_features():

Collaborator Author

Can't say I agree. I was thinking of adding a handful of additional tests to this. Parametrizing makes this super easy. With separate tests, in practice, no one's ever going to bother.
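For context, the conditional-context-manager pattern under discussion looks roughly like this minimal, self-contained sketch. The `validate` function and the parameter cases are invented stand-ins, not Ludwig code:

```python
import contextlib

import pytest


def validate(value):
    # Toy stand-in for config validation: reject negative inputs.
    if value < 0:
        raise ValueError("negative value")


@pytest.mark.parametrize("value,expected", [(1, True), (-1, False)])
def test_validate(value, expected):
    # One parametrized test covers both the passing and failing cases:
    # expect an error only when the input is invalid, otherwise use a
    # no-op context manager so the call runs unguarded.
    ctx = pytest.raises(ValueError) if not expected else contextlib.nullcontext()
    with ctx:
        validate(value)
```

The tradeoff both reviewers are weighing: this keeps each new case down to one parametrize tuple, at the cost of a small amount of branching logic inside the test body.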

Collaborator

The good thing is that the relatively simple logic in this test means it's already pretty readable.

Now that there are 7 tests, I would agree that splitting into separate tests would be too much code duplication and not worthwhile.

As a nit: would you be able to add a quick comment to the different lines explaining what each case is testing for?

@arnavgarg1 arnavgarg1 added the release-0.7 Needs cherry-pick into 0.7 release branch label Mar 15, 2023
@github-actions

Unit Test Results

6 files ±0   6 suites ±0   7h 29m 6s ⏱️ +30m 36s
4 094 tests +7   4 051 ✔️ +7   43 💤 ±0   0 ±0
12 239 runs −43   12 109 ✔️ −38   130 💤 −5   0 ±0

Results for commit 677f23a. ± Comparison against base commit d5f61eb.

@tgaddair tgaddair merged commit 0bef777 into master Mar 15, 2023
@tgaddair tgaddair deleted the filter-comparator branch March 15, 2023 21:27