Bug fix: L016 ("Line is too long") should consider length of prior fixes #2587

barrywhart · 2022-02-09T18:27:34Z

Brief summary of the change made

Are there any other side effects of this change that we should be aware of?

Pull Request checklist

Please confirm you have completed any of the necessary steps below.
Included test cases to demonstrate any code changes, which may be one or more of the following:
- .yml rule test cases in test/fixtures/rules/std_rule_cases.
- .sql/.yml parser test cases in test/fixtures/dialects (note YML files can be auto generated with tox -e generate-fixture-yml).
- Full autofix test cases in test/fixtures/linter/autofix.
- Other.
Added appropriate documentation for the change.
Created GitHub issues for any relevant followup/future enhancements if appropriate.

barrywhart · 2022-02-09T18:29:05Z

src/sqlfluff/rules/L016.py

@@ -443,7 +443,7 @@ def _compute_source_length(cls, segments: Sequence[BaseSegment]) -> int:
            # source slice, and if we didn't correct for this, we'd count the
            # length of {{bi_ecommerce_orders}} roughly 10 times, resulting in
            # vast overcount of the source length.
-            if slice not in seen_slices:
+            if not segment.is_templated or slice not in seen_slices:


See the pre-existing block of comments above to understand the intent of this code. It turns out that this code was accidentally ignoring new segments (introduced by earlier fixes). By definition, these new segments cannot be templated (because SQLFluff itself introduced them during lint/fix, i.e. after templating).

Struggling to get my head round this, but isn't there a concern this will include any untemplated code that has been seen?

What's the precedence of not here?

Is this:

if not (segment.is_templated or slice not in seen_slices):

or

if (not segment.is_templated) or (slice not in seen_slices):

The second one.

codecov · 2022-02-09T18:57:31Z

Codecov Report

Merging #2587 (720909c) into main (c819e77) will not change coverage.
The diff coverage is 100.00%.

@@            Coverage Diff            @@
##              main     #2587   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files          163       163           
  Lines        11876     11876           
=========================================
  Hits         11876     11876

Impacted Files	Coverage Δ
src/sqlfluff/rules/L016.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c819e77...720909c. Read the comment docs.

tunetheweb · 2022-02-09T19:48:41Z

src/sqlfluff/rules/L016.py

+            if (
+                slice[0] == slice[1] and not segment.is_meta
+            ) or slice not in seen_slices:


This is much nicer! And easier to understand.

Do wonder if there's any other zero-length segments like meta? Are comments?

Not sure. Note that if they're 0-length, it's safe to process them anyway. I added is_meta more to clarify the intent. It works fine either way.

I checked a comment in the debugger. Length works as expected (not 0).

(Pdb) this_line[-1] <CommentSegment: ([L: 6, P: 1]) "-- The following is the slope of the regression line. Note that CORR (which is the Pearson's correlation"> (Pdb) this_line[-1].raw "-- The following is the slope of the regression line. Note that CORR (which is the Pearson's correlation" (Pdb) self._compute_segment_length(this_line[-1]) 104

barrywhart · 2022-02-09T19:49:57Z

src/sqlfluff/rules/L016.py

+            # segment case described above because new segments are never templated
+            # (because "sqlfluff fix" produced them, not the templater!).
+            if (
+                slice[0] == slice[1] and not segment.is_meta


@tunetheweb: This is a slightly different way of implementing a similar check. I used the Python debugger to run these two L016 test cases and that none of the (templated) segments pass this check.

test_compute_line_length_before_template_expansion_1: # Line 3 is fine before expansion. Too long after expansion is NOT considered # a violation. pass_str: | SELECT user_id FROM `{{bi_ecommerce_orders}}` {{table_at_job_start}} configs: core: dialect: bigquery templater: jinja: context: table_at_job_start: FOR SYSTEM_TIME AS OF CAST('2021-03-02T01:22:59+00:00' AS TIMESTAMP) bi_ecommerce_orders: bq-business-intelligence.user.ecommerce_orders test_compute_line_length_before_template_expansion_2: # Line 3 is too long before expansion. It's fine after expansion, but the rule # does not look at that. fail_str: | SELECT user_id FROM `{{bi_ecommerce_orders_bi_ecommerce_orders}}` AS {{table_alias_table_alias_table_alias_table_alias_table_alias_table_alias}} configs: core: dialect: bigquery templater: jinja: context: bi_ecommerce_orders_bi_ecommerce_orders: bq-business-intelligence.user.ecommerce_orders table_alias_table_alias_table_alias_table_alias_table_alias_table_alias: t

…ectly_after_linter_pass_1

tunetheweb

LGTM!

…ectly_after_linter_pass_1

Bug fix: L016 ("Line is too long") should consider length of prior fixes

07e4d34

barrywhart requested a review from tunetheweb February 9, 2022 18:27

barrywhart commented Feb 9, 2022

View reviewed changes

Barry Hart added 2 commits February 9, 2022 13:32

Comments

7adf4a1

Add missing file

10ac586

Barry Hart added 2 commits February 9, 2022 14:44

Tweak for PR review

eeec137

More tweak

0731832

tunetheweb reviewed Feb 9, 2022

View reviewed changes

barrywhart commented Feb 9, 2022

View reviewed changes

Merge branch 'main' into bhart-issue_2584_l016_computes_length_incorr…

53911c2

…ectly_after_linter_pass_1

tunetheweb approved these changes Feb 9, 2022

View reviewed changes

Merge branch 'main' into bhart-issue_2584_l016_computes_length_incorr…

720909c

…ectly_after_linter_pass_1

barrywhart merged commit c846a9e into sqlfluff:main Feb 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fix: L016 ("Line is too long") should consider length of prior fixes #2587

Bug fix: L016 ("Line is too long") should consider length of prior fixes #2587

barrywhart commented Feb 9, 2022 •

edited

barrywhart Feb 9, 2022

tunetheweb Feb 9, 2022

barrywhart Feb 9, 2022

codecov bot commented Feb 9, 2022 •

edited

tunetheweb Feb 9, 2022

barrywhart Feb 9, 2022

barrywhart Feb 9, 2022

barrywhart Feb 9, 2022

tunetheweb left a comment

Bug fix: L016 ("Line is too long") should consider length of prior fixes #2587

Bug fix: L016 ("Line is too long") should consider length of prior fixes #2587

Conversation

barrywhart commented Feb 9, 2022 • edited

Brief summary of the change made

Are there any other side effects of this change that we should be aware of?

Pull Request checklist

barrywhart Feb 9, 2022

Choose a reason for hiding this comment

tunetheweb Feb 9, 2022

Choose a reason for hiding this comment

barrywhart Feb 9, 2022

Choose a reason for hiding this comment

codecov bot commented Feb 9, 2022 • edited

Codecov Report

tunetheweb Feb 9, 2022

Choose a reason for hiding this comment

barrywhart Feb 9, 2022

Choose a reason for hiding this comment

barrywhart Feb 9, 2022

Choose a reason for hiding this comment

barrywhart Feb 9, 2022

Choose a reason for hiding this comment

tunetheweb left a comment

Choose a reason for hiding this comment

barrywhart commented Feb 9, 2022 •

edited

codecov bot commented Feb 9, 2022 •

edited