Resolve errors for unknown binding should consider commented out code #105469

estebank · 2022-12-08T18:51:02Z

When refactoring code, it is not uncommon to comment out pieces of code to try alternatives. When doing so, it's easy to forget to uncomment at least one of the alternatives. Given code like

fn main() {
    let option = Some("");
    // let Some(string) = x else { return; };
    println!("{string}");
}

the compiler should emit output similar to

error[E0425]: cannot find value `string` in this scope
 --> src/main.rs:4:16
  |
3 |     // let Some(string) = x else { return; };
  |     --          ------ you might have meant to use this commented out binding
  |     |
  |     note the comment starts here
4 |     println!("{string}");
  |                ^^^^^^ not found in this scope


Ezrashaw · 2023-01-11T06:16:04Z

@rustbot claim

I'll have a go at this, but I need some mentoring instructions. This seems to require parsing comments?

Ezrashaw · 2023-01-11T08:04:09Z

@estebank I've been thinking about this and I think that perhaps the best way to do this would be as follows:

In the lexer, if we see a comment, continue lexing until we see another comment (as in two // in one line) or an EOL. We then place/mark these tokens separately.
In the parser, we try to parse these tokens normally, again marking AST nodes separately.
In any other parts of the compiler, like during variable resolution, we could then check to see if any variables exist in our special AST nodes (just like how this suggestion normally works).

Obviously we immediately back off silently if bad syntax is encountered (think Option, not Result). I wonder about the perf impact this could have; I assume it would be small if we back off a lot.

estebank · 2023-01-11T20:33:29Z

@Ezrashaw I think that a best effort of keeping an AST node (or a spanned list of comments per file) for comments and a simple string search might be enough for this, as any commented out code is bound to be syntactically suspect (and whatever we add has to be cheap both in execution and maintenance burden).
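
To make the "spanned list of comments plus a simple string search" idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: `CommentSpan`, `collect_line_comments` and `find_ident` are hypothetical names, not rustc APIs, and the comment collection is deliberately naive (it ignores block comments and would misfire on `//` inside a string literal).

```rust
/// A `//` comment together with its position in the file.
/// Hypothetical type for illustration; not how rustc represents spans.
#[derive(Debug, PartialEq)]
struct CommentSpan<'a> {
    line: usize,   // 1-based line number
    col: usize,    // 1-based byte column of the `//`
    text: &'a str, // comment body, i.e. everything after the `//`
}

/// Collects line comments by scanning each line for `//`.
/// Naive on purpose: no handling of block comments or string literals.
fn collect_line_comments(src: &str) -> Vec<CommentSpan<'_>> {
    src.lines()
        .enumerate()
        .filter_map(|(i, line)| {
            let start = line.find("//")?;
            Some(CommentSpan {
                line: i + 1,
                col: start + 1,
                text: &line[start + 2..],
            })
        })
        .collect()
}

fn is_ident_char(c: char) -> bool {
    c.is_alphanumeric() || c == '_'
}

/// Whole-word search for `name` inside one comment.
/// Returns the 1-based byte column of the match within the source line.
fn find_ident(comment: &CommentSpan<'_>, name: &str) -> Option<usize> {
    let text = comment.text;
    for (pos, _) in text.match_indices(name) {
        // Reject matches that are part of a longer identifier.
        let before_ok = text[..pos]
            .chars()
            .next_back()
            .map_or(true, |c| !is_ident_char(c));
        let after_ok = text[pos + name.len()..]
            .chars()
            .next()
            .map_or(true, |c| !is_ident_char(c));
        if before_ok && after_ok {
            // `col` points at `//`, so skip those two bytes.
            return Some(comment.col + 2 + pos);
        }
    }
    None
}
```

Run against the example at the top of this issue, this finds `string` in the comment on line 3 at column 17, which lines up with the `------` underline in the proposed diagnostic. A search for `str` finds nothing, because the only occurrence is part of the longer identifier.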

Ezrashaw · 2023-01-11T21:06:00Z

> @Ezrashaw I think that a best effort of keeping an AST node (or a spanned list of comments per file) for comments and a simple string search might be enough for this, as any commented out code is bound to be syntactically suspect (and whatever we add has to be cheap both in execution and maintenance burden).

Right, so all I'm suggesting in addition to that is that we use AST nodes (for easy scoping etc.) and that instead of a finicky string search, we just try to parse a let statement (which can be expanded later). AFAIK, parsing is a very small part of the compiler's runtime, and just parsing a few comments would have a small effect even on that.

Edit: the maintenance burden of this would be small, we can just reuse existing machinery.

estebank · 2023-01-12T06:32:45Z

My concern is more around the state of the code than anything else: the commented out code might not be parseable by any reasonable parser. But we could tokenize the comments to aid us, and have simplified parsing for things like let ident or ident = and nothing else.
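
A rough sketch of what "tokenize the comments, then only recognize `let ident` or `ident =`" could look like. This is an illustration under stated assumptions, not rustc's lexer: `tokenize`, `is_ident` and `candidate_bindings` are hypothetical names, and the token model (identifier-like words plus single punctuation characters) is far cruder than real Rust lexing.

```rust
/// Splits comment text into identifier-like words and single
/// punctuation characters, dropping whitespace.
fn tokenize(text: &str) -> Vec<&str> {
    let mut tokens = Vec::new();
    let mut word_start: Option<usize> = None;
    for (i, c) in text.char_indices() {
        if c.is_alphanumeric() || c == '_' {
            if word_start.is_none() {
                word_start = Some(i);
            }
            continue;
        }
        // A non-word character ends the current word, if any.
        if let Some(start) = word_start.take() {
            tokens.push(&text[start..i]);
        }
        if !c.is_whitespace() {
            tokens.push(&text[i..i + c.len_utf8()]);
        }
    }
    if let Some(start) = word_start {
        tokens.push(&text[start..]);
    }
    tokens
}

/// True if the token starts like an identifier (so `42` is rejected).
fn is_ident(tok: &str) -> bool {
    tok.chars()
        .next()
        .map_or(false, |c| c.is_alphabetic() || c == '_')
}

/// Names that look like they are bound in the comment, using only two
/// shapes: `let ident` (or `let mut ident`) and `ident =`.
fn candidate_bindings(text: &str) -> Vec<&str> {
    let toks = tokenize(text);
    let mut found: Vec<&str> = Vec::new();
    for i in 0..toks.len() {
        let candidate = if toks[i] == "let" {
            // `let ident` or `let mut ident`
            let j = if toks.get(i + 1) == Some(&"mut") { i + 2 } else { i + 1 };
            toks.get(j).copied()
        } else if toks.get(i + 1) == Some(&"=")
            && !matches!(toks.get(i + 2), Some(&"=") | Some(&">"))
        {
            // `ident =`, but not `ident ==` or `ident =>`
            Some(toks[i])
        } else {
            None
        };
        if let Some(name) = candidate {
            if is_ident(name) && name != "mut" && !found.contains(&name) {
                found.push(name);
            }
        }
    }
    found
}
```

For the comment body ` qux(); let bar = 42;` this yields `bar`. It also shows the limits of the simplified approach: for ` let Some(string) = x else { return; };` it reports `Some` rather than `string`, because the binding sits inside a pattern rather than directly after `let`.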

Ezrashaw · 2023-01-12T06:55:08Z

@estebank Hmm, why not just use the normal parsing? I don't really see the reason not to. If the simplified parsing fails, then we ignore it silently. Why not do the same with proper parsing? Keep in mind that it's only parsing; we aren't doing type checking or anything.

estebank · 2023-01-12T18:38:30Z

My concern is that you could have something like

fn foo(
//    bar: i32,
) {
    // qux(); let bar = 42;
...

That would require you to try multiple alternative parses of the comments, including some that only work within other items, like the arg parse. That being said, if only a handful of cases are handled I'd still be happy.

Ezrashaw · 2023-01-12T20:02:51Z

@estebank Yeah, I think for now that your example probably won't be handled, although we could just do completely normal parsing. My main issue is how to decide whether a comment is code or junk (ideally in the lexer), so that we don't have to even parse it.
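
One possible shape for a cheap "code or junk" pre-filter, sketched under loud assumptions: nothing in this thread settles on a heuristic, and `looks_like_code` with its two rules (typical statement endings, a handful of leading keywords) is purely hypothetical. It only inspects the raw comment text, so it could run before any tokenizing or parsing.

```rust
/// Hypothetical pre-filter: guesses whether a comment body is
/// commented-out code rather than prose. Not part of rustc.
fn looks_like_code(comment_body: &str) -> bool {
    let body = comment_body.trim();
    if body.is_empty() {
        return false;
    }
    // Statements, blocks and argument lists usually end in one of these.
    let code_ending = body.ends_with(|c: char| matches!(c, ';' | '{' | '}' | ','));
    // A few keywords that commonly start a statement or item.
    let first_word = body.split_whitespace().next().unwrap_or("");
    let keyword_start = matches!(
        first_word,
        "let" | "fn" | "if" | "for" | "while" | "match" | "return"
    );
    code_ending || keyword_start
}
```

A heuristic like this will misfire in both directions. Prose such as "if in doubt, ask" is flagged as code because of the leading `if`, and code without a typical ending or keyword is missed. Since a wrong guess would only cost a wasted parse attempt or a missed suggestion, that may be acceptable for a best-effort diagnostic.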

rustbot assigned Ezrashaw Jan 11, 2023

Dylan-DPC unassigned Ezrashaw Sep 9, 2024
