Improve Docstring location information #90

analog-cbarber · 2022-06-27T23:20:16Z

The Docstring init method currently does this:

        self.value: str = inspect.cleandoc(value.rstrip())
        self.lineno: int | None = lineno
        self.endlineno: int | None = endlineno

The net result leading whitespace is removed from each line. This is what we want,
but it now means that the there is no information about how much to add to get the
correct column offset.

It would be nice if the class could provide enough information to be able to associate
characters in the docstring with their line/column in the original source. Accordingly,
I suggest:

add a col_offset field indicating the size of the margin that was removed

(bonus feature) add a method to convert an offset into the cleaned up doc string into original line,col.
Something like:

def offset_to_line_col(self, offset:int) -> Tuple[int,int]:
    line = self.lineno + self.value.count('\n', 0, offset)
    col = self.col_offset + offset - self.value.rfind('\n', 0, offset) - 1
    return line, col

Implementing this would either require pulling implementation of cleandoc into the module to
expose the margin or comparing the cleaned up string with the original to infer the correct value.

The text was updated successfully, but these errors were encountered:

analog-cbarber · 2022-06-28T00:17:00Z

You can only infer the column offset for the line after the first based on this information alone. To get the
offset of the first line, it would have to be passed in (from the AST node).

analog-cbarber · 2022-06-28T00:20:16Z

And of course there are tabs. I wonder how many python projects actually allow tabs?

pawamoy · 2024-06-08T20:36:17Z

Since we store the original source code in the lines collection, we could use the docstring's lineno and endlineno to fetch the original docstring, including the first line. I'm not sure what the use-case is here though.

Would that work for you if we add a property to docstrings that returns the original value? Something like this:

@property
def original(self) -> str:
    return "\n".join(self.parent.lines_collection[self.parent.filepath][self.lineno-1:self.endlineno])

analog-cbarber · 2024-06-12T15:51:07Z

I think that might work. The use case is the ability to produce error messages that correctly report the line and column of tokens in the doc string so that IDEs can navigate directly to the write characters and/or highlight errors directly in the source.

pawamoy · 2024-06-12T17:20:14Z

I see, thanks, that's what I thought I remembered from our discussions about relative cross-references.

Lets do this then, I'll add this property to the Docstring class 🙂 PRs welcome too!

analog-cbarber added the feature New feature or request label Jun 27, 2022

pawamoy added the griffe: docstrings Related to docstring parsing label Mar 11, 2024

pawamoy self-assigned this Jun 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Docstring location information #90

Improve Docstring location information #90

analog-cbarber commented Jun 27, 2022 •

edited

analog-cbarber commented Jun 28, 2022 •

edited

analog-cbarber commented Jun 28, 2022

pawamoy commented Jun 8, 2024

analog-cbarber commented Jun 12, 2024

pawamoy commented Jun 12, 2024 •

edited

Improve Docstring location information #90

Improve Docstring location information #90

Comments

analog-cbarber commented Jun 27, 2022 • edited

analog-cbarber commented Jun 28, 2022 • edited

analog-cbarber commented Jun 28, 2022

pawamoy commented Jun 8, 2024

analog-cbarber commented Jun 12, 2024

pawamoy commented Jun 12, 2024 • edited

analog-cbarber commented Jun 27, 2022 •

edited

analog-cbarber commented Jun 28, 2022 •

edited

pawamoy commented Jun 12, 2024 •

edited