inconsistency for comment at start of file #44

ze42 · 2014-06-16T09:29:26Z

Hello,

There is a small inconsistency when parsing a comment at the start of a file.

If it starts with a newline, we alternate endl/comment:

parse('\n#foo\n#bar\n')
Out[7]: 
[{'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#foo'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#bar'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'}]

If it starts with a comment, we get an endl that includes the comment as formatting:

parse('#foo\n#bar\n')
[{'formatting': [{'formatting': [], 'type': 'comment', 'value': '#foo'}],
  'indent': '',
  'type': 'endl',
  'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#bar'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'}]

Not too sure what the proper behaviour is supposed to be, but I guess it should be something similar in both cases.


ibizaman · 2014-06-16T10:05:21Z

I tested your example and have the same behaviour.

IMHO the second case should give what you thought:

[{'formatting': [], 'type': 'comment', 'value': '#foo'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#bar'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'}]

Psycojoker · 2014-06-17T00:31:22Z

Thanks for reporting :)

Indeed, this should be fixed in this function https://github.com/Psycojoker/baron/blob/master/baron/grammator.py#L52-64

But be warned: this is the result of an edge case in the Baron approach. Since comments aren't part of the Python grammar and can appear anywhere, handling them is quite complex, because they need to be "unpacked" from the tokens that are present in the grammar node (in this case, they are always on an ENDL token).
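To illustrate what "unpacking" means on the output side, here is a minimal sketch that works on the JSON-like nodes shown earlier in this thread. The helper name `unpack_comments` is hypothetical and this is not the code in grammator.py; it only assumes the node shape printed by `parse` above.

```python
def unpack_comments(nodes):
    """Move comment nodes out of an endl node's 'formatting' list.

    Hypothetical helper for illustration only; it is not part of baron.
    """
    result = []
    for node in nodes:
        if node.get("type") == "endl":
            comments = [f for f in node["formatting"] if f.get("type") == "comment"]
            others = [f for f in node["formatting"] if f.get("type") != "comment"]
            # Comments go first because they precede the newline in the source.
            result.extend(comments)
            # Copy the endl node so the input list is left untouched.
            result.append(dict(node, formatting=others))
        else:
            result.append(node)
    return result


# Actual output of parse('#foo\n#bar\n') as reported in this issue.
actual = [
    {'formatting': [{'formatting': [], 'type': 'comment', 'value': '#foo'}],
     'indent': '', 'type': 'endl', 'value': '\n'},
    {'formatting': [], 'type': 'comment', 'value': '#bar'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
]

# Shape proposed earlier in the thread.
expected = [
    {'formatting': [], 'type': 'comment', 'value': '#foo'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
    {'formatting': [], 'type': 'comment', 'value': '#bar'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
]

print(unpack_comments(actual) == expected)
```

This only post-processes the result; it does not address where the comment gets attached in the first place.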

ibizaman · 2014-12-28T00:58:11Z

I think there should be a fix here: https://github.com/Psycojoker/baron/blob/master/baron/formatting_grouper.py#L120-122.

At that point, the COMMENT is placed inside an ENDL node although there is no ENDL node there. So the fix can't happen after that, because by then the information that no ENDL was there is lost.

In the second example given, the input sequence to the group_generator function is:

[
    ('COMMENT', '#foo'),
    ('ENDL', '\n'),
    ('COMMENT', '#bar'),
    ('ENDL', '\n'),
    ('ENDMARKER', ''),
    None
]

And its output is:

[
    ('ENDL', '\n', [('COMMENT', '#foo')], [('COMMENT', '#bar')]),
    ('ENDL', '\n'),
    ('ENDMARKER', '')
]

For comparison, here's the input and output with the first example (i.e. with a leading '\n'):

[
    ('ENDL', '\n'),
    ('COMMENT', '#foo'),
    ('ENDL', '\n'),
    ('COMMENT', '#bar'),
    ('ENDL', '\n'),
    ('ENDMARKER', ''),
    None
]
[
    ('ENDL', '\n', [], [('COMMENT', '#foo')]),
    ('ENDL', '\n', [], [('COMMENT', '#bar')]),
    ('ENDL', '\n'),
    ('ENDMARKER', '')
]
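The two input/output pairs above can be reproduced with a small toy model. This is a simplified sketch, not baron's `group_generator`: the function `toy_group` and its `groupable` parameter are made up for illustration, and the only rule it implements is "a groupable token next to an ENDL is attached to that ENDL".

```python
def toy_group(tokens, groupable=("SPACE", "COMMENT")):
    """Toy model of the grouping step; NOT baron's real implementation."""
    out = []
    i = 0
    while i < len(tokens):
        current = tokens[i]
        i += 1
        if current is None:  # the trailing None marks the end of the sequence
            break

        def peek():
            return tokens[i] if i < len(tokens) else None

        before, after = [], []
        # A groupable token directly before an ENDL is folded into that ENDL.
        if current[0] in groupable and peek() and peek()[0] == "ENDL":
            before = [current]
            current = tokens[i]
            i += 1
        # A groupable token directly after an ENDL is folded into it as well.
        if current[0] == "ENDL" and peek() and peek()[0] in groupable:
            after = [tokens[i]]
            i += 1

        if before or after:
            out.append((current[0], current[1], before, after))
        else:
            out.append(current)
    return out


leading_comment = [
    ('COMMENT', '#foo'), ('ENDL', '\n'),
    ('COMMENT', '#bar'), ('ENDL', '\n'),
    ('ENDMARKER', ''), None,
]
leading_newline = [('ENDL', '\n')] + leading_comment

print(toy_group(leading_comment))
print(toy_group(leading_newline))
# With COMMENT no longer groupable, every token stays separate.
print(toy_group(leading_comment, groupable=("SPACE",)))
```

In this model, the leading `#foo` ends up in the "before" slot of the first ENDL only when nothing precedes it, which is the asymmetry between the two examples.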

I tried to fix this but no luck for now...

gtors · 2016-08-30T13:56:22Z

I replaced group_generator (from formatting_grouper.py) with:

def group_generator(sequence):
    iterator = FlexibleIterator(sequence)
    current = None, None
    while not iterator.end():
        current = next(iterator)

        if current is None:
            return

        # Changed: "COMMENT" removed from this tuple, only "SPACE" is left.
        # (The trailing comma makes it a real tuple, not a substring test.)
        if current[0] in ("SPACE",) and iterator.show_next() and iterator.show_next()[0] in GROUP_SPACE_BEFORE:
            new_current = next(iterator)
            current = (new_current[0], new_current[1], [current])

        # Changed: "COMMENT" also removed from the tuple in the second condition below.
        if current[0] in GROUP_SPACE_AFTER + STRING and\
            (iterator.show_next() and iterator.show_next()[0] in ("SPACE",)) and\
    #... rest part of function

After that, parse('#foo\n#bar\n') works as expected:

[{'formatting': [], 'type': 'comment', 'value': '#foo'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#bar'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'}]

And dumps also works well:

dumps([{'formatting': [], 'type': 'comment', 'value': '#foo'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
 {'formatting': [], 'type': 'comment', 'value': '#bar'},
 {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'}])
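As a rough way to see why both node shapes can describe the same source text, here is a toy serializer over the nodes shown in this thread. It is a sketch under assumptions, not baron's `dumps`: the name `toy_dumps` is hypothetical, and the ordering (formatting, then value, then indent) is an assumption that only matters when `indent` is non-empty, which is never the case in these examples.

```python
def toy_dumps(nodes):
    """Toy serializer for the nodes in this thread; NOT baron's dumps."""
    parts = []
    for node in nodes:
        # Assumption: nested formatting nodes are emitted before the node's own value.
        parts.append(toy_dumps(node.get("formatting", [])))
        parts.append(node["value"])
        # Assumption: an endl's indent follows its newline.
        parts.append(node.get("indent", ""))
    return "".join(parts)


# Shape produced before the change: '#foo' is nested inside the first endl.
grouped = [
    {'formatting': [{'formatting': [], 'type': 'comment', 'value': '#foo'}],
     'indent': '', 'type': 'endl', 'value': '\n'},
    {'formatting': [], 'type': 'comment', 'value': '#bar'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
]

# Shape produced after the change: comments and endls alternate.
flat = [
    {'formatting': [], 'type': 'comment', 'value': '#foo'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
    {'formatting': [], 'type': 'comment', 'value': '#bar'},
    {'formatting': [], 'indent': '', 'type': 'endl', 'value': '\n'},
]

print(repr(toy_dumps(grouped)))
print(repr(toy_dumps(flat)))
```

Under these assumptions both shapes serialize to the same string, so the difference shows up for code that walks the tree rather than in the round-tripped text.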

So why does COMMENT need to be grouped at all?

ibizaman · 2016-08-30T16:29:18Z

@gtors nice catch! Thanks for figuring this out!

bootandy · 2016-12-04T21:04:06Z

I have just tried gtors' change and it seems to be correct. Is there any reason this fix doesn't have a pull request? If not, I will create one.

ibizaman · 2016-12-05T22:52:43Z

@bootandy no, just not having much time to devote to baron lately. Sorry about that. 😭

Psycojoker · 2016-12-06T02:11:57Z

Me neither :/

I'll try to do a small bugfix release next weekend with this fix and PyCQA/redbaron#118, but I can't promise.

RedBaron is a project that demands a lot of concentration, and I don't have the time I need to really get back into it right now, sorry about that.

rojaster · 2016-12-08T20:21:43Z

I fixed it in my fork of baron :) and hey, another guy found a similar bug

ibizaman · 2016-12-22T06:06:57Z

@bootandy Thanks for the PR. @ze42 Closing as it's fixed. Don't hesitate to reopen if you have any other issue.

ibizaman added the bug label Jun 16, 2014

Psycojoker mentioned this issue May 2, 2015

EndlNode instead of CommentNode PyCQA/redbaron#67

Closed

b5y mentioned this issue Jun 15, 2016

Comment not parsed PyCQA/redbaron#95

Closed

bootandy mentioned this issue Dec 7, 2016

Handle spaces before comments #88

Merged

ibizaman closed this as completed Dec 22, 2016
