Allow `#` to be used as an inline comment tag #1401

dylanahsmith · 2021-02-19T16:31:28Z

Fixes #1393
Depends on Shopify/liquid-c#144

@Shopify/guardians-of-the-liquid

Allow # to be used as an inline comment tag, ignoring everything after it in the tag.

For example, instead of

{% comment %}Single line comment{% endcomment %}

you can use

{% # Single line comment %}

Spaces around # aren't necessary, so this also works

{%#This also works %}

Using this after another tag (e.g. {% assign discouraged = true # This comment is parsed as part of the assign tag %}) would cause it to be parsed as part of that tag, so is not recommended since we can't ensure consistency between tags and backwards compatibility (especially considering that there are application defined tags). Instead, use a separate tag

{% assign recommended = true  %}{%# Comment on this tag -%}

Whitespace control characters work the same way as other tags.

This inline comment also works in liquid tags, so instead of

{% liquid
  echo "..."
  comment Single line comment
  endcomment
  echo "..."
%}

you can use

{% liquid
  echo "..."
  # Works well for single line comments
  # or even multi-line comments
  echo "..."
%}
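As a rough illustration of the liquid tag behaviour described above, here is a minimal sketch in plain Ruby that treats each line of a liquid tag body as one tag and accepts `#` as a tag name. The names `LINE_TAG` and `parse_liquid_body` are hypothetical and exist only for this sketch; this is not the code from the PR.

```ruby
# Toy model of a {% liquid %} body: one tag per line, with `#` accepted as a
# tag name. LINE_TAG and parse_liquid_body are hypothetical names.
LINE_TAG = /\A\s*(#|\w+)\s*(.*?)\s*\z/

def parse_liquid_body(body)
  body.each_line.filter_map do |line|
    next if line.strip.empty?            # blank lines carry no tag

    match = LINE_TAG.match(line)
    raise ArgumentError, "not a tag: #{line.inspect}" unless match

    name, markup = match.captures
    next if name == '#'                  # inline comment: ignore the rest of the line

    [name, markup]
  end
end

body = <<~LIQUID
  echo "..."
  # Works well for single line comments
  # or even multi-line comments
  echo "..."
LIQUID

p parse_liquid_body(body)
# => [["echo", "\"...\""], ["echo", "\"...\""]]
```

In this toy model a multi-line comment is simply several consecutive `#` lines, each dropped independently.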

pushrax

{% assign discouraged = true # This comment is parsed as part of the assign tag %}

I can definitely picture people learning about comment syntax by example and then assuming this would work. We need some kind of strategy there.

dylanahsmith · 2021-02-19T17:45:24Z

Perhaps we should assess whether # can be supported in all standard and Shopify tags to determine if we can support this consistently or whether it would result in any ambiguities. I think it is likely that we can add support for it. If so, I think developers would appreciate us adding support for that.

From an implementation perspective, we wouldn't be able to just ignore everything after the first # so we don't break syntax that supports # (e.g. a quoted string), but we could add a method like Liquid::Parser#consume_markup_end that can raise a parse error if it doesn't find an inline comment or end of string.
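To make the idea concrete, here is a minimal sketch of what such an end-of-markup check might look like, written in plain Ruby with `StringScanner`. Everything here is an assumption for illustration: the standalone `consume_markup_end` function stands in for the proposed `Liquid::Parser#consume_markup_end`, and `parse_assign` and `MarkupError` are made-up names, not Liquid APIs.

```ruby
require 'strscan'

class MarkupError < StandardError; end

# Hypothetical stand-in for the proposed Liquid::Parser#consume_markup_end:
# after a tag has consumed the markup it understands, only whitespace and an
# optional `# comment` may remain.
def consume_markup_end(scanner)
  scanner.skip(/\s+/)
  scanner.skip(/#.*/m)                   # an inline comment runs to the end
  return if scanner.eos?

  raise MarkupError, "Unexpected markup: #{scanner.rest.inspect}"
end

# Toy `assign` parser: name = value, where value is a quoted string or a word.
def parse_assign(markup)
  scanner = StringScanner.new(markup)
  unless scanner.scan(/\s*(\w+)\s*=\s*("[^"]*"|'[^']*'|\w+)/)
    raise MarkupError, "Invalid assign: #{markup.inspect}"
  end

  name = scanner[1]
  value = scanner[2]
  consume_markup_end(scanner)
  [name, value]
end

p parse_assign('discouraged = true # trailing comment') # => ["discouraged", "true"]
p parse_assign('title = "issue #1393"')                 # => ["title", "\"issue #1393\""]
```

Because the tag's own parser consumes the quoted string first, a `#` inside it never reaches the comment check, while leftover markup that is not a comment still raises.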

tjoyal · 2021-02-22T13:57:18Z

Same feedback:

  def test_conditional
    assert_template_result('true', '{% if true # comment %}true{% else %}false{% endif %}')
    # InlineCommentTest#test_conditional:
    # Liquid::SyntaxError: Liquid syntax error (line 1): Unexpected character # in "true # comment"
  end

  def test_assign
    assert_template_result('true', '{% assign test = true # comment %}{{test}}')
    # InlineCommentTest#test_assign:
    # Liquid::SyntaxError: Liquid syntax error (line 1): Unexpected character # in "{{true # comment }}"
  end

Not being familiar with the parsing process, is the only option to implement comments as a tag? Would we want to have a way to strip these completely when generating an intermediary representation so they are not part of the final execution path? I would rather not have tags even be aware there were comments in the first place.

dylanahsmith · 2021-02-22T14:20:56Z

Not being familiar with the parsing process, is the only option to implement comments as a tag?

Comments are already implemented as tags, so doing the same for inline comments makes the changes simpler and less likely to introduce a breaking change.

Basically, I'm avoiding introducing a special case where there isn't a 1-1 correspondence between the tag syntax and a tag node. Breaking that would make the implementation more awkward. For the same reason, supporting inline comments at the end of other tags wouldn't introduce an InlineComment tag node.

Would we want to have a way to strip these completely when generating an intermediary representation so they are not part of the final execution path?

We can remove these when compiling to liquid-c VM code as will be done for the Comment tag in Shopify/liquid-c#96

I would rather not have tags even be aware there were comments in the first place.

This is just an implementation detail that should be opaque to all the other tags, since tag classes shouldn't be coupled to each other

tjoyal · 2021-02-23T12:30:43Z

If # is a tag and we make that clear, even if it is somewhat confusing at first, it does map to the behaviour @pushrax and I are describing. Shall we say it works as expected and move forward with the proposal?

dylanahsmith · 2021-02-23T13:40:58Z

Do you think we should ship this as is then iterate by trying to follow-up with support for comments at the end of tags? Or should I hold off on that exploration?

tjoyal · 2021-02-23T16:00:53Z

If we consider it a tag, then maybe it's even better (easier to form a mental model) that you cannot add it at the end of another tag.

The spaces around # aren't necessary, so this also works

Looking back at this, should it be disallowed then?

pushrax · 2021-02-23T22:07:40Z

Spaces around tags aren't strictly required in any case, e.g. {%endfor%} is valid. The internal syntax of some tags requires spaces though, e.g. {%for foo in bar%}.

I don't really have much of an opinion on comments at the end of tags, as long as it crashes gracefully.

tjoyal · 2021-02-23T23:05:01Z

My feedback was not about a space before the tag, but after it.

{% assign_some_extra_chars test = 1 %} is obviously invalid
My take is that {% #test %} should also be invalid and require a space (the parser behaves differently because of the way the regex is written).

dylanahsmith · 2021-02-24T14:24:29Z

My take is that {% #test %} should also be invalid and require a space (the parser behaves differently because of the way the regex is written).

I guess it makes sense to have that be invalid until we add support for comments at the end of tags.

dylanahsmith · 2021-02-24T15:35:51Z

I've updated the code to require a space after # and updated the PR description accordingly.

tjoyal · 2021-02-24T20:51:28Z

Gemfile

@@ -22,6 +22,6 @@ group :test do
  gem 'rubocop-performance', require: false

  platform :mri, :truffleruby do
-    gem 'liquid-c', github: 'Shopify/liquid-c', ref: 'master'
+    gem 'liquid-c', github: 'Shopify/liquid-c', ref: 'inline-comment'


What is the plan here? Will you merge it and then remove the current change so it points to master?

The liquid-c change can be safely merged first, then I'll update this to point back to master before merging.

lib/liquid/block_body.rb

pushrax · 2021-02-24T21:25:03Z

{% assign_some_extra_chars test = 1 %} is obviously invalid

That is because the underscores are part of the tag name and there is no tag called assign_some_extra_chars. {% foo#bar %} or {% foo!bar %} isn't invalid if the implementation of foo supports that syntax.

require 'liquid'

# Minimal custom tag that renders its raw markup unchanged.
class Foo < Liquid::Tag
  def initialize(tag_name, arg, tokens)
    @arg = arg
  end

  def render(_)
    @arg.to_s
  end
end

Liquid::Template.register_tag('foo', Foo)

puts Liquid::Template.parse("{%foo#bar%}").render # => "#bar"
puts Liquid::Template.parse("{%foo!bar%}").render # => "!bar"

I think most people would expect the comment syntax to work without the space.

dylanahsmith · 2021-02-24T22:46:34Z

Oh right, spaces after the tag name are just needed to separate it from other word characters, since it just requires a non-word character to end the tag.
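The boundary rule discussed here can be sketched with a small regular expression in plain Ruby. This is a toy pattern written for illustration, assuming the tag name is either `#` or a run of word characters; `TAG_TOKEN` and `split_tag` are hypothetical names and this is not Liquid's actual tokenizer.

```ruby
# Toy tag tokenizer: the name is either `#` or a run of word characters, and
# the first non-word character ends it. Optional `-` is whitespace control.
TAG_TOKEN = /\A\{%-?\s*(#|\w+)\s*(.*?)\s*-?%\}\z/m

# Returns [tag_name, markup], or nil when the token does not look like a tag.
def split_tag(token)
  match = TAG_TOKEN.match(token)
  match && match.captures
end

p split_tag('{%endfor%}')                             # => ["endfor", ""]
p split_tag('{%foo#bar%}')                            # => ["foo", "#bar"]
p split_tag('{% assign_some_extra_chars test = 1 %}') # => ["assign_some_extra_chars", "test = 1"]
p split_tag('{%#This also works %}')                  # => ["#", "This also works"]
```

Underscores are word characters, so `assign_some_extra_chars` is read as one (unknown) tag name, whereas `#` and `!` end the name and become part of the markup handed to the tag.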

I've updated the code to require a space after # and updated the PR description accordingly.

I've removed that last commit

tjoyal

I think we are good now! I'd love to see @pushrax's case in the tests, since if anything is going to break in future iterations I think it will be that (since tag registrations are global, I'm not sure it can be done without some tinkering).

samdoiron · 2021-02-25T21:47:21Z

test/integration/tags/inline_comment_test.rb

+  end
+
+  def test_comment_inline_tag
+    assert_template_result('ok', '{% echo "ok" # output something from a tag %}')


Worth noting that this removes a lot of the benefit of treating # as its own tag, because it means the concern leaks into the syntax of other tags too.

This also has implications for the API, since we'd have to either

1. Do pre-processing on all tag input before passing it, which is a breaking change in a strict sense.
2. Only support it for our own tags, which could lead to inconsistencies for other clients.

The second of these may still be the right call, but special-casing those tags could be more complex in Liquid-C.

What exactly is the benefit of treating it as a tag? My (limited!) experience with parser creation suggests that comments are usually dropped very early in the process of parsing/lexing and this feels like the right approach for liquid as well.

timdmackey · 2021-03-10T22:57:32Z

Using this after another tag (e.g. {% assign discouraged = true # This comment is parsed as part of the assign tag %}) would cause it to be parsed as part of that tag, so is not recommended since we can't ensure consistency between tags and backwards compatibility (especially considering that there are application defined tags).

Do you think we should ship this as is then iterate by trying to follow-up with support for comments at the end of tags? Or should I hold off on that exploration?

I guess it makes sense to have that be invalid until we add support for comments at the end of tags.

I've been thinking about how to extend inline comments to all tags, and it occurred to me—if the plan is to implement # as a new tag to avoid breaking changes, what if # comments were *also* implemented as a filter? If the filter ignores any "arguments" (text following the # key), and if the : isn't strictly necessary, then you could do stuff like this:

{%
  assign myvar = "hello world"
  | # add exclamation marks
  | append: "!!!!"
  | # convert to JSON format
  | json
%} {{ myvar }}

{% assign myvar = "hello world" | # add question marks: | append: "????" | # convert to JSON format: | json %} {{ myvar }}

{{
  product
  | # get image thumbnail:
  | img_url: "240x240"
}}

{{ product | # get fullsize image: | img_url: "1600x1600" }}

Interestingly, all 4 of these examples already render without issue in Shopify's theme editor due to its lax error handling. They only raise a "potential issue" warning inside of a development shop:

dylanahsmith · 2021-03-10T23:55:24Z

Not all tags support filters, so adding a | character before a tag doesn't help at all.

Interestingly, all 4 of these examples already render without issue in Shopify's theme editor due to its lax error handling.

Yeah, the lax parsing makes it hard to extend the syntax in a backwards compatible way, since avoiding breaking working liquid code (without usage testing) requires finding something that is currently considered an error. Essentially, syntax errors reserve syntax for future features. Instead, the lax parser just scans with a regular expression to find the filters and skips over anything that doesn't match, including the # character, so we risk breaking liquid code by changing that behaviour from skipping just the # character to skipping everything after it on the line.

timdmackey · 2021-03-11T00:06:49Z

we risk breaking liquid code by changing that behaviour from skipping just the # character to skipping everything after it on the line.

Not sure this is helpful in the overall picture, but currently doesn't the parser just skip everything until the next pipe character? It appears to me that the parser is failing to match the first word (#), and so it jumps ahead to the next filter. Here's the output of my above code:

 "hello world!!!!"

 "hello world????"

//cdn.shopify.com/shopifycloud/shopify/assets/no-image-2048-5e88c1b20e087fb7bbe9a3771824e743c244f437e4f8ba93bbf7b11b53f7824c_240x240.gif

//cdn.shopify.com/shopifycloud/shopify/assets/no-image-2048-5e88c1b20e087fb7bbe9a3771824e743c244f437e4f8ba93bbf7b11b53f7824c_1600x1600.gif

dylanahsmith · 2021-03-11T00:25:54Z

currently doesn't the parser just skip everything until the next pipe character?

It does that for the first pipe character. After that, it skips over anything that doesn't match Liquid::Variable::FilterParser.

Not sure this is helpful in the overall picture

What would help would be if less invalid code was saved in Shopify, so that it doesn't look like code depends on these quirks. This invalid liquid can be tested using this gem directly (e.g. ruby -rliquid -e 'print Liquid::Template.parse(STDIN.read).render' < test.liquid).

dylanahsmith · 2021-03-29T22:15:31Z

test/integration/tags/inline_comment_test.rb

+  def test_test_syntax_error
+    assert_template_result('fail', '{% #This doesnt work %}')
+
+    assert false 
+  rescue 
+    # ok good
+  end


If I understand correctly, this test is saying that a space is required after #.

dylanahsmith · 2021-03-29T22:27:14Z

test/integration/tags/inline_comment_test.rb

+  def test_tag_ws_stripping
+    assert_template_result('', '   {%#- This text gets ignored -#%}    ')


I don't understand the reason you are proposing supporting this style. Does it have any semantic difference?

Other than this test case, it seems like # is supposed to treat the rest of the line as a comment, but here it seems to be integrated into the start tag. So does this have different semantics for multi-line comments like the following?

{%#- This is clearly a comment
echo "but how about this line?"
-#%}

I'm also unsure if you are proposing that the -#%} end tag should have any significance over -%}.

colinbendell · 2021-03-30T13:24:35Z

Worth noting that Nunjucks (a popular templating language in SSG that is a near relative to liquid) uses the {# bla #} convention. While Nunjucks has had a different evolutionary path than liquid, it might be best, for developer ergonomics, to adopt the existing pattern instead of introducing a new pattern.

[sidebar: webstorm integration is really convenient here. comment and uncomment are really easy]

ADTC · 2021-04-01T17:03:14Z

Interestingly, all 4 of these examples already render without issue in Shopify's theme editor due to its lax error handling. They only raise a "potential issue" warning inside of a development shop:

@timdmackey but does it actually render anything when you load the page? Sometimes, Liquid errors appear dumped in the rendered page even if the theme code editor has no complaints. ~~I'd love to see your comment updated with a screenshot of that~~ 🙂

PS: Actually this works because you can pass any random set of characters as a filter, and the engine will just ignore that unrecognized filter and move on to the next. It has nothing to do with # being a tag. You can try | lorem ipsum dolor sit amet | append: 'that works'

Interestingly, this may allow us to add comments anywhere already by using unrecognized filter as a hack: assign i = i + 1 | incrementing the variable or for item in cart.items | iterating over cart items 🤷 (not sure about the second example though).

Update: It works as I thought it would. But you can't use let's because the apostrophe seems to wreck it.
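The "unrecognized filter is ignored" behaviour described in this comment can be modelled with a toy pipeline in plain Ruby. This is only a sketch under simplifying assumptions: it splits on every `|` and ignores quoting rules entirely, so it does not model the apostrophe problem, and `KNOWN_FILTERS` and `lax_render` are made-up names rather than anything from Liquid.

```ruby
# Toy filter pipeline; KNOWN_FILTERS and lax_render are hypothetical.
KNOWN_FILTERS = {
  'append' => ->(input, arg) { input + arg.to_s },
  'upcase' => ->(input, _arg) { input.upcase }
}.freeze

def lax_render(expression)
  value, *chunks = expression.split('|').map(&:strip)
  result = value.delete_prefix('"').delete_suffix('"')

  chunks.each do |chunk|
    name = chunk[/\w+/]                  # first word-like run is taken as the filter name
    filter = KNOWN_FILTERS[name]
    next unless filter                   # unknown filter: pass the input through unchanged

    arg = chunk[/"([^"]*)"/, 1]          # optional double-quoted argument
    result = filter.call(result, arg)
  end

  result
end

puts lax_render('"hello world" | # add exclamation marks | append: "!!!!"')
# prints: hello world!!!!
puts lax_render('"hi" | lorem ipsum dolor sit amet | append: " that works"')
# prints: hi that works
```

In this toy model the `#` plays no role at all: the chunk is skipped only because its first word is not a known filter, so a "comment" whose first word happened to name a real filter would be applied instead of ignored.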

dylanahsmith · 2022-04-22T16:31:07Z

Closing in favour of #1498

pushrax approved these changes Feb 19, 2021

EricFromCanada mentioned this pull request Feb 19, 2021

Liquid: update for 5.0.0 rouge-ruby/rouge#1681

Merged

ADTC mentioned this pull request Feb 21, 2021

Request for suggestions: Liquid comments #1393

Closed

tjoyal approved these changes Feb 23, 2021

dylanahsmith added 4 commits February 24, 2021 09:28

Use liquid-c inline-comment branch until it is merged

31f7be8

Implement the inline comment tag

fff6c56

Add tests for text immediately following liquid tag

6ddfaec

Add changelog entry

5e8e5e8

dylanahsmith force-pushed the inline-comment branch from 5b3ec2e to d889127 Compare February 24, 2021 14:50

tjoyal reviewed Feb 24, 2021

dylanahsmith force-pushed the inline-comment branch from d889127 to 5e8e5e8 Compare February 24, 2021 22:45

tjoyal approved these changes Feb 25, 2021

exploring the comment syntax a bit

ab45548

samdoiron reviewed Feb 25, 2021

Reserve future support for comment line before a tag name

0fc45ca

dylanahsmith force-pushed the inline-comment branch from 11ce2a6 to 0fc45ca Compare February 26, 2021 12:00

dylanahsmith commented Mar 29, 2021

charlespwd mentioned this pull request Dec 16, 2021

Add # inline comment tag. #1498

Merged

dylanahsmith closed this Apr 22, 2022
