RFC: New "break this group if you have to, but try to break its parents first" doc command? #3014

suchipi · 2017-10-12T03:25:03Z

I've seen a lot of issues opened lately where a single item in braces, parens, angle brackets, or square brackets gets left on a line alone, and it looks odd. In each instance, it seems that there is an alternative formatting option that looks better and still fits within the line limit.

An idea popped into my head the other day for a way we could address this- if we could consider breaking of certain groups "undesirable", then we could take places where people try to avoid putting breaks (TypeParameterDeclaration, computed MemberExpression property, arrow function expression parameters, etc) and use a new "undesirable-to-break group" command there.

The idea behind the "undesirable-to-break group" is, when a break occurs, and we're iterating up through the parents to break them, if we find an undesirable-to-break group, we skip over it and try breaking the next parent group, and see if that makes everything fit within the desired print-width. If we get all the way up to <wherever we decide to stop- the root node, or a statement context?>, and the part of the code we were formatting still isn't under the desired print width, then we iterate back down over the undesirable-to-break groups, breaking the outermost ones first, and stopping once everything fits under the desired print width (or we've broken all the groups).

It obviously introduces some time complexity, but I think it might be worth it. Does anyone have thoughts?

lydell · 2017-10-12T06:02:24Z

It sounds like a good idea. But I'm worried about potential performance problems, too. One way of going forward is to make a proof of concept PR, and add some crazy nested code as a performance test.

azz · 2017-10-12T12:00:34Z

Maybe the concept of a "strong group"?

Before:

[
  "new ",
  "Foo",
  group(["<", indent([softline, "T"]), softline, ">"]),
  group([
    "(",
    indent([softline, "1", ",", line, "2", ",", line, "3"]),
    softline,
    ")"
  ]),
  ";",
  hardline,
  breakParent
];

After:

[
  "new ",
  "Foo",
  strongGroup(["<", indent([softline, "T"]), softline, ">"]),
  group([
    "(",
    indent([softline, "1", ",", line, "2", ",", line, "3"]),
    softline,
    ")"
  ]),
  ";",
  hardline,
  breakParent
];

azz · 2017-10-12T12:03:12Z

I think it'll be tricky to get right... But if I'm certainly interested.

karl · 2017-10-21T21:12:58Z

Really interesting idea, would love to see a proof of concept!

suchipi · 2017-10-31T21:00:20Z

I've been trying to find a free weekend to work on this for a while; hopefully I'll have some time this weekend.

bakkot · 2017-10-31T21:08:09Z

This might be a solution for #2482: instead of a = b being a single unbreakable unit, have it be a "strong group" consisting of a = and b.

Edit: though, thinking about it more, probably not quite; see comment there.

bakkot · 2017-10-31T21:22:01Z

Also, this is kind of in the direction of dartfmt; if you haven't read that essay it's worth reading.

suchipi · 2017-11-11T00:06:53Z

With the holiday season coming up, it will probably be a while before I'll have time to implement a proof-of-concept for this. If anyone else wants to take a stab at it, they are welcome to.

duailibe · 2017-11-11T01:55:08Z

@suchipi I have some free time in the following weeks but I'm not exactly sure how to implement this. Do you have any general advice?

suchipi · 2017-11-12T22:38:05Z

No; I'm not familiar with the doc printer, so I was gonna dig in and just try to figure stuff out. But I think the implementation should involve two new doc nodes: strongGroup and strongGroupBoundary. A strong group shouldn't break unless we have broken every other group or strong group up to the nearest strong group boundary and code within that strong group boundary still doesn't fit under the print width. There should probably be strong group boundaries around each statement, and around the whole program. And we'll probably end up putting them in a few other places eventually, too.

vjeux · 2018-03-27T20:55:51Z

I'm not sure I understand this issue. "break this group if you have to, but try to break its parents first" is how the prettier algorithm works already. It always breaks the outermost group first.

I believe that the issues you're seeing is that if there are two groups next to each other, prettier algorithm will break the last one which is not always what you want. We sometimes want to control which one to break first.

A technique that we use is to remove a group in those cases. A good example is function arguments vs return types.

With a naive implementation, the following would break like this:

function f(a: int, b: int): React.Component<Props, State> {}

// Would break like this
function f(a: int, b: int): React.Component<
  Props,
  State,
> {}

// But you want to break like this
function f(
  a: int,
  b: int,
): React.Component<Props, State> {}

The doc structure looks something like this:

group([
  "function f(",
  group(comma(["a: int", "b: int"])),
  "): React.Component<",
  group(comma(["Props", "State"])),
  "> {}",
])

If you remove the first group, then it's going to have the behavior that you want:

group([
  "function f(",
  comma(["a: int", "b: int"]), // <-- no more group here
  "): React.Component<",
  group(comma(["Props", "State"])),
  "> {}",
])

I don't know if it applies to this exact situation but this is a handy tool to have on the toolbox to fix those kind of edge cases.

vjeux · 2018-03-27T21:15:47Z

I looked at the issues you linked and outside of the one around lambdas, they are all about member chains. This is by far the most complicated piece of prettier logic. What I would recommend is to try and make those examples work and see what tests break. We have a really good coverage at this point.

We may or may not need new primitives for it but we need to understand what trade-offs have been made. You’ll likely find examples that look very similar based on the ast that in some cases you would print one way and some other cases you would print another way and it’s hard to automatically chose which to use.

vjeux · 2018-03-27T21:31:38Z

For the lambda expansion, we may want to introduce a new concept to prettier. I'm implementing a pretty printer for a different language and I am experimenting with a different approach: #3376 (comment)

So far it looks promising but I still find edge cases so it's not yet guaranteed to work.

suchipi · 2018-03-27T21:47:15Z

@vjeux thanks for the input. It appears you are right, this happens more frequently with sibling groups than parent/child groups.

The algorithm you mentioned in that comment sounds perfect for this kind of stuff- excited about it, hope it pans out.

octogonz · 2019-11-06T21:40:04Z

This proposal hasn't been updated in a while. Did we (mostly) converge on a design, but nobody has stepped up to implement it? Or is the idea still in the speculative stages? Just curious about the status.

alexander-akait · 2019-11-07T10:21:09Z

@octogonz it is just idea, anyway you can send a PR with implementation and we review this

thorn0 · 2021-02-10T14:28:30Z

"Break this group if you have to, but try to break its parents first" is indeed how Prettier already works, but it's just an unfortunate wording. This problem:

a single item in braces, parens, angle brackets, or square brackets gets left on a line alone, and it looks odd. In each instance, it seems that there is an alternative formatting option that looks better and still fits within the line limit.

is a real issue and still doesn't have a solution. E.g., that's how it manifested itself in my attempts to fix assignments (playground):

{
  {
    // (1) -- unbalanced
    //   - big visual distance between the fn name and the arg
    //   - the lone short argument on its own line looks bad
    result.typeParameters = this.convertTSTypeParametersToTypeParametearsFoo1(
      node
    );

    // (2) -- much better than (1)
    result.typeParameters =
      this.convertTSTypeParametersToTypeParametearsFoo1(node);

    // (3) -- but with a longer argument, (2) breaks and becomes bad:
    //   takes an extra line, extra indentation looks unjustified
    result.typeParameters =
      this.convertTSTypeParametersToTypeParametearsFoo1(
        convertTSTypeParametersToTypeParametearsFoo1
      );

    // (4) -- better than (3), but how can we algorithmically distinguish this case from (1)?
    result.typeParameters = this.convertTSTypeParametersToTypeParametearsFoo1(
      convertTSTypeParametersToTypeParametearsFoo1
    );
  }
}

suchipi added the status:needs discussion Issues needing discussion and a decision to be made before action can be taken label Oct 12, 2017

ikatyang added the type:enhancement A potential new feature to be added, or an improvement to how we print something label Oct 15, 2017

jwbay mentioned this issue Oct 18, 2017

Flow Array<Type> angled brackets separated into multiple lines #1825

Closed

lydell mentioned this issue Oct 31, 2017

Prevent expanding function parameters into multiline #3122

Closed

bakkot mentioned this issue Oct 31, 2017

Inconsistency breaking assignments #2482

Closed

suchipi mentioned this issue Nov 11, 2017

Odd splitting when destructuring #3227

Open

lydell mentioned this issue Nov 13, 2017

Strict word wrapping rules #3256

Closed

duailibe mentioned this issue Nov 21, 2017

Template literals: Don't break on identifiers but break if comments #3299

Merged

lydell mentioned this issue Dec 2, 2017

Formatting off for indented parameters to function #3376

Closed

brainkim mentioned this issue Jun 26, 2019

Javascript: Use function literals in arguments to detect function composition #6033

Merged

3 tasks

brodycj mentioned this issue Dec 5, 2019

Awkward formatting of elisions in array assignment patterns #7089

Open

thorn0 added the area:doc printer label Feb 9, 2021

thorn0 mentioned this issue Apr 5, 2021

2.3 regression testing thorn0/prettier-regression-testing#8

Closed

thorn0 mentioned this issue Apr 6, 2021

Tweak object destructuring in assignments #10643

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: New "break this group if you have to, but try to break its parents first" doc command? #3014

RFC: New "break this group if you have to, but try to break its parents first" doc command? #3014

suchipi commented Oct 12, 2017 •

edited

Loading

lydell commented Oct 12, 2017

azz commented Oct 12, 2017 •

edited

Loading

azz commented Oct 12, 2017

karl commented Oct 21, 2017

suchipi commented Oct 31, 2017

bakkot commented Oct 31, 2017 •

edited

Loading

bakkot commented Oct 31, 2017

suchipi commented Nov 11, 2017

duailibe commented Nov 11, 2017

suchipi commented Nov 12, 2017

vjeux commented Mar 27, 2018

vjeux commented Mar 27, 2018

vjeux commented Mar 27, 2018

suchipi commented Mar 27, 2018 •

edited

Loading

octogonz commented Nov 6, 2019

alexander-akait commented Nov 7, 2019

thorn0 commented Feb 10, 2021 •

edited

Loading

RFC: New "break this group if you have to, but try to break its parents first" doc command? #3014

RFC: New "break this group if you have to, but try to break its parents first" doc command? #3014

Comments

suchipi commented Oct 12, 2017 • edited Loading

lydell commented Oct 12, 2017

azz commented Oct 12, 2017 • edited Loading

azz commented Oct 12, 2017

karl commented Oct 21, 2017

suchipi commented Oct 31, 2017

bakkot commented Oct 31, 2017 • edited Loading

bakkot commented Oct 31, 2017

suchipi commented Nov 11, 2017

duailibe commented Nov 11, 2017

suchipi commented Nov 12, 2017

vjeux commented Mar 27, 2018

vjeux commented Mar 27, 2018

vjeux commented Mar 27, 2018

suchipi commented Mar 27, 2018 • edited Loading

octogonz commented Nov 6, 2019

alexander-akait commented Nov 7, 2019

thorn0 commented Feb 10, 2021 • edited Loading

suchipi commented Oct 12, 2017 •

edited

Loading

azz commented Oct 12, 2017 •

edited

Loading

bakkot commented Oct 31, 2017 •

edited

Loading

suchipi commented Mar 27, 2018 •

edited

Loading

thorn0 commented Feb 10, 2021 •

edited

Loading