Merge implicitly concatenated string literals that fit on one line #26

max-sixty · 2018-03-15T20:16:00Z

Black could make single-line strings over multiple lines (i.e. a number of single quotes strings on multiple lines surrounded by parentheses) more efficient, by resizing them to the full length of the line.

Even if that was overreach, there's a peculiar situation where you end up with multiple strings on the same line, like below:

-        warnings.warn('Dataset.sel_points is deprecated: use Dataset.sel()'
-                      'instead.', DeprecationWarning, stacklevel=2)
-
+        warnings.warn(
+            'Dataset.sel_points is deprecated: use Dataset.sel()' 'instead.',
+            DeprecationWarning,
+            stacklevel=2,
+        )

ambv · 2018-03-15T21:58:12Z

In this case Black is suggesting that you should merge the two strings into one and the result is more readable that way.

I don't do this automatically (yet?) because it gets complicated if the two strings don't share the same prefix (for example r'something' f'another thing'). This is where user action after the formatting is probably best.

max-sixty · 2018-03-15T22:22:41Z

Right, that makes sense. I think whether the strings on the same line are merge-able is clear (i.e. do they have the same prefix), but yes it's a rare case; feel free to close

And changing beyond that may require discretion (e.g. turning 6 lines of 2/3-long lines into 4 full length lines)

ambv · 2018-03-15T22:31:52Z

There's another related problem: if I merged string literals, I am now making semantic changes to the AST. I'm not opposed to those but this will make safety checks after reformatting trickier.

Let's leave this open for the time being, it's an interesting problem.

zsol · 2018-04-03T18:19:46Z

I'm not even sure what I would expect black to do with code that implicit-concatenates two differently prefixed strings to be honest. I think the path of least surprise is just leaving them alone.

ambv · 2018-04-03T21:09:07Z

Yeah, it they are different prefixes, leave them alone. If they share a prefix and they end up on the same line, they should be merged.

If you really want to be correct here the implementation is going to be hard in the following edge case:

two strings like "STR1" "STR2" don't fit on one line because the closing quote of STR1, the space, and the opening quote of STR2 are the 3 characters that cause the entire thing to not fit in a single line. So you will keep them on two lines.
but if you knew that it's safe to concatenate them, it would fit in a single line (without those 3 extra characters).

I'm inclined not to touch this edge case since that makes it tricky where to perform the merge.

Another small edge case which I'm inclined to avoid is this:

a = (
    "a"
    "bb"
    "ccc"
    "dddd"
    "eeeee"
    "ffffff"
    "ggggggg"
    "hhhhhhhh"
    "iiiiiiiii"
    "jjjjjjjjjj"
    "kkkkkkkkkkk"
    "llllllllllll"
    "mmmmmmmmmmmmmmmm"
)

Technically Black could implement the "fill" algorithm for this case that Prettier also has for JSX. But I think what it currently does is fine for simplicity and obvious for users to recognize.

aldanor · 2018-07-15T12:25:55Z

Another related case I've managed to hit with black is when it joins \-split string into one, and by doing so it violates line length limit. In this case (at least for now, while it's not implemented still), it'd be probably better if it did nothing? E.g.:

$ black -S -l30 --diff long_str.py

--- long_str.py	2018-07-15 12:24:14 +0000
+++ long_str.py	2018-07-15 12:24:41.221434 +0000
@@ -1,5 +1,2 @@
-s = '111111111111111111111' \
-    '222222222222222222222' \
-    '333333333333333333333' \
-    '444444444444444444444'
+s = '111111111111111111111' '222222222222222222222' '333333333333333333333' '444444444444444444444'

e3krisztian · 2018-08-01T12:20:11Z

@aldanor line length violation should be a bug (I have also seen it with black==18.6b4). I think it deserves a separate issue.

graingert · 2018-08-20T08:01:07Z

@ambv Python 3.7 has AST level constant folding: https://bugs.python.org/issue29469 so implicit string concatenation - or lack thereof - would be invisible to the AST check

ambv · 2018-08-20T10:01:33Z

I know, @graingert, but we can't require Python 3.7+ for all Black users. Not yet at least. What I'm pondering is if we should rather switch to do the AST post-check using typed-ast which would make it work exactly the same on both Python 2 and Python 3. And the same on all Python 3 versions.

hugovk · 2018-08-20T10:23:56Z

By the way, here's the pip installs for Black from PyPI for July 2018:

python_version	percent	download_count
3.6	86.07%	15,408
3.7	13.21%	2,364
3.5	0.41%	73
2.7	0.22%	39
3.4	0.08%	14
3.8	0.01%	2
3.3	0.01%	1
2.6	0.01%	1
Total		17,902

Source: pypinfo --start-date 2018-07-01 --end-date 2018-07-31 --percent --markdown black pyversion

ambv · 2018-08-20T10:30:58Z

Haha, one of those 3.8 downloads is me :-)

digitalresistor · 2018-10-16T19:59:00Z

Having Black complete the concatenation would be great.

ofek · 2018-12-02T01:39:52Z

In the time being can we add a warning when this happens so we can manually resolve it?

davidism · 2019-02-18T22:45:19Z

I'd definitely like to see a warning for this, maybe something like "Implicit string concatenation in line N not merged." An issue I've run into is that someone writing multiple similar strings, in tests for example, will add continuations for all the strings so they look the same, even though some would fit on a single line. Then Black moves them back to a single line but leaves the continuation sitting in the middle. The user was trying to satisfy the formatting rules, but ended up producing less ideal formatting without knowing it.

luxcem · 2019-03-28T10:42:03Z

What's your view on this example? Black left it unchanged.

def foo():
    return "Some long string cut in half," " this is really a long string"

def bar(text):
    return text

bar(("Some long string cut in half," " this is really a long string"))

Several string literals got moved to one line by black, but didn't get merged -- the result is odd and undesirable. Although people want Black to make the jump to editing AST for cases exactly like this ( psf/black#26 ), it's doubtful that will happen anytime soon. I would not be surprised, based on that thread, if black just waits for py3.6 to EOL and to support 3.7+ with this.

peterjc · 2019-08-09T23:38:41Z

Is there an open issue for the doing the opposite? I've found when black has left long lines in my code, it is usually overly long strings (mostly error messages, and when defining command line arguments).

Black could break long strings over multiple lines with implicit continuation (e.g. at spaces, or hyphens). I appreciate this would mean black having to set a convention for if the break point space should be trailing at the end of a truncated line, or leading at the start of a continued line.

Found it: See #182

keisheiled · 2019-11-27T08:29:28Z

I wrote a flake8 plugin to forbid these aberrant constructs: https://pypi.org/project/flake8_implicit_str_concat/

Some of these are recent regressions introduced by black psf/black#26 Others are long standing pre-existing style issues.

This pull request's main intention is to wraps long strings (as requested by #182); however, it also provides better string handling in general and, in doing so, closes the following issues: Closes #26 Closes #182 Closes #933 Closes #1183 Closes #1243

ambv changed the title ~~Handling long strings~~ Merge implicitly concatenated string literals that fit on one line Mar 15, 2018

ambv added the T: enhancement New feature or request label Mar 16, 2018

carljm mentioned this issue Mar 22, 2018

join strings if they are put onto the same line #61

Closed

ambv mentioned this issue Mar 23, 2018

Consider being even more opinionated about quotes #51

Closed

ambv added the help wanted Extra attention is needed label Apr 11, 2018

Lukas0907 mentioned this issue Apr 25, 2018

Reformatting print()s that span multiple lines does not remove extraneous "s #170

Closed

zsol mentioned this issue May 12, 2018

Is the concatenating lines with strings intended like this? #205

Closed

zsol mentioned this issue May 26, 2018

Multi-line strings possibly not corrected properly #261

Closed

pradyunsg mentioned this issue Jun 7, 2018

[WIP] Format code with black pypa/pip#5425

Closed

zsol mentioned this issue Jun 21, 2018

Bad formatting of multiline f-strings #369

Closed

TheLonelyGhost mentioned this issue Jul 10, 2018

Black GrafeasGroup/tor#129

Closed

ofek mentioned this issue Mar 14, 2019

Add style checker and formatter DataDog/integrations-core#3299

Merged

ambv mentioned this issue Mar 14, 2019

multipart string left multipart #743

Closed

swenzel mentioned this issue Apr 28, 2019

Use plus sign to concatenate strings #817

Closed

jdufresne mentioned this issue May 1, 2019

Introduce black linting encode/django-rest-framework#6586

Closed

jdufresne mentioned this issue Aug 17, 2019

Combine adjacent strings into one string PyCQA/isort#999

Merged

zsol mentioned this issue Sep 4, 2019

Style concern about continued-line strings #998

Closed

hugovk mentioned this issue Sep 27, 2019

Format code with Black pypa/pip#7084

Closed

JelleZijlstra mentioned this issue Oct 9, 2019

BUG: extra " " inserted when reformatting long string literals #1051

Closed

peterjc added a commit to peterjc/biopython that referenced this issue Nov 29, 2019

Fix string concatenation style in Tests/

1b2619f

Some of these are recent regressions introduced by black psf/black#26 Others are long standing pre-existing style issues.

peterjc added a commit to biopython/biopython that referenced this issue Nov 29, 2019

Fix string concatenation style in Tests/

65ad01e

Some of these are recent regressions introduced by black psf/black#26 Others are long standing pre-existing style issues.

WillAyd mentioned this issue Dec 18, 2019

Ran black on project tableau/TabPy#379

Merged

kalzoo mentioned this issue Dec 23, 2019

Use Black for code style and enforce in the CI rigetti/pyquil#1132

Merged

4 tasks

hugovk mentioned this issue Feb 18, 2020

Concatenation of a string is not correct. #1277

Closed

ambv added the stable label Mar 3, 2020

bbugyi200 mentioned this issue Mar 15, 2020

Improve String Handling #1132

Merged

ofek mentioned this issue Apr 9, 2020

Black is changing string output #1338

Closed

ichard26 mentioned this issue Apr 28, 2020

Black does not capture cases of strings separated by a space #1362

Closed

ambv closed this as completed in #1132 May 8, 2020

mgedmin mentioned this issue May 29, 2020

Resolve flake8 violations in tests/ linkchecker/linkchecker#428

Merged

pradyunsg mentioned this issue Sep 23, 2020

Blacken the codebase pypa/pip#8903

Closed

dargueta mentioned this issue Feb 4, 2021

String literals concatenated with space between and exceed line length #1964

Closed

potiuk mentioned this issue May 9, 2022

Clean up in-line f-string concatenation apache/airflow#23591

Merged

mentonin mentioned this issue Jul 13, 2022

Tools/LogAnalyzer: fix and update loganalyzer ArduPilot/ardupilot#21172

Merged

ds-cbo mentioned this issue Dec 15, 2023

Add linters configuration, reformat whole code alecthomas/voluptuous#503

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge implicitly concatenated string literals that fit on one line #26

Merge implicitly concatenated string literals that fit on one line #26

max-sixty commented Mar 15, 2018 •

edited

Loading

ambv commented Mar 15, 2018

max-sixty commented Mar 15, 2018

ambv commented Mar 15, 2018

zsol commented Apr 3, 2018

ambv commented Apr 3, 2018

aldanor commented Jul 15, 2018

e3krisztian commented Aug 1, 2018

graingert commented Aug 20, 2018 •

edited

Loading

ambv commented Aug 20, 2018

hugovk commented Aug 20, 2018

ambv commented Aug 20, 2018

digitalresistor commented Oct 16, 2018

ofek commented Dec 2, 2018

davidism commented Feb 18, 2019

luxcem commented Mar 28, 2019

peterjc commented Aug 9, 2019 •

edited

Loading

keisheiled commented Nov 27, 2019

Merge implicitly concatenated string literals that fit on one line #26

Merge implicitly concatenated string literals that fit on one line #26

Comments

max-sixty commented Mar 15, 2018 • edited Loading

ambv commented Mar 15, 2018

max-sixty commented Mar 15, 2018

ambv commented Mar 15, 2018

zsol commented Apr 3, 2018

ambv commented Apr 3, 2018

aldanor commented Jul 15, 2018

e3krisztian commented Aug 1, 2018

graingert commented Aug 20, 2018 • edited Loading

ambv commented Aug 20, 2018

hugovk commented Aug 20, 2018

ambv commented Aug 20, 2018

digitalresistor commented Oct 16, 2018

ofek commented Dec 2, 2018

davidism commented Feb 18, 2019

luxcem commented Mar 28, 2019

peterjc commented Aug 9, 2019 • edited Loading

keisheiled commented Nov 27, 2019

max-sixty commented Mar 15, 2018 •

edited

Loading

graingert commented Aug 20, 2018 •

edited

Loading

peterjc commented Aug 9, 2019 •

edited

Loading