lines & unlines #28

chemist · 2014-08-05T09:51:25Z

In Bytestring, String and Text EOL always '\n', but in windows its \r\n
how about separate version for lines, unlines?
lines can be universal, and unlines like
unlinesUniversal :: EOL -> [ByteString] -> ByteString

Its not critical, but can be useful.

dcoutts · 2014-11-09T18:40:39Z

I don't like this very much, even on Windows if you open a file in text mode then you'll get \r\n conversion. The more general thing is then splitting, which we should do by integrating or extending the split package for ByteString (and Text).

Leaving this open for discussion though.

rahulmutt · 2018-03-25T13:08:04Z

It would be really useful to have unlinesUniversal or even patch unlines to handle \r too because the current implementation violates the principle of least astonishment.

The intuition any beginner Haskeller gets from using the lines function for String is that it returns proper lines without the \r character. You'll write a code assuming that ByteString's version of lines works in a cross-platform way.

I'd be happy to send a PR if the maintainers are fine with it. I wasted half a day finding a bug in a parser only to find that it was working under the assumption that the lines function strips away the \r as well for Windows. I hope to prevent this for anyone else in the future.

There should at least be a line in the documentation in bold letters saying that lines doesn't strip off \r because that's very important to know.

* add haddock comments about the behaviour of IsString instances (#140) * Correct the documentation of split and so on (fixes #161) * Document the behaviour of unsafeUseAsCStringLen when BS.empty is passed (fixes #207) * s/encodeListWithB/primMapListBounded/ (fixes #50) * Fix broken links in Data.ByteString.Prim * Fix all missing Haddock links * Note that `lines` doesn't handle CR (#28) * encodeByteStringWithF seems to actually mean primMapByteStringFixed

Bodigrim · 2020-09-26T14:04:35Z

The intuition any beginner Haskeller gets from using the lines function for String is that it returns proper lines without the \r character.

I cannot find any special treatment for \r in Data.List.lines:

> lines "foo\r\nbar"
["foo\r","bar"]

So Data.ByteString.lines is in line with its Prelude counterpart. This behaviour has been recently reflected in the documentation. A general splitOn function is discussed in #100.

I'm in favor of closing this.

sjakobi · 2020-10-01T12:49:00Z

I agree with closing this issue.

sjakobi added enhancement Windows labels Jun 25, 2020

fumieval added a commit to fumieval/bytestring that referenced this issue Jul 7, 2020

Note that lines doesn't handle CR (haskell#28)

f80c455

fumieval added a commit to fumieval/bytestring that referenced this issue Jul 8, 2020

Note that lines doesn't handle CR (haskell#28)

7bb8e16

sjakobi closed this as completed Oct 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lines & unlines #28

lines & unlines #28

chemist commented Aug 5, 2014

dcoutts commented Nov 9, 2014

rahulmutt commented Mar 25, 2018 •

edited

Loading

Bodigrim commented Sep 26, 2020

sjakobi commented Oct 1, 2020

lines & unlines #28

lines & unlines #28

Comments

chemist commented Aug 5, 2014

dcoutts commented Nov 9, 2014

rahulmutt commented Mar 25, 2018 • edited Loading

Bodigrim commented Sep 26, 2020

sjakobi commented Oct 1, 2020

rahulmutt commented Mar 25, 2018 •

edited

Loading