Content.Text adds invisible whitespace to output #1105

panesofglass · 2020-06-10T13:22:09Z

The tests for the-benchmarker/web-frameworks#2888 are failing. When printed out, they look correct, e.g. "" = "" or "0" = "0". However, there is evidence of additional characters or at least mismatched encoding. According to the project, encoding should be UTF-8, which should match Content.Text's output. However, something is clearly added that doesn't appear to affect normal parsing or processing.

The text was updated successfully, but these errors were encountered:

granicz · 2020-06-10T14:12:47Z

What content-type is the benchmark expecting?

panesofglass · 2020-06-10T15:23:23Z

I think it expects text/plain, though that is not explicitly stated anywhere.

granicz · 2020-06-10T22:06:54Z

By default, Content.Text uses UTF-8, and it also takes an optional argument for the encoding. See the function here: https://github.com/dotnet-websharper/core/blob/master/src/sitelets/WebSharper.Sitelets/Content.fs#L427

I think the problem might be that it sends empty headers. Can you try replacing Content.Text with the following Text function? If it works, I can change Context.Text accordingly and re-release.

let Text (msg: string) =
    Content.Custom(
        Headers = [Http.Header.Custom "content-type" "text/plain"],
        WriteBody = fun s ->
            let encoding = System.Text.Encoding.UTF8
            use w = new System.IO.StreamWriter(s, encoding)
            w.Write msg
    )

Tarmil · 2020-06-10T23:06:36Z

Alternately, this should be equivalent: Content.Text msg |> Content.WithContentType "text/plain"

panesofglass · 2020-06-11T00:21:31Z

Strangely, the above results in the same issue. In Postman, I'm able to download the the response, even though when I open the file it appears empty. However, with the following, I get a true, empty body:

let Text (msg: string) =
    Content.Custom(
        WriteBody = fun s ->
            let bytes = System.Text.Encoding.UTF8.GetBytes(msg)
            s.Write(bytes, 0, bytes.Length))

Closes dotnet-websharper#1105

granicz · 2020-06-11T09:16:21Z

You mentioned extra characters - can you paste a screenshot of the difference between the two functions above?

panesofglass · 2020-06-11T12:09:35Z

I deleted the file I downloaded from Postman. It's some sort of non-printable whitespace character. When writing "", the body contains some kind of byte. I didn't decode it to find out which. I cannot comprehend why the StreamWriter would add it.

original:

static member Text (text: string, ?encoding: System.Text.Encoding) : Async<Content<'Endpoint>> =
    let encoding = System.Text.Encoding.UTF8
    Content.Custom(
        WriteBody = fun s ->
            use w = new System.IO.StreamWriter(s, encoding)
            w.Write msg
    )

updated:

let Text (msg: string) =
    Content.Custom(
        WriteBody = fun s ->
            let bytes = System.Text.Encoding.UTF8.GetBytes(msg)
            s.Write(bytes, 0, bytes.Length)
    )

The only differences are that I get the bytes and write them directly, whereas the original uses a StreamWriter to write to the stream. You can see the direct comparison in the PR: https://github.com/dotnet-websharper/core/pull/1106/files#diff-ada0621f6daf8e515b65bb44ba7192fa

Tarmil · 2020-06-11T22:09:12Z

It looks like StreamWriter actually writes the 3-byte UTF-8 byte order mark EF BB BF. It would be a better fix to prevent it from doing that, rather than use GetBytes which creates an intermediary byte array. The trick is to use new UTF8Encoding(false) instead of the static Encoding.UTF8. I can confirm that the following writes 0 bytes:

let defaultEncoding = new System.Text.Encoding.UTF8Encoding(false)

let Text (msg: string) =
    Content.Custom(
        WriteBody = fun s ->
            use w = new System.IO.StreamWriter(s, defaultEncoding)
            w.Write(msg)
    )

panesofglass · 2020-06-11T22:34:37Z

Awesome! I didn’t know about that!

panesofglass · 2020-06-12T00:51:07Z

As Content.Text already accepts an encoding parameter, it seems this is not a bug. Should I just close this?

panesofglass added a commit to panesofglass/websharper that referenced this issue Jun 11, 2020

Don't write extraneous characters in Content.Text

bc309c8

Closes dotnet-websharper#1105

panesofglass mentioned this issue Jun 11, 2020

Don't write extraneous characters in Content.Text #1106

Merged

panesofglass mentioned this issue Jun 12, 2020

[F#] WebSharper Updates the-benchmarker/web-frameworks#2939

Merged

granicz closed this as completed in #1106 Jun 12, 2020

Jand42 added a commit that referenced this issue Jun 13, 2020

#1105 fix for Encoding in Content.Text

ab77df2

Jand42 added the 4.6.5.406 label Jun 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Content.Text adds invisible whitespace to output #1105

Content.Text adds invisible whitespace to output #1105

panesofglass commented Jun 10, 2020

granicz commented Jun 10, 2020

panesofglass commented Jun 10, 2020

granicz commented Jun 10, 2020

Tarmil commented Jun 10, 2020

panesofglass commented Jun 11, 2020

granicz commented Jun 11, 2020

panesofglass commented Jun 11, 2020 •

edited

Loading

Tarmil commented Jun 11, 2020

panesofglass commented Jun 11, 2020

panesofglass commented Jun 12, 2020 •

edited by granicz

Loading

Content.Text adds invisible whitespace to output #1105

Content.Text adds invisible whitespace to output #1105

Comments

panesofglass commented Jun 10, 2020

granicz commented Jun 10, 2020

panesofglass commented Jun 10, 2020

granicz commented Jun 10, 2020

Tarmil commented Jun 10, 2020

panesofglass commented Jun 11, 2020

granicz commented Jun 11, 2020

panesofglass commented Jun 11, 2020 • edited Loading

Tarmil commented Jun 11, 2020

panesofglass commented Jun 11, 2020

panesofglass commented Jun 12, 2020 • edited by granicz Loading

panesofglass commented Jun 11, 2020 •

edited

Loading

panesofglass commented Jun 12, 2020 •

edited by granicz

Loading