Create generic ftl serializer #241

RumovZ · 2021-11-11T09:14:05Z

This updates @Michael-F-Bryan's work from #184 to work with the current master branch. It started out as more of a quick and ugly fix, so please let me know if there's anything you want me to smooth out.

…parated by newlines

Botch with only `&str` supported.

It seems like `*` is supposed to go _inside_ the indent now.

zbraniecki · 2022-04-22T18:03:44Z

@RumovZ thank you for your contribution and apologies for a late response.

I like your PR and I think it's generally landable, but I'd like to ask for one change - for round trip validation instead of writing unit tests with strings of FTL, could you leverage existing fixtures - https://github.com/projectfluent/fluent-rs/tree/master/fluent-syntax/tests/fixtures ?

The way I imagine it is that you'd add a ./fluent-syntax/tests/serializer_fixtures.rs and pull FTL files from the fixtures directory and round trip them to see if the output is the same.

If any of the FTL doesn't roundtrip properly, you could have a blacklist static array with names of files that don't roundtrip correctly yet to fix in follow ups.

For such files we'd like to make sure that the AST created from it at least doesn't crash when serialized.

Let me know how it sounds to you.

I'm also adding @eemeli as a second reviewer.

zbraniecki · 2022-04-22T18:06:22Z

In the unit tests, I'd like to instead see a very small and basic corpus of tests that are parse -> modify -> serialize operations like:

change ID
change simple string value
add PluralRules variant
change variable reference
remove message
add message at the end
add message in the middle
add message at the top
add a comment
remove a comment
edit a comment

Feel free to pick any of the above, and it's totally ok if you don't add all. We can extend it as we go. I'd like to land your PR soon.

RumovZ · 2022-04-24T08:02:53Z

Sounds good, but it's been a while, and I'll have to reimmerse myself in the matter. Nevertheless, I'll definitely try to implement this.

RumovZ · 2022-04-25T17:15:15Z

I had a look and the fixtures aren't suitable for parse-serialise roundtrip testing (i.e. comparing plaintext), as they are not normalised. In fact, only the two empty files pass the test.
A lot more pass the slightly weaker parse-serialise-parse roundtrip test (i.e. comparing ASTs), but the majority also fail this one. There may be actual issues with the serializer, but I think most fail due to inconsistent whitespace around junk, which might not be trivial for the parser to preserve.
Would you like to see such a parse-serialise-parse roundtrip test? Then I'd investigate those issues.

For such files we'd like to make sure that the AST created from it at least doesn't crash when serialized.

Not sure what you mean by that. Aren't both parsing and serialising infallible, if we allow for junk?

alerque · 2022-04-25T18:02:45Z

Would you like to see such a parse-serialise-parse roundtrip test?

I don't think a serializer would be any use without the ability to round-trip a parsed AST.

I can understand not being able to round-trip from the existing fixtures as many of them are designed to be problematic sources that can be parsed (partially or fully) into functional ASTs. But if you have an AST, you should be able to serialize it and then get an identical AST back by deserializing. And if you can dump the serialized format back to an FTL file, that should also parse back to the same AST. Otherwise something is broken.

RumovZ · 2022-04-25T18:36:55Z

Certainly, but serialize(parse(ftl)) == ftl implies parse(serialize(parse(ftl))) == parse(ftl), whereas the reversal is not true, so I'm asking whether testing the latter is sufficient from the maintainers' point of view.
I was under the impression that @zbraniecki was referring to the former, but maybe I had misunderstood.

gregtatum · 2022-10-27T15:42:57Z

I'm triaging the pull requests, and it appears that this one has feedback that is unaddressed and is stale. Feel free to re-open as needed.

Suggestion: If you want to continue this work, if it's possible to carve it out into smaller PRs, that would make it easier for maintainers to review and land. I'm not sure without digging into the PR if that's feasible here.

RumovZ · 2022-10-27T16:45:31Z

As is stated above, it is not possible to address @zbraniecki's feedback or it needs to be clarified. There is no point in splitting this PR if maintainers are not interested in a merge (nor would it make any sense from a technical point of view).

alerque · 2022-10-27T20:56:45Z

This PR is already a follow up to #184 and is an ongoing work for something that would be useful. It does not make sense to split it up into smaller PRs either. The design requirement needs to be settled on so it can be addressed, not closed. I think the diagnosis of "unaddressed feedback" is backwards here.

gregtatum · 2022-10-28T12:46:09Z

Ok, I can re-open, but I'm not sure I have the bandwidth myself right now to dive into it myself.

gregtatum

Ok, I've had time to take some time with this PR. The implementation looks really nice. I'm marking request changes for two things:

I have several docs requests that I think should be added before landing.
For Zibi's suggestion I can see why he would want to use existing fixtures. Could you please give a few examples of how the existing fixtures are failing to roundtrip? It would be good to know if there are actual bugs, or if it's just whitespace issues.

If it's just whitespace issues, I would like to at least land a couple of tests initially that pass the round tripping, then we can follow-up looking at incorporating more round-tripping fixtures. This would provide more confidence for correctness and long-term maintainability.

fluent-syntax/src/serializer.rs

`\r\n` was parsed as a separate TextElement of `\n`, whereas `\n` is parsed as part of the antecedent TextElement.

gregtatum · 2022-11-07T20:10:22Z

What Zibi says sounds reasonable. Mainly I would like to see tests that provide good coverage to ensure we don't have bugs. The advantage of having .ftl files is that they are easy to write generic tests against beyond this single case. I don't mind the inline tests like you have in the current changeset, since they are acting as unit tests next to where the code is written. However, I'd also be fine with migrating them all to .ftl files at your discretion.

Thanks for getting back to me on this. After two hiatuses of half a year, it would be cathartic to see it merged.

Now that I'm up to speed, I'll make sure and be available to help get this merged in if you are available to help contribute it! Thanks for sticking through the slow process.

I'm not sure if we're talking about the same thing #241 (comment). Please clarify if you're referring to comparing serializations or ASTs, like I've suggested here.

Mostly I was nervous about the comment above that the lack of round tripping may be bugs.

tests/fixtures/well-formed/*

Suggestion: Maybe normalized would be better than well-formed, since whitespace differences would still be well-formed, but maybe not normalized.

* Move old fixtures into resource files. * Test on unnormalized fixtures as well.

Also make Serializer private.

RumovZ · 2022-11-08T17:45:05Z

Is the documentation sufficient?

The roundtrip tests revealed a few minor errors with the serializer and parser. I fixed one parse error, but one remains and prevents the last fixture from roundtripping correctly:

key12 =
    { "." }
        four

The parser swallows the four spaces in front of "four", which is wrong, if the Python implemenation is to be believed. This behaviour is also inconsistent with the parsing of

key12 =
{ "." }
    four

(the actual text in the fixture), in which case the whitespace is preserved.

I find the parser code quite hard to get into, so I won't try to fix this one and this PR is finished from my side.

zbraniecki · 2022-11-08T17:58:52Z

Thanks for catching that! @stasm - do you remember how you intended for the dedentation to work here?

gregtatum

Thanks for making the changes! This is looking almost ready to merge.

I have a few minor pieces of feedback, and it looks like CI is failing on one of the tests.

Also, when you're ready for re-review please hit the little refresh icon next to my review name so it'll show up in my review queue. I believe from your message that you were ready for re-review.

This arrow:

fluent-syntax/src/serializer.rs

fluent-syntax/tests/serializer_fixtures.rs

fluent-syntax/src/parser/pattern.rs

fluent-syntax/src/serializer.rs

fluent-syntax/tests/fixtures/normalized/attributes.ftl

gregtatum · 2022-11-09T19:33:39Z

I find the parser code quite hard to get into, so I won't try to fix this one and this PR is finished from my side.

@RumovZ Would you mind filing this as a new issue?

fluent-syntax/src/lib.rs

gregtatum

Woo! Thanks for being patient and working through the review process. Let's merge this!

RumovZ · 2022-11-10T16:25:33Z

Amazing! Thank you for your feedback and support.
In the end, I didn't submit an issue for c009674. It's not a bug if you consider two TextElements equivalent to a single TextElement with the concatenation of their texts. At least, it's explicitly mentioned now.

alerque · 2022-11-10T18:16:52Z

Thanks for all the work on this guys!

Michael Bryan and others added 14 commits August 9, 2020 21:32

Start writing up the serializer

b860d59

Made sure indentation is handled properly

2eee000

Started handling select expressions

2c2f006

Switched " " to "\t" to make recognising indents easier

f8f3f72

Everything from TypeScript's "Serialize resource" test suite passes

a650135

Added the rest of the tests

f95efce

Added a test to make sure subsequent entries with a comment aren't se…

34f7a93

…parated by newlines

Merge branch '0.11.0' into serializer

0b32979

Fix serializer for fluent-syntax 0.11.0

f3378e2

Botch with only `&str` supported.

Fix empty lines in messages being swallowed

b627c05

Fix indentation of default asterisk

ef6d1a8

It seems like `*` is supposed to go _inside_ the indent now.

Fix lints

1801439

Merge branch 'master' into serializer

0b04aad

Make serialize() generic over Slice

df8c691

zbraniecki requested review from zbraniecki and eemeli April 22, 2022 18:00

zbraniecki mentioned this pull request Apr 22, 2022

Create a FTL serializer #184

Closed

gregtatum closed this Oct 27, 2022

gregtatum reopened this Oct 28, 2022

gregtatum requested changes Nov 7, 2022

View reviewed changes

RumovZ added 6 commits November 7, 2022 20:45

Merge remote-tracking branch 'upstream/master' into serializer

3ff29d5

Handle rare edge case of line terminating \r

1b340e7

Fix redundant line break after after junk

099995a

Align parsing of CRLF-terminated patterns

c009674

`\r\n` was parsed as a separate TextElement of `\n`, whereas `\n` is parsed as part of the antecedent TextElement.

Don't break line before leading dot pattern

c469cf1

Don't implicitly trim when writing literals

6fbf7f9

RumovZ added 6 commits November 8, 2022 16:41

Replace roundtrip tests with manipulation tests

24e6d49

Reintroduce roundtrip tests

99cd884

* Move old fixtures into resource files. * Test on unnormalized fixtures as well.

Fix clippy lints

40e1a42

Document serializer mod and pub functions

5da5604

Also make Serializer private.

Replace unwrap with expect

a1ae7dd

Remove redundant error propogation

2e9c63f

gregtatum requested changes Nov 9, 2022

View reviewed changes

gregtatum reviewed Nov 9, 2022

View reviewed changes

fluent-syntax/src/lib.rs Show resolved Hide resolved

RumovZ added 4 commits November 9, 2022 21:43

BLACKLIST -> IGNORE_LIST

4c23b54

Fix typo

6fa7576

Mention serializer on changelog

c0e1722

Revert c009674 and add crlf.ftl to ignore list

c534eb7

RumovZ requested a review from gregtatum November 9, 2022 22:42

gregtatum approved these changes Nov 10, 2022

View reviewed changes

gregtatum merged commit 00f1499 into projectfluent:master Nov 10, 2022

RumovZ deleted the serializer branch November 10, 2022 16:25

This was referenced May 1, 2024

FTL Serializer #182

Closed

[Feature request] Allow serialize all AST types instead of just Resource #343

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create generic ftl serializer #241

Create generic ftl serializer #241

RumovZ commented Nov 11, 2021

zbraniecki commented Apr 22, 2022

zbraniecki commented Apr 22, 2022

RumovZ commented Apr 24, 2022

RumovZ commented Apr 25, 2022

alerque commented Apr 25, 2022

RumovZ commented Apr 25, 2022

gregtatum commented Oct 27, 2022

RumovZ commented Oct 27, 2022

alerque commented Oct 27, 2022

gregtatum commented Oct 28, 2022

gregtatum left a comment

gregtatum commented Nov 7, 2022

RumovZ commented Nov 8, 2022

zbraniecki commented Nov 8, 2022

gregtatum left a comment

gregtatum commented Nov 9, 2022

gregtatum left a comment

RumovZ commented Nov 10, 2022

alerque commented Nov 10, 2022

Create generic ftl serializer #241

Create generic ftl serializer #241

Conversation

RumovZ commented Nov 11, 2021

zbraniecki commented Apr 22, 2022

zbraniecki commented Apr 22, 2022

RumovZ commented Apr 24, 2022

RumovZ commented Apr 25, 2022

alerque commented Apr 25, 2022

RumovZ commented Apr 25, 2022

gregtatum commented Oct 27, 2022

RumovZ commented Oct 27, 2022

alerque commented Oct 27, 2022

gregtatum commented Oct 28, 2022

gregtatum left a comment

Choose a reason for hiding this comment

gregtatum commented Nov 7, 2022

RumovZ commented Nov 8, 2022

zbraniecki commented Nov 8, 2022

gregtatum left a comment

Choose a reason for hiding this comment

gregtatum commented Nov 9, 2022

gregtatum left a comment

Choose a reason for hiding this comment

RumovZ commented Nov 10, 2022

alerque commented Nov 10, 2022