Drop the Literal quoted flag from the data model #443

eemeli · 2023-07-29T09:42:19Z

At the moment, the data model includes a boolean `quoted` property on the `Literal` construct. This should be dropped, as it's not meant to affect anything at runtime.

As the data model is extensible by implementations, this does allow for an implementation to add the field back in as a private extension, should it have a need to track this information.
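As a rough illustration of the change and of the extension point described above, here is a minimal TypeScript sketch. The interface shapes, the `type` discriminator, and the names `LiteralWithQuoting` and `formatLiteral` are assumptions made for this example; the normative definitions are the ones in `spec/data-model.md`.

```typescript
// Assumed shape of the construct once the `quoted` flag is dropped.
interface Literal {
  type: 'literal';
  value: string;
}

// Hypothetical private extension: an implementation that needs to know
// whether the source used quotes can add the field back for its own use.
interface LiteralWithQuoting extends Literal {
  quoted: boolean;
}

// Hypothetical runtime helper: it only ever reads `value`,
// so the extra field has no effect on the result.
function formatLiteral(lit: Literal): string {
  return lit.value;
}

const plain: Literal = { type: 'literal', value: 'hello' };
const tracked: LiteralWithQuoting = { type: 'literal', value: 'hello', quoted: true };

console.log(formatLiteral(plain) === formatLiteral(tracked));
```

Because `LiteralWithQuoting` is structurally a `Literal`, code written against the base interface accepts both values unchanged.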

aphillips

Good start. Some wording tweaks suggested.

spec/data-model.md

aphillips · 2023-07-29T15:51:36Z

+Both _quoted_ and _unquoted_ values are represented by `Literal`,
+as the use or lack of quotation is a presentation detail
+which has no effect on the meaning of the _literal_.
 The `value` of `Literal` is the "cooked" value (i.e. escape sequences are processed).


I would avoid the jargon-ish "cooked" here, and I would note that the quotes are not included (where they exist), as you did for reserved's sigils elsewhere.

Suggested change

-The `value` of `Literal` is the "cooked" value (i.e. escape sequences are processed).
+The `value` of `Literal` does not include surrounding quotes (where present)
+and replaces `quoted-escape` sequences with the unescaped character.

eemeli · 2023-07-30

I'd like for this to be considered in a separate issue or PR, for two reasons:

1. The data model uses the "raw" and "cooked" terms also with respect to Text and Reserved, which ought to be updated simultaneously. I would rather keep that outside the scope of this rather focused PR.
2. At the moment, the data model is described and explained via equivalences with the MF2 syntax. If there is a desire to describe it as an explicit result of parsing the syntax, as suggested by the term "replaces" here, that's a much bigger change that ought to be accompanied by some additional documentation.

aphillips · 2023-07-30

If "raw" and "cooked" are terms, define them as terms (in a terminology section that needs to be added) and link them.

A different way of saying this would be to go more Unicode-like:

Suggested change

-The `value` of `Literal` is the "cooked" value (i.e. escape sequences are processed).
+The `value` of `Literal` is the code point sequence contained by the _literal_,
+with external syntax (such as quotes) removed
+and escape sequences resolved to the characters that they represent.

This would apply to any representation, not just MF2. For example, a JS string would replace \u20ac notation with € in the Literal.
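To make the "external syntax removed, escapes resolved" wording concrete, here is a small TypeScript sketch. It assumes the literal syntax of the draft at the time, with `|` as the quote character and `\\` and `\|` as the only escape sequences; the function name `cookedValue` is hypothetical.

```typescript
// Sketch under assumptions: quoted literals look like |...|,
// and the only escapes inside them are \\ and \|.
function cookedValue(source: string): string {
  const quoted =
    source.length >= 2 && source.startsWith('|') && source.endsWith('|');
  if (!quoted) return source; // unquoted literals carry no external syntax
  const body = source.slice(1, -1); // drop the surrounding quotes
  return body.replace(/\\([\\|])/g, '$1'); // resolve each escape to its character
}

// The JS analogue mentioned above: the \u20ac escape in source code
// and the character itself are the same code point in the string value.
const euroFromEscape = '\u20ac';
const euroDirect = '€';

console.log(cookedValue('|a\\|b|')); // source text |a\|b| resolves to a|b
console.log(euroFromEscape === euroDirect);
```

Under these assumptions, `hello` and `|hello|` produce the same value, which is the property the suggested wording is describing.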

> If there is a desire to describe it as an explicit result of parsing the syntax, as suggested by the term "replaces" here, that's a much bigger change that ought to be accompanied by some additional documentation.

I think we should stipulate that the data model representation can round-trip any MF2 string without loss of information. Doing so would canonicalize the representation, including syntax (non-literal) whitespace and the presence or absence of quotes around literals, so the resulting round-trip string might not be a character-by-character match to the original.
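The round-trip property can be sketched for literals alone. This TypeScript example assumes the same `|...|` quoting and `\\` / `\|` escapes as the draft syntax of the time, and it assumes a serializer that always emits the quoted form as its canonical output; `parseLiteral` and `serializeLiteral` are hypothetical names, not part of any specified API.

```typescript
interface Literal {
  value: string;
}

// Hypothetical parser for a single literal (assumed |...| quoting).
function parseLiteral(source: string): Literal {
  const quoted =
    source.length >= 2 && source.startsWith('|') && source.endsWith('|');
  const body = quoted ? source.slice(1, -1) : source;
  return { value: quoted ? body.replace(/\\([\\|])/g, '$1') : body };
}

// Hypothetical serializer: always quotes, escaping backslash and pipe.
// Always quoting is one possible canonical form, chosen here for simplicity.
function serializeLiteral(lit: Literal): string {
  return '|' + lit.value.replace(/[\\|]/g, '\\$&') + '|';
}

const original = 'hello'; // unquoted in the source
const roundTripped = serializeLiteral(parseLiteral(original));

console.log(roundTripped === original); // the text differs from the source
console.log(parseLiteral(roundTripped).value === parseLiteral(original).value);
```

The serialized text is not identical to the source, yet parsing it again yields the same `value`, so no information that the data model represents is lost.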

Co-authored-by: Addison Phillips <addisonI18N@gmail.com>

ryzokuken

In the absence of this flag in the data model, do we still need to make this explicit distinction between "quoted" and "unquoted" values?

Literal represents all literal values, both quoted and unquoted.
The presence or absence of quotes is not preserved by the data model.

If this is implied by the absence of the flag, then perhaps it's more confusing to leave it in as opposed to just dropping these lines.

eemeli · 2023-08-04T16:15:29Z

> In the absence of this flag in the data model, do we still need to make this explicit distinction between "quoted" and "unquoted" values?

I think it's good to include, to clarify that the mapping of these potentially separately representable syntax rules into a single data model interface is wholly intentional.

Drop the Literal quoted flag from the data model

e5d9374

eemeli added the `data model` label (Issues related to the Interchange Data Model) Jul 29, 2023

aphillips requested changes Jul 29, 2023

Update spec/data-model.md

83ba735

Co-authored-by: Addison Phillips <addisonI18N@gmail.com>

eemeli requested a review from aphillips August 1, 2023 15:13

ryzokuken approved these changes Aug 4, 2023

aphillips merged commit 6ae5373 into unicode-org:main Aug 7, 2023

eemeli deleted the no-quoted branch August 7, 2023 20:00

eemeli added a commit to messageformat/messageformat that referenced this pull request Aug 13, 2023

feat(mf2): Drop quoted from Literal, as per unicode-org/message-forma…

ffd6b0a

…t-wg#443

eemeli mentioned this pull request Feb 19, 2024

Drop "quoted" remnants from JSON Schema & DTD data models #672

Merged

XM5jDcsHTyGJtQqlCi added a commit to XM5jDcsHTyGJtQqlCi/messageformat that referenced this pull request Oct 12, 2025

feat(mf2): Drop quoted from Literal, as per unicode-org/message-forma…

2912037

…t-wg#443
