Does ixml have to match the whole input? #24

cmsmcq · 2022-01-11T00:24:03Z

Is an ixml processor obliged to match the whole input against the specified grammar and flag an error otherwise? Can a conforming processor match a prefix of the input and return success? See

and ensuing threads.

cmsmcq · 2022-02-07T16:38:09Z

It may be helpful, in discussions of this issue, to bear in mind what the spec currently says about it. The last item in the spec's definition of processor conformance reads:

The processor must by default parse the input in its entirety against the grammar and return either a parse tree or a failure document. Processors may provide user options for other behaviors, such as parsing the largest, or smallest, prefix of the input that is described by the grammar, or supporting invocation with input streams of indeterminate length.

That suggests to this reader that the question Steven asked when raising this issue already has an answer. He asked:

If a parse succeeds without using all the available input, should that be reported as a parse error, or as an ixml:state="incomplete" (or something similar).

According to the current spec, I think the answer is:

The processor may provide a user option for handling cases of success over part of the input as a parse with ixml:state="incomplete" or something similar.
The processor need not do so.
Unless the processor has provided and the user activated such a user option, the parser should report a parsing failure.

Given that as the status quo, I suppose Steven's question (and thus this issue) should be read as a suggestion that we change some aspect of these rules, but without a specific proposal for what to change. None of the changes I have been able to imagine seem to me an improvement on the status quo, but I may be persuadable.

spemberton · 2022-02-08T14:28:57Z

My question was indeed a suggestion.

My proposal is to add a distinguishing "fail" state, for instance "incomplete" or "prefix" that indicates that while the input was not fully consumed, there was a prefix of the input that satisfied the grammar (and here is the tree for that input).

So "fail" gets split into two:
fail: no parse
incomplete: a prefix of the input succeeded.

cmsmcq · 2022-02-14T23:36:23Z

This issue was discussed on the call of 8 February. The upshot was that it should continue to be legal for processors to offer a user option to report matches against a prefix of the input string, and that it should continue not to be required. The marking ixml:status="prefix" (or possibly some other keyword value, tbd) should be defined for use in this situation.

This involved the proposed text and discussion at https://www.w3.org/2022/02/15-ixml-minutes.html#t04 It also adds proposed text for @ handling on the root node.

spemberton · 2022-02-22T14:06:09Z

New adopted text added to the spec, and issue closed.

spemberton added a commit that referenced this issue Feb 22, 2022

ACTION: integrate resolution #24 into the spec

ab7ccd0

This involved the proposed text and discussion at https://www.w3.org/2022/02/15-ixml-minutes.html#t04 It also adds proposed text for @ handling on the root node.

spemberton closed this as completed Feb 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does ixml have to match the whole input? #24

Does ixml have to match the whole input? #24

cmsmcq commented Jan 11, 2022 •

edited

cmsmcq commented Feb 7, 2022

spemberton commented Feb 8, 2022

cmsmcq commented Feb 14, 2022

spemberton commented Feb 22, 2022

Does ixml have to match the whole input? #24

Does ixml have to match the whole input? #24

Comments

cmsmcq commented Jan 11, 2022 • edited

cmsmcq commented Feb 7, 2022

spemberton commented Feb 8, 2022

cmsmcq commented Feb 14, 2022

spemberton commented Feb 22, 2022

cmsmcq commented Jan 11, 2022 •

edited