Normalization of XML #5

note · 2017-06-22T11:13:01Z

At some point we will want to have reasonable output. Outside of pure formatting aspect it would be nice to e.g. try to avoid multiple namespace declarations for the same namespaces. Probably all namespace declarations should be moved to root element.

Such operations should be optional - there may be some cases when user want to avoid unneccessary transformations as want to have output as much similar to input as it's possible.

There's an example of such behavior (namely - many namespace declarations for one namespace) in test replaceOrAddAttr for ResolvedNameMatcher in OpticsBuilderSpec

The text was updated successfully, but these errors were encountered:

note · 2017-12-20T08:18:01Z

After some thought - I think normalization will be actually more useful for being sure that declarations of used namespaces in fact exist in XML document. Also, as normalization may be a costly operation so it may a good idea to do it when parsing.

Idea sketch:

Add def parseNormalized: Either[FailType, (XmlDocument, Set[Namespace]). Besides of returning set of defined namespaces (which is reflected on return type) it would also move all namespace declarations to root element. Then, if user is not interested in adding new namespace and just working on already defined he can use Set[Namespace] returned by mentioned method for e.g. creating new elements.

We can have a symmetrical print method e.g. def printNormalized(doc: XmlDocument, namespaces: Set[Namespace]) which would print all namespaces in root element.

The problem with that idea is that it still relies on assumption that user uses just namespaces returned by parseNormalized. The API itself would not restrict him to e.g. add some element with completely different (and potentially not declared) namespace.

Another idea would be to use path dependant types to restrict user to use just declared namespaces. The disadvantage may be that AST itself would probably need to carry that info on typelevel. Reasonable solution would be to allow for arbitrary namespaces usages on AST and optics level and do the whole normalization thing (more precisely - restricting usages of namespaces just to declared ones) on DSL level.

note mentioned this issue Jun 22, 2017

Reasonable equal implementation for Node #7

Open

note added the release label Dec 20, 2017

note removed the release label Dec 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalization of XML #5

Normalization of XML #5

note commented Jun 22, 2017

note commented Dec 20, 2017 •

edited

Normalization of XML #5

Normalization of XML #5

Comments

note commented Jun 22, 2017

note commented Dec 20, 2017 • edited

note commented Dec 20, 2017 •

edited