Illegal whitespace created during TEI export #74

Closed
dvglc opened this issue Oct 19, 2019 · 3 comments

@dvglc
Contributor

dvglc commented Oct 19, 2019

Observed in W0034:

```xml
                <tei:g ref="#char017f">ſ</tei:g>onas excelentes <tei:choice xml:id="W0034-00-0003-ce-041a">
                    <tei:abbr>
                        <tei:g ref="#char204a">⁊</tei:g>
                    </tei:abbr>
                    <tei:expan resp="#CB" cert="high">y</tei:expan>
                </tei:choice> yo lo he comen<tei:g ref="#charc0327">ç</tei:g>ado a collegir delo
```

Apparently eXist tries to pretty-print something here, but doing so in mixed-content elements such as tei:abbr is clearly illegal, since it adds whitespace text nodes where there were none before. An XML-based solution might be @xml:space="preserve".
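To make the effect concrete, here is a minimal Python sketch (not eXist, and not the application's export code) showing how a generic pretty-printer changes the text content of a mixed-content element. The fragment is a simplified, hypothetical stand-in for the W0034 snippet with namespace prefixes omitted, and `abbr_text` is a helper invented for this illustration.

```python
import xml.dom.minidom as minidom
import xml.etree.ElementTree as ET

# Hypothetical, simplified fragment modelled on the W0034 snippet
# (namespace prefixes omitted for brevity).
SOURCE = (
    '<p>comen<g ref="#charc0327">ç</g>ado '
    '<choice><abbr><g ref="#char204a">⁊</g></abbr><expan>y</expan></choice>'
    ' yo</p>'
)


def abbr_text(xml_string):
    """Return the concatenated text content of the first <abbr> element."""
    root = ET.fromstring(xml_string)
    return "".join(root.find(".//abbr").itertext())


# Pretty-print with a generic indenter that knows nothing about mixed content.
indented = minidom.parseString(SOURCE).documentElement.toprettyxml(indent="    ")

before = abbr_text(SOURCE)
after = abbr_text(indented)

# ascii() escapes newlines and non-ASCII characters, so added whitespace is visible.
print(ascii(before))
print(ascii(after))
print("text content changed:", before != after)
```

In this sketch the glyph inside `abbr` ends up surrounded by newline and indentation characters after pretty-printing, which is the same kind of change seen in the export above.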

@dvglc dvglc added the bug Something isn't working label Oct 19, 2019
@dvglc dvglc added this to the v1.6 milestone Oct 19, 2019
@dvglc dvglc self-assigned this Oct 19, 2019
@awagner-mainz
Member

More often than not, I've seen this in oXygen rather than in eXist-db. It may be worth checking that all formatting and pretty-printing is disabled there, too. I've tried to provide for that as far as I could via the svsal-framework, but you never know how individual installations are configured and whether a local option overrides it.

@dvglc
Contributor Author

dvglc commented Oct 19, 2019

oXygen might also be problematic with regard to pretty-printing, but this one occurred while requesting TEI from our application: http://id.salamanca.school/texts/W0034?format=tei. The data for W0034 in the svsal-tei package/app is okay, so something must be happening while the data is exported. (Another indicator that eXist is doing additional processing is that some elements suddenly have tei: prefixes after the export, which isn't really an issue, but is somewhat strange nonetheless...)

@dvglc
Contributor Author

dvglc commented Nov 20, 2019

27bbed7 is not an elegant solution, since it globally suppresses indentation in the document, but serialization parameters in util:expand() have proven not to work as described in the eXist docs (at least not when we use util:declare-option() at the same time).
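For comparison, a more selective approach than switching indentation off globally would be an indenter that honors `xml:space="preserve"`, as suggested earlier in this thread. The following is a minimal Python sketch of that idea only; it is not eXist's serializer and not what 27bbed7 does. The function names and the sample fragment are hypothetical, and namespace prefixes are omitted.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:space under the fixed XML namespace URI.
XML_SPACE = "{http://www.w3.org/XML/1998/namespace}space"


def has_text(value):
    """True if value contains any non-whitespace character."""
    return bool(value and value.strip())


def indent_respecting_space(elem, level=0, step="  "):
    """Indent elem in place, leaving some subtrees untouched.

    Skipped entirely: leaves, elements marked xml:space="preserve",
    and elements that visibly contain text (mixed content).
    """
    if len(elem) == 0 or elem.get(XML_SPACE) == "preserve":
        return
    if has_text(elem.text) or any(has_text(child.tail) for child in elem):
        return
    child_pad = "\n" + step * (level + 1)
    elem.text = child_pad
    for child in elem:
        indent_respecting_space(child, level + 1, step)
        child.tail = child_pad
    elem[-1].tail = "\n" + step * level


def abbr_text(xml_string):
    """Indent the fragment, then return the text content of its <abbr> child."""
    root = ET.fromstring(xml_string)
    indent_respecting_space(root)
    return "".join(root.find("abbr").itertext())


# Hypothetical fragments: identical except for the xml:space attribute.
UNMARKED = '<choice><abbr><g ref="#char204a">⁊</g></abbr><expan>y</expan></choice>'
MARKED = UNMARKED.replace("<abbr>", '<abbr xml:space="preserve">')

print(ascii(abbr_text(UNMARKED)))  # whitespace is added around the glyph
print(ascii(abbr_text(MARKED)))    # the glyph alone
```

Note that inspecting the instance alone is not enough here: an `abbr` whose only child is a `g` element looks like element-only content, so without the attribute (or schema knowledge) the sketch still indents it.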

@dvglc dvglc closed this as completed in 27bbed7 Jan 30, 2020
@awagner-mainz awagner-mainz modified the milestones: v1.6, v2.0 Mar 9, 2020