Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[tei2folia] Convert w@norm, w@join, fix list handling and various conversion problems #12

Closed
proycon opened this issue Aug 17, 2020 · 1 comment
Assignees
Labels
bug Something isn't working enhancement New feature or request ready
Milestone

Comments

@proycon
Copy link
Owner

proycon commented Aug 17, 2020

Some issues arose trying to convert a TEI document to FoLiA: http://www.deutschestextarchiv.de/book/show/wolff_anfangsgruende01_1710

There a lot of elements that can not be converted:

  • [tei2folia WARNING] Unhandled tag in structure context: s (in p) (I wonder what triggers this because this seems rather basi
  • [tei2folia WARNING] Unhandled tag in structure context: fw (in div) (we don't handle fw yet)
  • [tei2folia WARNING] Unknown tag in structure context: s (in item) (list processing seems to go wrong)

It seems this document diverges from the TEI collections we were used to hitherto.

In addition, there are interesting extra attributes in this document that should be converted, such as norm and join (on w)

@proycon proycon self-assigned this Aug 17, 2020
@proycon proycon changed the title [folia2tei] Convert w@norm, w@join and fix list handling [folia2tei] Convert w@norm, w@join, fix list handling and various conversion problems Aug 17, 2020
@proycon proycon changed the title [folia2tei] Convert w@norm, w@join, fix list handling and various conversion problems [tei2folia] Convert w@norm, w@join, fix list handling and various conversion problems Aug 17, 2020
@proycon
Copy link
Owner Author

proycon commented Aug 17, 2020

This TEI dialect is documented here: http://www.deutschestextarchiv.de/doku/basisformat/

proycon added a commit that referenced this issue Aug 17, 2020
… ISBN, then DTADirName (specific to DTA), fall back to untyped but check we get something sane out of it. #12
proycon added a commit that referenced this issue Aug 17, 2020
…ly according to TEI P5 spec but Deutsches Text Archiv uses it. #12
@proycon proycon added bug Something isn't working enhancement New feature or request labels Aug 17, 2020
proycon added a commit that referenced this issue Aug 17, 2020
proycon added a commit that referenced this issue Aug 17, 2020
…, for now.. hard to convert with XSLT.. postprocessor may be able to reapply it #12
@proycon proycon added this to the v2.3.0 milestone Aug 19, 2020
@proycon proycon added the ready label Aug 19, 2020
@proycon proycon closed this as completed Sep 10, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working enhancement New feature or request ready
Projects
None yet
Development

No branches or pull requests

1 participant