Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

.docx to TEI P5 XML Document conversion fails #405

Open
fricke-steyer opened this issue Nov 19, 2019 · 3 comments
Open

.docx to TEI P5 XML Document conversion fails #405

fricke-steyer opened this issue Nov 19, 2019 · 3 comments
Assignees
Labels
conversion: docx resp: helpWanted No-one on Council has the expertise required to deal with this issue. type: bug A bug report.

Comments

@fricke-steyer
Copy link

Can you help me? Our other files are ok, only this one doesn't work. Whats wrong?
Kind regards,
Henrike

emotion_analysis_2019.docx

Error occured. Please check the filetype and try again.?

Error: class pl.psnc.dl.ege.exception.ConverterException

Processing terminated by xsl:message at line 130 in fields.xsl

@peterstadler
Copy link
Member

I did a little debugging and the error I get (from running on the command line) is

 fldSimple: unrecognized type REF BMfig_wheel \* MERGEFORMAT 

This originates from the word file here:

<w:fldSimple w:instr="REF BMfig_wheel \* MERGEFORMAT ">
    <w:r w:rsidRPr="005B4B5A">
        <w:rPr>
            <w:rStyle w:val="AbbVerweiszfdgZchn"/>
        </w:rPr>
        <w:t>1</w:t>
    </w:r>
</w:fldSimple>

-- which is the "1" reference in "The wheel (Figure 1) is constructed …"

I'm no docx expert, so I do not know which (arcane) feature this is and how to treat it right. Hence, I'd like to close it here and move it to the Stylesheets issues if anyone thinks we should follow up on this?!

@peterstadler peterstadler transferred this issue from TEIC/oxgarage Dec 11, 2019
@peterstadler peterstadler added the type: bug A bug report. label Dec 11, 2019
@lb42
Copy link
Member

lb42 commented Dec 11, 2019 via email

@peterstadler peterstadler self-assigned this Apr 22, 2020
@sydb sydb added conversion: docx resp: helpWanted No-one on Council has the expertise required to deal with this issue. labels May 26, 2020
@TomazErjavec
Copy link
Contributor

Rather than opening a new issue, I post here another Word file that causes the Stylesheets to fail. At first glance it looks easier to fix than the previous one, the error is:

A sequence of more than one item is not allowed as the first argument of fn:starts-with() ("VAROVALKE_1_brez ozadja copy", "VAROVALKE_2_brez ozadja copy") ; SystemID: file:/project/tei/convert/Stylesheets/docx/from/graphics.xsl; Line#: 83; Column#: 12

TEI_Stylesheet_crash-test.docx

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
conversion: docx resp: helpWanted No-one on Council has the expertise required to deal with this issue. type: bug A bug report.
Projects
None yet
Development

No branches or pull requests

5 participants