Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text from Shape Format is not extracted #9214

Closed
StephanMeijer opened this issue Nov 24, 2023 · 6 comments
Closed

Text from Shape Format is not extracted #9214

StephanMeijer opened this issue Nov 24, 2023 · 6 comments
Labels

Comments

@StephanMeijer
Copy link
Contributor

Explain the problem.

Text in Shape Format is not extracted

Example:

Screenshot Example of Document with text in a Shape Format
document.xml
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<w:document xmlns:wpc="http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas" xmlns:cx="http://schemas.microsoft.com/office/drawing/2014/chartex" xmlns:cx1="http://schemas.microsoft.com/office/drawing/2015/9/8/chartex" xmlns:cx2="http://schemas.microsoft.com/office/drawing/2015/10/21/chartex" xmlns:cx3="http://schemas.microsoft.com/office/drawing/2016/5/9/chartex" xmlns:cx4="http://schemas.microsoft.com/office/drawing/2016/5/10/chartex" xmlns:cx5="http://schemas.microsoft.com/office/drawing/2016/5/11/chartex" xmlns:cx6="http://schemas.microsoft.com/office/drawing/2016/5/12/chartex" xmlns:cx7="http://schemas.microsoft.com/office/drawing/2016/5/13/chartex" xmlns:cx8="http://schemas.microsoft.com/office/drawing/2016/5/14/chartex" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:aink="http://schemas.microsoft.com/office/drawing/2016/ink" xmlns:am3d="http://schemas.microsoft.com/office/drawing/2017/model3d" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:oel="http://schemas.microsoft.com/office/2019/extlst" xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships" xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:wp14="http://schemas.microsoft.com/office/word/2010/wordprocessingDrawing" xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" xmlns:w14="http://schemas.microsoft.com/office/word/2010/wordml" xmlns:w15="http://schemas.microsoft.com/office/word/2012/wordml" xmlns:w16cex="http://schemas.microsoft.com/office/word/2018/wordml/cex" xmlns:w16cid="http://schemas.microsoft.com/office/word/2016/wordml/cid" xmlns:w16="http://schemas.microsoft.com/office/word/2018/wordml" xmlns:w16du="http://schemas.microsoft.com/office/word/2023/wordml/word16du" xmlns:w16sdtdh="http://schemas.microsoft.com/office/word/2020/wordml/sdtdatahash" xmlns:w16se="http://schemas.microsoft.com/office/word/2015/wordml/symex" xmlns:wpg="http://schemas.microsoft.com/office/word/2010/wordprocessingGroup" xmlns:wpi="http://schemas.microsoft.com/office/word/2010/wordprocessingInk" xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml" xmlns:wps="http://schemas.microsoft.com/office/word/2010/wordprocessingShape" mc:Ignorable="w14 w15 w16se w16cid w16 w16cex w16sdtdh wp14">
  <w:body>
    <w:p w:rsidR="000532D5" w:rsidRDefault="00BC3ACB" w:rsidP="000A1B92">
      <w:pPr>
        <w:jc w:val="left"/>
      </w:pPr>
      <w:r>
        <w:rPr>
          <w:noProof/>
          <w:lang w:val="et-EE" w:eastAsia="et-EE"/>
        </w:rPr>
        <mc:AlternateContent>
          <mc:Choice Requires="wps">
            <w:drawing>
              <wp:anchor distT="0" distB="0" distL="114300" distR="114300" simplePos="0" relativeHeight="251657728" behindDoc="0" locked="0" layoutInCell="1" allowOverlap="0" wp14:anchorId="5AB6B5C0" wp14:editId="2E675B58">
                <wp:simplePos x="0" y="0"/>
                <wp:positionH relativeFrom="column">
                  <wp:align>center</wp:align>
                </wp:positionH>
                <wp:positionV relativeFrom="margin">
                  <wp:align>bottom</wp:align>
                </wp:positionV>
                <wp:extent cx="5759450" cy="385200"/>
                <wp:effectExtent l="0" t="0" r="0" b="0"/>
                <wp:wrapNone/>
                <wp:docPr id="41" name="Text Box 69"/>
                <wp:cNvGraphicFramePr>
                  <a:graphicFrameLocks xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main"/>
                </wp:cNvGraphicFramePr>
                <a:graphic xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main">
                  <a:graphicData uri="http://schemas.microsoft.com/office/word/2010/wordprocessingShape">
                    <wps:wsp>
                      <wps:cNvSpPr txBox="1">
                        <a:spLocks noChangeArrowheads="1"/>
                      </wps:cNvSpPr>
                      <wps:spPr bwMode="auto">
                        <a:xfrm>
                          <a:off x="0" y="0"/>
                          <a:ext cx="5759450" cy="385200"/>
                        </a:xfrm>
                        <a:prstGeom prst="rect">
                          <a:avLst/>
                        </a:prstGeom>
                        <a:noFill/>
                        <a:ln>
                          <a:noFill/>
                        </a:ln>
                      </wps:spPr>
                      <wps:txbx>
                        <w:txbxContent>
                          <w:p w:rsidR="005D45CD" w:rsidRPr="005E55F3" w:rsidRDefault="005D45CD" w:rsidP="007B7208">
                            <w:pPr>
                              <w:jc w:val="center"/>
                              <w:rPr>
                                <w:lang w:val="et-EE"/>
                              </w:rPr>
                            </w:pPr>
                            <w:r w:rsidRPr="005E55F3">
                              <w:t xml:space="preserve">Last update: </w:t>
                            </w:r>
                            <w:r>
                              <w:fldChar w:fldCharType="begin"/>
                            </w:r>
                            <w:r>
                              <w:instrText xml:space="preserve"> SAVEDATE  \@ "MMMM d, yyyy"  \* MERGEFORMAT </w:instrText>
                            </w:r>
                            <w:r>
                              <w:fldChar w:fldCharType="separate"/>
                            </w:r>
                            <w:r w:rsidR="00310322">
                              <w:rPr>
                                <w:noProof/>
                              </w:rPr>
                              <w:t>May 1, 2017</w:t>
                            </w:r>
                            <w:r>
                              <w:fldChar w:fldCharType="end"/>
                            </w:r>
                          </w:p>
                        </w:txbxContent>
                      </wps:txbx>
                      <wps:bodyPr rot="0" vert="horz" wrap="square" lIns="36000" tIns="36000" rIns="36000" bIns="36000" anchor="t" anchorCtr="0" upright="1">
                        <a:noAutofit/>
                      </wps:bodyPr>
                    </wps:wsp>
                  </a:graphicData>
                </a:graphic>
                <wp14:sizeRelH relativeFrom="margin">
                  <wp14:pctWidth>100000</wp14:pctWidth>
                </wp14:sizeRelH>
                <wp14:sizeRelV relativeFrom="page">
                  <wp14:pctHeight>0</wp14:pctHeight>
                </wp14:sizeRelV>
              </wp:anchor>
            </w:drawing>
          </mc:Choice>
          <mc:Fallback>
            <w:pict>
              <v:shapetype w14:anchorId="5AB6B5C0" id="_x0000_t202" coordsize="21600,21600" o:spt="202" path="m,l,21600r21600,l21600,xe">
                <v:stroke joinstyle="miter"/>
                <v:path gradientshapeok="t" o:connecttype="rect"/>
              </v:shapetype>
              <v:shape id="Text Box 69" o:spid="_x0000_s1026" type="#_x0000_t202" style="position:absolute;margin-left:0;margin-top:0;width:453.5pt;height:30.35pt;z-index:251657728;visibility:visible;mso-wrap-style:square;mso-width-percent:1000;mso-height-percent:0;mso-wrap-distance-left:9pt;mso-wrap-distance-top:0;mso-wrap-distance-right:9pt;mso-wrap-distance-bottom:0;mso-position-horizontal:center;mso-position-horizontal-relative:text;mso-position-vertical:bottom;mso-position-vertical-relative:margin;mso-width-percent:1000;mso-height-percent:0;mso-width-relative:margin;mso-height-relative:page;v-text-anchor:top" o:gfxdata="UEsDBBQABgAIAAAAIQC2gziS/gAAAOEBAAATAAAAW0NvbnRlbnRfVHlwZXNdLnhtbJSRQU7DMBBF&#13;&#10;90jcwfIWJU67QAgl6YK0S0CoHGBkTxKLZGx5TGhvj5O2G0SRWNoz/78nu9wcxkFMGNg6quQqL6RA&#13;&#10;0s5Y6ir5vt9lD1JwBDIwOMJKHpHlpr69KfdHjyxSmriSfYz+USnWPY7AufNIadK6MEJMx9ApD/oD&#13;&#10;OlTrorhX2lFEilmcO2RdNtjC5xDF9pCuTyYBB5bi6bQ4syoJ3g9WQ0ymaiLzg5KdCXlKLjvcW893&#13;&#10;SUOqXwnz5DrgnHtJTxOsQfEKIT7DmDSUCaxw7Rqn8787ZsmRM9e2VmPeBN4uqYvTtW7jvijg9N/y&#13;&#10;JsXecLq0q+WD6m8AAAD//wMAUEsDBBQABgAIAAAAIQA4/SH/1gAAAJQBAAALAAAAX3JlbHMvLnJl&#13;&#10;bHOkkMFqwzAMhu+DvYPRfXGawxijTi+j0GvpHsDYimMaW0Yy2fr2M4PBMnrbUb/Q94l/f/hMi1qR&#13;&#10;JVI2sOt6UJgd+ZiDgffL8ekFlFSbvV0oo4EbChzGx4f9GRdb25HMsYhqlCwG5lrLq9biZkxWOiqY&#13;&#10;22YiTra2kYMu1l1tQD30/bPm3wwYN0x18gb45AdQl1tp5j/sFB2T0FQ7R0nTNEV3j6o9feQzro1i&#13;&#10;OWA14Fm+Q8a1a8+Bvu/d/dMb2JY5uiPbhG/ktn4cqGU/er3pcvwCAAD//wMAUEsDBBQABgAIAAAA&#13;&#10;IQBpbiVD2wEAAKEDAAAOAAAAZHJzL2Uyb0RvYy54bWysU9tu2zAMfR+wfxD0vthpl64z4hRdiw4D&#13;&#10;ugvQ7gNoWbKN2aJGKbGzrx8lp2m2vg17EURSPjznkF5fTUMvdpp8h7aUy0UuhbYK6842pfz+ePfm&#13;&#10;UgofwNbQo9Wl3GsvrzavX61HV+gzbLGvNQkGsb4YXSnbEFyRZV61egC/QKctFw3SAIFDarKaYGT0&#13;&#10;oc/O8vwiG5FqR6i095y9nYtyk/CN0Sp8NcbrIPpSMreQTkpnFc9ss4aiIXBtpw404B9YDNBZbnqE&#13;&#10;uoUAYkvdC6ihU4QeTVgoHDI0plM6aWA1y/wvNQ8tOJ20sDneHW3y/w9Wfdk9uG8kwvQBJx5gEuHd&#13;&#10;PaofXli8acE2+poIx1ZDzY2X0bJsdL44fBqt9oWPINX4GWseMmwDJqDJ0BBdYZ2C0XkA+6PpegpC&#13;&#10;cXL1bvX+7YpLimvnlyueamoBxdPXjnz4qHEQ8VJK4qEmdNjd+xDZQPH0JDazeNf1fRpsb/9I8MOY&#13;&#10;Sewj4Zl6mKqJX0cVFdZ71kE47wnvNV9apF9SjLwjpfQ/t0Baiv6TZS/OL3ImK8JpQKdBdRqAVQxV&#13;&#10;yiDFfL0J8yJuHXVNy51m9y1es3+mS9KeWR148x4kxYedjYt2GqdXz3/W5jcAAAD//wMAUEsDBBQA&#13;&#10;BgAIAAAAIQAImBBo3gAAAAkBAAAPAAAAZHJzL2Rvd25yZXYueG1sTI9BS8NAEIXvgv9hGcGb3VUh&#13;&#10;rWk2RYziRVBrofa2zY5JMDubZLdN/PeOXvTy4PGYN+/LVpNrxRGH0HjScDlTIJBKbxuqNGzeHi4W&#13;&#10;IEI0ZE3rCTV8YYBVfnqSmdT6kV7xuI6V4BIKqdFQx9ilUoayRmfCzHdInH34wZnIdqikHczI5a6V&#13;&#10;V0ol0pmG+ENtOryrsfxcH5yG4pHU+/PY74rd08s9Xi+Sflv2Wp+fTcWS5XYJIuIU/y7gh4H3Q87D&#13;&#10;9v5ANohWA9PEX+XsRs3Z7jUkag4yz+R/gvwbAAD//wMAUEsBAi0AFAAGAAgAAAAhALaDOJL+AAAA&#13;&#10;4QEAABMAAAAAAAAAAAAAAAAAAAAAAFtDb250ZW50X1R5cGVzXS54bWxQSwECLQAUAAYACAAAACEA&#13;&#10;OP0h/9YAAACUAQAACwAAAAAAAAAAAAAAAAAvAQAAX3JlbHMvLnJlbHNQSwECLQAUAAYACAAAACEA&#13;&#10;aW4lQ9sBAAChAwAADgAAAAAAAAAAAAAAAAAuAgAAZHJzL2Uyb0RvYy54bWxQSwECLQAUAAYACAAA&#13;&#10;ACEACJgQaN4AAAAJAQAADwAAAAAAAAAAAAAAAAA1BAAAZHJzL2Rvd25yZXYueG1sUEsFBgAAAAAE&#13;&#10;AAQA8wAAAEAFAAAAAA==&#13;&#10;" o:allowoverlap="f" filled="f" stroked="f">
                <v:textbox inset="1mm,1mm,1mm,1mm">
                  <w:txbxContent>
                    <w:p w:rsidR="005D45CD" w:rsidRPr="005E55F3" w:rsidRDefault="005D45CD" w:rsidP="007B7208">
                      <w:pPr>
                        <w:jc w:val="center"/>
                        <w:rPr>
                          <w:lang w:val="et-EE"/>
                        </w:rPr>
                      </w:pPr>
                      <w:r w:rsidRPr="005E55F3">
                        <w:t xml:space="preserve">Last update: </w:t>
                      </w:r>
                      <w:r>
                        <w:fldChar w:fldCharType="begin"/>
                      </w:r>
                      <w:r>
                        <w:instrText xml:space="preserve"> SAVEDATE  \@ "MMMM d, yyyy"  \* MERGEFORMAT </w:instrText>
                      </w:r>
                      <w:r>
                        <w:fldChar w:fldCharType="separate"/>
                      </w:r>
                      <w:r w:rsidR="00310322">
                        <w:rPr>
                          <w:noProof/>
                        </w:rPr>
                        <w:t>May 1, 2017</w:t>
                      </w:r>
                      <w:r>
                        <w:fldChar w:fldCharType="end"/>
                      </w:r>
                    </w:p>
                  </w:txbxContent>
                </v:textbox>
                <w10:wrap anchory="margin"/>
              </v:shape>
            </w:pict>
          </mc:Fallback>
        </mc:AlternateContent>
      </w:r>
      <w:r>
        <w:rPr>
          <w:noProof/>
          <w:lang w:val="et-EE" w:eastAsia="et-EE"/>
        </w:rPr>
        <mc:AlternateContent>
          <mc:Choice Requires="wps">
            <w:drawing>
              <wp:anchor distT="0" distB="0" distL="114300" distR="114300" simplePos="0" relativeHeight="251656704" behindDoc="0" locked="0" layoutInCell="1" allowOverlap="0" wp14:anchorId="4E281B94" wp14:editId="73B6EA15">
                <wp:simplePos x="0" y="0"/>
                <wp:positionH relativeFrom="column">
                  <wp:align>center</wp:align>
                </wp:positionH>
                <wp:positionV relativeFrom="margin">
                  <wp:align>center</wp:align>
                </wp:positionV>
                <wp:extent cx="5759450" cy="2070000"/>
                <wp:effectExtent l="0" t="0" r="0" b="6985"/>
                <wp:wrapNone/>
                <wp:docPr id="40" name="Text Box 68"/>
                <wp:cNvGraphicFramePr>
                  <a:graphicFrameLocks xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main"/>
                </wp:cNvGraphicFramePr>
                <a:graphic xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main">
                  <a:graphicData uri="http://schemas.microsoft.com/office/word/2010/wordprocessingShape">
                    <wps:wsp>
                      <wps:cNvSpPr txBox="1">
                        <a:spLocks noChangeArrowheads="1"/>
                      </wps:cNvSpPr>
                      <wps:spPr bwMode="auto">
                        <a:xfrm>
                          <a:off x="0" y="0"/>
                          <a:ext cx="5759450" cy="2070000"/>
                        </a:xfrm>
                        <a:prstGeom prst="rect">
                          <a:avLst/>
                        </a:prstGeom>
                        <a:noFill/>
                        <a:ln>
                          <a:noFill/>
                        </a:ln>
                      </wps:spPr>
                      <wps:txbx>
                        <w:txbxContent>
                          <w:p w:rsidR="005D45CD" w:rsidRPr="00255E3F" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                            <w:pPr>
                              <w:jc w:val="center"/>
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="36"/>
                                <w:szCs w:val="36"/>
                              </w:rPr>
                            </w:pPr>
                            <w:r w:rsidRPr="00255E3F">
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="44"/>
                                <w:szCs w:val="44"/>
                              </w:rPr>
                              <w:t>U</w:t>
                            </w:r>
                            <w:bookmarkStart w:id="0" w:name="_Ref219403712"/>
                            <w:bookmarkEnd w:id="0"/>
                            <w:r>
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="44"/>
                                <w:szCs w:val="44"/>
                              </w:rPr>
                              <w:t xml:space="preserve">sing Microsoft Word </w:t>
                            </w:r>
                            <w:r w:rsidRPr="00255E3F">
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="44"/>
                                <w:szCs w:val="44"/>
                              </w:rPr>
                              <w:t>200</w:t>
                            </w:r>
                            <w:r>
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="44"/>
                                <w:szCs w:val="44"/>
                              </w:rPr>
                              <w:t>7/2010</w:t>
                            </w:r>
                            <w:r w:rsidRPr="00255E3F">
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="44"/>
                                <w:szCs w:val="44"/>
                              </w:rPr>
                              <w:br/>
                            </w:r>
                            <w:r w:rsidRPr="00255E3F">
                              <w:rPr>
                                <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                                <w:sz w:val="32"/>
                                <w:szCs w:val="32"/>
                              </w:rPr>
                              <w:t>for Writing Technical Documents</w:t>
                            </w:r>
                          </w:p>
                          <w:p w:rsidR="005D45CD" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                            <w:pPr>
                              <w:jc w:val="center"/>
                            </w:pPr>
                            <w:r w:rsidRPr="00255E3F">
                              <w:t>Valter Kiisk</w:t>
                            </w:r>
                          </w:p>
                          <w:p w:rsidR="005D45CD" w:rsidRPr="00255E3F" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                            <w:pPr>
                              <w:jc w:val="center"/>
                            </w:pPr>
                            <w:r>
                              <w:t>Institute of Physics, University of Tartu</w:t>
                            </w:r>
                          </w:p>
                        </w:txbxContent>
                      </wps:txbx>
                      <wps:bodyPr rot="0" vert="horz" wrap="square" lIns="36000" tIns="36000" rIns="36000" bIns="36000" anchor="t" anchorCtr="0" upright="1">
                        <a:noAutofit/>
                      </wps:bodyPr>
                    </wps:wsp>
                  </a:graphicData>
                </a:graphic>
                <wp14:sizeRelH relativeFrom="margin">
                  <wp14:pctWidth>100000</wp14:pctWidth>
                </wp14:sizeRelH>
                <wp14:sizeRelV relativeFrom="page">
                  <wp14:pctHeight>0</wp14:pctHeight>
                </wp14:sizeRelV>
              </wp:anchor>
            </w:drawing>
          </mc:Choice>
          <mc:Fallback>
            <w:pict>
              <v:shape w14:anchorId="4E281B94" id="Text Box 68" o:spid="_x0000_s1027" type="#_x0000_t202" style="position:absolute;margin-left:0;margin-top:0;width:453.5pt;height:163pt;z-index:251656704;visibility:visible;mso-wrap-style:square;mso-width-percent:1000;mso-height-percent:0;mso-wrap-distance-left:9pt;mso-wrap-distance-top:0;mso-wrap-distance-right:9pt;mso-wrap-distance-bottom:0;mso-position-horizontal:center;mso-position-horizontal-relative:text;mso-position-vertical:center;mso-position-vertical-relative:margin;mso-width-percent:1000;mso-height-percent:0;mso-width-relative:margin;mso-height-relative:page;v-text-anchor:top" o:gfxdata="UEsDBBQABgAIAAAAIQC2gziS/gAAAOEBAAATAAAAW0NvbnRlbnRfVHlwZXNdLnhtbJSRQU7DMBBF&#10;90jcwfIWJU67QAgl6YK0S0CoHGBkTxKLZGx5TGhvj5O2G0SRWNoz/78nu9wcxkFMGNg6quQqL6RA&#10;0s5Y6ir5vt9lD1JwBDIwOMJKHpHlpr69KfdHjyxSmriSfYz+USnWPY7AufNIadK6MEJMx9ApD/oD&#10;OlTrorhX2lFEilmcO2RdNtjC5xDF9pCuTyYBB5bi6bQ4syoJ3g9WQ0ymaiLzg5KdCXlKLjvcW893&#10;SUOqXwnz5DrgnHtJTxOsQfEKIT7DmDSUCaxw7Rqn8787ZsmRM9e2VmPeBN4uqYvTtW7jvijg9N/y&#10;JsXecLq0q+WD6m8AAAD//wMAUEsDBBQABgAIAAAAIQA4/SH/1gAAAJQBAAALAAAAX3JlbHMvLnJl&#10;bHOkkMFqwzAMhu+DvYPRfXGawxijTi+j0GvpHsDYimMaW0Yy2fr2M4PBMnrbUb/Q94l/f/hMi1qR&#10;JVI2sOt6UJgd+ZiDgffL8ekFlFSbvV0oo4EbChzGx4f9GRdb25HMsYhqlCwG5lrLq9biZkxWOiqY&#10;22YiTra2kYMu1l1tQD30/bPm3wwYN0x18gb45AdQl1tp5j/sFB2T0FQ7R0nTNEV3j6o9feQzro1i&#10;OWA14Fm+Q8a1a8+Bvu/d/dMb2JY5uiPbhG/ktn4cqGU/er3pcvwCAAD//wMAUEsDBBQABgAIAAAA&#10;IQAv1gAq9AEAANsDAAAOAAAAZHJzL2Uyb0RvYy54bWysU9tu2zAMfR+wfxD0vtjJmrQz4hRdiw4D&#10;ugvQ7gNoWY6F2aJGKbGzrx8lp2m2vQ3zgyBedHgOSa+vx74Te03eoC3lfJZLoa3C2thtKb893b+5&#10;ksIHsDV0aHUpD9rL683rV+vBFXqBLXa1JsEg1heDK2UbgiuyzKtW9+Bn6LTlYIPUQ2CTtllNMDB6&#10;32WLPF9lA1LtCJX2nr13U1BuEn7TaBW+NI3XQXSlZG4hnZTOKp7ZZg3FlsC1Rh1pwD+w6MFYLnqC&#10;uoMAYkfmL6jeKEKPTZgp7DNsGqN00sBq5vkfah5bcDpp4eZ4d2qT/3+w6vP+KwlTl/KC22Oh5xk9&#10;6TGI9ziK1VXsz+B8wWmPjhPDyH6ec9Lq3QOq715YvG3BbvUNEQ6thpr5zePL7OzphOMjSDV8wprr&#10;wC5gAhob6mPzuB2C0ZnI4TSbyEWxc3m5fHex5JDi2CK/zPlLNaB4fu7Ihw8aexEvpSQefoKH/YMP&#10;kQ4UzymxmsV703VpATr7m4MTJw8XPz6NSiL5SUYYqzE1LcmMsQrrA0sjnDaM/wi+tEg/pRh4u0rp&#10;f+yAtBTdR8vtebuK9EU4N+jcqM4NsIqhShmkmK63YVrhnSOzbbnSNBCLN9zSxiSxL6yOg+ANSj04&#10;bntc0XM7Zb38k5tfAAAA//8DAFBLAwQUAAYACAAAACEAj1qeKt0AAAAFAQAADwAAAGRycy9kb3du&#10;cmV2LnhtbEyPQUvDQBCF74L/YRnBm91tC7HGbIpYxYugrYL2ts1Ok2B2Nslum/jvHb3Yy4PHG977&#10;JluOrhFH7EPtScN0okAgFd7WVGp4f3u8WoAI0ZA1jSfU8I0Blvn5WWZS6wda43ETS8ElFFKjoYqx&#10;TaUMRYXOhIlvkTjb+96ZyLYvpe3NwOWukTOlEulMTbxQmRbvKyy+NgenYfVE6vNl6Lar7fPrA84X&#10;SfdRdFpfXox3tyAijvH/GH7xGR1yZtr5A9kgGg38SPxTzm7UNdudhvksUSDzTJ7S5z8AAAD//wMA&#10;UEsBAi0AFAAGAAgAAAAhALaDOJL+AAAA4QEAABMAAAAAAAAAAAAAAAAAAAAAAFtDb250ZW50X1R5&#10;cGVzXS54bWxQSwECLQAUAAYACAAAACEAOP0h/9YAAACUAQAACwAAAAAAAAAAAAAAAAAvAQAAX3Jl&#10;bHMvLnJlbHNQSwECLQAUAAYACAAAACEAL9YAKvQBAADbAwAADgAAAAAAAAAAAAAAAAAuAgAAZHJz&#10;L2Uyb0RvYy54bWxQSwECLQAUAAYACAAAACEAj1qeKt0AAAAFAQAADwAAAAAAAAAAAAAAAABOBAAA&#10;ZHJzL2Rvd25yZXYueG1sUEsFBgAAAAAEAAQA8wAAAFgFAAAAAA==&#10;" o:allowoverlap="f" filled="f" stroked="f">
                <v:textbox inset="1mm,1mm,1mm,1mm">
                  <w:txbxContent>
                    <w:p w:rsidR="005D45CD" w:rsidRPr="00255E3F" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                      <w:pPr>
                        <w:jc w:val="center"/>
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="36"/>
                          <w:szCs w:val="36"/>
                        </w:rPr>
                      </w:pPr>
                      <w:r w:rsidRPr="00255E3F">
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="44"/>
                          <w:szCs w:val="44"/>
                        </w:rPr>
                        <w:t>U</w:t>
                      </w:r>
                      <w:bookmarkStart w:id="1" w:name="_Ref219403712"/>
                      <w:bookmarkEnd w:id="1"/>
                      <w:r>
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="44"/>
                          <w:szCs w:val="44"/>
                        </w:rPr>
                        <w:t xml:space="preserve">sing Microsoft Word </w:t>
                      </w:r>
                      <w:r w:rsidRPr="00255E3F">
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="44"/>
                          <w:szCs w:val="44"/>
                        </w:rPr>
                        <w:t>200</w:t>
                      </w:r>
                      <w:r>
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="44"/>
                          <w:szCs w:val="44"/>
                        </w:rPr>
                        <w:t>7/2010</w:t>
                      </w:r>
                      <w:r w:rsidRPr="00255E3F">
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="44"/>
                          <w:szCs w:val="44"/>
                        </w:rPr>
                        <w:br/>
                      </w:r>
                      <w:r w:rsidRPr="00255E3F">
                        <w:rPr>
                          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
                          <w:sz w:val="32"/>
                          <w:szCs w:val="32"/>
                        </w:rPr>
                        <w:t>for Writing Technical Documents</w:t>
                      </w:r>
                    </w:p>
                    <w:p w:rsidR="005D45CD" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                      <w:pPr>
                        <w:jc w:val="center"/>
                      </w:pPr>
                      <w:r w:rsidRPr="00255E3F">
                        <w:t>Valter Kiisk</w:t>
                      </w:r>
                    </w:p>
                    <w:p w:rsidR="005D45CD" w:rsidRPr="00255E3F" w:rsidRDefault="005D45CD" w:rsidP="007531C5">
                      <w:pPr>
                        <w:jc w:val="center"/>
                      </w:pPr>
                      <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="PlaceType">
                        <w:r>
                          <w:t>Institute</w:t>
                        </w:r>
                      </w:smartTag>
                      <w:r>
                        <w:t xml:space="preserve"> of </w:t>
                      </w:r>
                      <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="PlaceName">
                        <w:r>
                          <w:t>Physics</w:t>
                        </w:r>
                      </w:smartTag>
                      <w:r>
                        <w:t xml:space="preserve">, </w:t>
                      </w:r>
                      <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="place">
                        <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="PlaceType">
                          <w:r>
                            <w:t>University</w:t>
                          </w:r>
                        </w:smartTag>
                        <w:r>
                          <w:t xml:space="preserve"> of </w:t>
                        </w:r>
                        <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="PlaceName">
                          <w:r>
                            <w:t>Tartu</w:t>
                          </w:r>
                        </w:smartTag>
                      </w:smartTag>
                    </w:p>
                  </w:txbxContent>
                </v:textbox>
                <w10:wrap anchory="margin"/>
              </v:shape>
            </w:pict>
          </mc:Fallback>
        </mc:AlternateContent>
      </w:r>
    </w:p>
    <w:p w:rsidR="00B2425D" w:rsidRDefault="00B2425D">
      <w:pPr>
        <w:spacing w:before="0" w:after="0"/>
        <w:jc w:val="left"/>
        <w:rPr>
          <w:rFonts w:ascii="Arial Black" w:hAnsi="Arial Black" w:cs="Arial"/>
          <w:bCs/>
          <w:caps/>
          <w:color w:val="E36C0A" w:themeColor="accent6" w:themeShade="BF"/>
          <w:kern w:val="32"/>
          <w:sz w:val="28"/>
          <w:szCs w:val="32"/>
        </w:rPr>
      </w:pPr>
      <w:bookmarkStart w:id="1" w:name="_Toc219459029"/>
      <w:bookmarkStart w:id="2" w:name="_Toc219459248"/>
      <w:bookmarkEnd w:id="1"/>
      <w:bookmarkEnd w:id="2"/>
    </w:p>
    <w:sectPr w:rsidR="00B2425D" w:rsidSect="00310322">
      <w:headerReference w:type="default" r:id="rId8"/>
      <w:pgSz w:w="11906" w:h="16838" w:code="9"/>
      <w:pgMar w:top="1418" w:right="1418" w:bottom="1418" w:left="1418" w:header="709" w:footer="709" w:gutter="0"/>
      <w:pgNumType w:fmt="upperRoman" w:start="1"/>
      <w:cols w:space="708"/>
      <w:titlePg/>
      <w:docGrid w:linePitch="360"/>
    </w:sectPr>
  </w:body>
</w:document>

MsWord.docx

Pandoc version: 3.1.9

@jgm
Copy link
Owner

jgm commented Nov 24, 2023

You're converting from what to what?

What do you mean by "shape format"?

@StephanMeijer
Copy link
Contributor Author

StephanMeijer commented Nov 24, 2023

From Docx to HTML.

image

Shape Format is some Microsoft Word feature allowing user for freely positioning text, using Word art, positioning images, among others. A feature that probably shouldn't be used.

Currently working on a PR for Pandoc to investigate and if possible fix.

StephanMeijer added a commit to StephanMeijer/pandoc that referenced this issue Nov 24, 2023
@StephanMeijer
Copy link
Contributor Author

This would probably require some extreme measures in src/Text/Pandoc/Readers/Docx/Parse.hs as logic has to be changed: A w:p can also contain more paragraphs, not only runs..

@StephanMeijer
Copy link
Contributor Author

More info can be found in ECMA-376 Part 1

image image

StephanMeijer added a commit to StephanMeijer/pandoc that referenced this issue Nov 28, 2023
StephanMeijer added a commit to StephanMeijer/pandoc that referenced this issue Nov 28, 2023
StephanMeijer added a commit to StephanMeijer/pandoc that referenced this issue Nov 28, 2023
StephanMeijer added a commit to StephanMeijer/pandoc that referenced this issue Nov 28, 2023
jgm pushed a commit that referenced this issue Nov 30, 2023
* #9214 text in shape format test document

* #9214 support Text in Shape Format

* #9214 remove irrelevant code
@jgm
Copy link
Owner

jgm commented Nov 30, 2023

Closed by #9223 - I accidentally hit enter before finishing the description of the squashed commit.

@jgm jgm closed this as completed Nov 30, 2023
@StephanMeijer
Copy link
Contributor Author

@jgm many thanks for merging! I will publish some test-cases and possible fixes on my code for VML-based images probably tomorrow to make sure those are still supported within context of shape format.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants